rl_define_policy.Rd
Define the Action-Selection Policy for an RL Agent
rl_define_policy(policy, ...)
The policy under which the agent selects actions. Currently supported values are "softmax", "greedy", and "epsilonGreedy".
Additional arguments passed to or from specific methods, such as tau when policy = "softmax" and epsilon when policy = "epsilonGreedy".
The agent object with modified policy structure.
# Example arguments include:
# (policy = "softmax", tau = 7)
# (policy = "epsilonGreedy", epsilon = 0.4)
# (policy = "greedy")
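
For readers unfamiliar with the three policies, the following is a minimal sketch of how each one might pick an action from a vector of estimated action values. It is not the package's implementation. The helpers `softmax_probs()` and `select_action()` and the argument `q_values` are hypothetical names introduced here for illustration. The sketch also assumes that `tau` is a temperature that divides the action values; the package may instead use it as an inverse temperature, in which case larger values would make choices more deterministic rather than less.

```r
# Hypothetical helpers for illustration only; not part of the documented package.

# Convert action values to choice probabilities.
# Assumption: tau is a temperature (values are divided by tau).
softmax_probs <- function(q_values, tau = 1) {
  scaled <- q_values / tau
  w <- exp(scaled - max(scaled))  # subtract the max for numerical stability
  w / sum(w)
}

# Pick one action index from q_values under the named policy.
select_action <- function(q_values,
                          policy = c("softmax", "greedy", "epsilonGreedy"),
                          tau = 1, epsilon = 0.1) {
  policy <- match.arg(policy)
  n <- length(q_values)
  greedy_action <- which.max(q_values)  # first index of the largest value

  if (policy == "greedy") {
    return(greedy_action)
  }
  if (policy == "epsilonGreedy") {
    # With probability epsilon explore uniformly; otherwise exploit.
    if (runif(1) < epsilon) {
      return(sample.int(n, 1))
    }
    return(greedy_action)
  }
  # softmax: sample an action in proportion to its softmax probability
  sample.int(n, 1, prob = softmax_probs(q_values, tau))
}

q <- c(1, 5, 2)
select_action(q, policy = "greedy")
select_action(q, policy = "epsilonGreedy", epsilon = 0.4)
select_action(q, policy = "softmax", tau = 7)
```

Under the temperature assumption, a large `tau` flattens the softmax probabilities toward uniform, while `epsilon` directly sets how often the epsilon-greedy policy explores at random.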