rl_define_policy.Rd
Define the Action-Selection Policy for an RL Agent
rl_define_policy(policy, ...)
policy: The policy the agent uses to select an action. Currently supported are softmax ("softmax"), greedy ("greedy"), and epsilon-greedy ("epsilonGreedy").
...: Additional arguments passed to or from specific methods, such as tau when policy = "softmax" and epsilon when policy = "epsilonGreedy".
The agent object with modified policy structure.
# Example arguments include:
# (policy = "softmax", tau = 7)
# (policy = "epsilonGreedy", epsilon = 0.4)
# (policy = "greedy")
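
For readers unfamiliar with the three policies, the base-R sketch below shows one common way each selection rule is defined. It is not this package's implementation: the helper names (softmax_probs, greedy_action, epsilon_greedy_action) and the action values in q are hypothetical. It also assumes that tau acts as a temperature dividing the action values, which the package may define differently (for example as an inverse temperature).

```r
# Illustrative sketch only: these helpers are hypothetical and are not
# functions exported by the package documented here.

# Softmax: convert action values into choice probabilities.
# ASSUMPTION: tau is a temperature that divides the values; the package
# may instead treat tau as an inverse temperature.
softmax_probs <- function(q, tau) {
  z <- q / tau
  z <- z - max(z)  # subtract the maximum for numerical stability
  exp(z) / sum(exp(z))
}

# Greedy: always pick the highest-valued action (the first one on ties).
greedy_action <- function(q) {
  which.max(q)
}

# Epsilon-greedy: with probability epsilon pick uniformly at random,
# otherwise act greedily.
epsilon_greedy_action <- function(q, epsilon) {
  if (runif(1) < epsilon) {
    sample.int(length(q), 1)
  } else {
    which.max(q)
  }
}

q <- c(1, 2, 4)                       # hypothetical action values

p <- softmax_probs(q, tau = 7)        # choice probabilities, summing to 1
softmax_choice <- sample.int(length(q), 1, prob = p)

greedy_choice <- greedy_action(q)     # index of the largest value

set.seed(1)                           # make the random draw reproducible
eg_choice <- epsilon_greedy_action(q, epsilon = 0.4)
```

Under this temperature assumption, a larger tau flattens the softmax probabilities toward uniform choice. In the epsilon-greedy rule, a larger epsilon increases random exploration, and epsilon = 0 reduces to the greedy rule.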