Exploration policies are components that let the agent trade off exploration against exploitation according to a
predefined policy. This trade-off is one of the most important aspects of a reinforcement learning agent, and it can
require some tuning to get right. Coach supports several predefined exploration policies and can easily be extended
with custom ones. Note that not all exploration policies are expected to work for both discrete and continuous action
spaces. In the table below, V marks a supported action space and X an unsupported one.
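
================== ===================== ================
Exploration Policy Discrete Action Space Box Action Space
================== ===================== ================
AdditiveNoise      X                     V
Boltzmann          V                     X
Bootstrapped       V                     X
Categorical        V                     X
ContinuousEntropy  X                     V
EGreedy            V                     V
Greedy             V                     V
OUProcess          X                     V
ParameterNoise     V                     V
TruncatedNormal    X                     V
UCB                V                     X
================== ===================== ================
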
ExplorationPolicy
.. autoclass:: rl_coach.exploration_policies.exploration_policy.ExplorationPolicy
   :members:
   :inherited-members:
AdditiveNoise
.. autoclass:: rl_coach.exploration_policies.additive_noise.AdditiveNoise
Boltzmann
.. autoclass:: rl_coach.exploration_policies.boltzmann.Boltzmann
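
As an illustration of the general idea behind Boltzmann exploration, the sketch below samples a discrete action from a
softmax over action values. It is a minimal standalone example and not Coach's implementation: the function names
``boltzmann_probabilities`` and ``boltzmann_sample`` and the ``temperature`` parameter are assumptions made for this
sketch.

```python
import math
import random


def boltzmann_probabilities(q_values, temperature):
    """Softmax over action values (hypothetical helper, not part of Coach)."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [q / temperature for q in q_values]
    # Subtract the maximum before exponentiating for numerical stability.
    top = max(scaled)
    exps = [math.exp(s - top) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def boltzmann_sample(q_values, temperature, rng=random):
    """Draw one action index according to the softmax probabilities."""
    probs = boltzmann_probabilities(q_values, temperature)
    return rng.choices(range(len(q_values)), weights=probs, k=1)[0]
```

A high temperature flattens the distribution toward uniform sampling (more exploration), while a low temperature
concentrates probability on the highest-valued action (more exploitation).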
Bootstrapped
.. autoclass:: rl_coach.exploration_policies.bootstrapped.Bootstrapped
Categorical
.. autoclass:: rl_coach.exploration_policies.categorical.Categorical
ContinuousEntropy
.. autoclass:: rl_coach.exploration_policies.continuous_entropy.ContinuousEntropy
EGreedy
.. autoclass:: rl_coach.exploration_policies.e_greedy.EGreedy
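
To make the exploration/exploitation trade-off concrete, here is a minimal sketch of epsilon-greedy action selection
for a discrete action space. It is not Coach's implementation: ``epsilon_greedy``, ``linear_epsilon``, and their
parameters (``start``, ``end``, ``decay_steps``) are hypothetical names chosen for this example.

```python
import random


def epsilon_greedy(q_values, epsilon, rng=random):
    """With probability epsilon pick a uniformly random action, else the greedy one.

    Hypothetical helper for illustration only; not part of Coach.
    """
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])


def linear_epsilon(step, start=1.0, end=0.05, decay_steps=10_000):
    """Linearly anneal epsilon from start to end over decay_steps steps."""
    frac = min(step / decay_steps, 1.0)
    return start + frac * (end - start)
```

Annealing epsilon over time is a common design choice: the agent explores heavily early on and relies more on its
learned values later. The schedule values above are placeholders and would need tuning per task.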
Greedy
.. autoclass:: rl_coach.exploration_policies.greedy.Greedy
OUProcess
.. autoclass:: rl_coach.exploration_policies.ou_process.OUProcess
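
The following sketch shows how an Ornstein-Uhlenbeck process can generate temporally correlated noise for continuous
actions. It is a generic discretization of the process, not Coach's code: the class name ``OUNoise``, the helper
``noisy_action``, and the parameters ``mu``, ``theta``, ``sigma``, and ``dt`` are assumptions made for this example.

```python
import random


class OUNoise:
    """Discretized Ornstein-Uhlenbeck noise (hypothetical class, not part of Coach)."""

    def __init__(self, size, mu=0.0, theta=0.15, sigma=0.2, dt=1.0, seed=None):
        self.mu = mu
        self.theta = theta
        self.sigma = sigma
        self.dt = dt
        self.rng = random.Random(seed)
        self.state = [mu] * size

    def reset(self):
        """Return the noise state to its mean, e.g. at the start of an episode."""
        self.state = [self.mu] * len(self.state)

    def sample(self):
        """Advance the process one step: drift toward mu plus a Gaussian kick."""
        scale = self.sigma * self.dt ** 0.5
        self.state = [
            x + self.theta * (self.mu - x) * self.dt + scale * self.rng.gauss(0.0, 1.0)
            for x in self.state
        ]
        return list(self.state)


def noisy_action(action, noise, low, high):
    """Add the next noise sample to an action and clip it to [low, high]."""
    return [min(max(a + n, low), high) for a, n in zip(action, noise.sample())]
```

Because each sample depends on the previous state, consecutive perturbations are correlated, which differs from
adding independent Gaussian noise at every step.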
ParameterNoise
.. autoclass:: rl_coach.exploration_policies.parameter_noise.ParameterNoise
TruncatedNormal
.. autoclass:: rl_coach.exploration_policies.truncated_normal.TruncatedNormal
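
One way to picture truncated-normal exploration for a bounded continuous action is to draw from a normal distribution
centered on the agent's action and keep only draws that fall inside the bounds. The sketch below uses simple rejection
sampling for this. It is an assumption-laden illustration rather than Coach's implementation: the function
``truncated_normal_action`` and its parameters, including ``max_tries``, are hypothetical.

```python
import random


def truncated_normal_action(mean, std, low, high, rng=random, max_tries=1000):
    """Sample from N(mean, std) restricted to [low, high] by rejection.

    Hypothetical helper for illustration only; not part of Coach.
    """
    if low > high:
        raise ValueError("low must not exceed high")
    for _ in range(max_tries):
        candidate = rng.gauss(mean, std)
        if low <= candidate <= high:
            return candidate
    # Fallback if every draw was rejected: clip the mean into the bounds.
    return min(max(mean, low), high)
```

Unlike clipping a noisy action, resampling avoids piling probability mass onto the boundaries of the action space.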
UCB
.. autoclass:: rl_coach.exploration_policies.ucb.UCB