coach/docs_raw/source/components/exploration_policies/index.rst at f12857a8c78ea6ddd315126dd2f42e1e03c72c71 - coach - code

gryf/coach

mirror of https://github.com/gryf/coach.git synced 2025-12-17 19:20:19 +01:00

Files

Gal Leibovich f12857a8c7 Docs changes - fixing blogpost links, removing importing all exploration policies (#139 )

* updated docs

* removing imports for all exploration policies in __init__ + setting the right blog-post link

* small cleanups

2018-12-05 16:16:16 -05:00

3.3 KiB

Raw Blame History

Exploration policies are a component that allow the agent to tradeoff exploration and exploitation according to a predefined policy. This is one of the most important aspects of reinforcement learning agents, and can require some tuning to get it right. Coach supports several pre-defined exploration policies, and it can be easily extended with custom policies. Note that not all exploration policies are expected to work for both discrete and continuous action spaces.

Exploration Policy	Discrete Action Space	Box Action Space
AdditiveNoise	X	V
Boltzmann	V	X
Bootstrapped	V	X
Categorical	V	X
ContinuousEntropy	X	V
EGreedy	V	V
Greedy	V	V
OUProcess	X	V
ParameterNoise	V	V
TruncatedNormal	X	V
UCB	V	X

ExplorationPolicy

System Message: ERROR/3 (<string>, line 41)

Unknown directive type "autoclass".

.. autoclass:: rl_coach.exploration_policies.exploration_policy.ExplorationPolicy
   :members:
   :inherited-members:

AdditiveNoise

System Message: ERROR/3 (<string>, line 47)

Unknown directive type "autoclass".

.. autoclass:: rl_coach.exploration_policies.additive_noise.AdditiveNoise

Boltzmann

System Message: ERROR/3 (<string>, line 51)

Unknown directive type "autoclass".

.. autoclass:: rl_coach.exploration_policies.boltzmann.Boltzmann

Bootstrapped

System Message: ERROR/3 (<string>, line 55)

Unknown directive type "autoclass".

.. autoclass:: rl_coach.exploration_policies.bootstrapped.Bootstrapped

Categorical

System Message: ERROR/3 (<string>, line 59)

Unknown directive type "autoclass".

.. autoclass:: rl_coach.exploration_policies.categorical.Categorical

ContinuousEntropy

System Message: ERROR/3 (<string>, line 63)

Unknown directive type "autoclass".

.. autoclass:: rl_coach.exploration_policies.continuous_entropy.ContinuousEntropy

EGreedy

System Message: ERROR/3 (<string>, line 67)

Unknown directive type "autoclass".

.. autoclass:: rl_coach.exploration_policies.e_greedy.EGreedy

Greedy

System Message: ERROR/3 (<string>, line 71)

Unknown directive type "autoclass".

.. autoclass:: rl_coach.exploration_policies.greedy.Greedy

OUProcess

System Message: ERROR/3 (<string>, line 75)

Unknown directive type "autoclass".

.. autoclass:: rl_coach.exploration_policies.ou_process.OUProcess

ParameterNoise

System Message: ERROR/3 (<string>, line 79)

Unknown directive type "autoclass".

.. autoclass:: rl_coach.exploration_policies.parameter_noise.ParameterNoise

TruncatedNormal

System Message: ERROR/3 (<string>, line 83)

Unknown directive type "autoclass".

.. autoclass:: rl_coach.exploration_policies.truncated_normal.TruncatedNormal

UCB

System Message: ERROR/3 (<string>, line 87)

Unknown directive type "autoclass".

.. autoclass:: rl_coach.exploration_policies.ucb.UCB