coach/docs_raw/source/components/exploration_policies/index.rst at 67eb9e4c28098d93ac122d65833c20b22b7e86c7 - coach - code

gryf/coach

mirror of https://github.com/gryf/coach.git synced 2025-12-18 19:50:17 +01:00

Files

Itai Caspi 6d40ad1650 update of api docstrings across coach and tutorials [WIP] (#91 )

* updating the documentation website
* adding the built docs
* update of api docstrings across coach and tutorials 0-2
* added some missing api documentation
* New Sphinx based documentation

2018-11-15 15:00:13 +02:00

3.2 KiB

Raw Blame History

Exploration policies are a component that allow the agent to tradeoff exploration and exploitation according to a predefined policy. This is one of the most important aspects of reinforcement learning agents, and can require some tuning to get it right. Coach supports several pre-defined exploration policies, and it can be easily extended with custom policies. Note that not all exploration policies are expected to work for both discrete and continuous action spaces.

Exploration Policy	Discrete Action Space	Box Action Space
AdditiveNoise	X	V
Boltzmann	V	X
Bootstrapped	V	X
Categorical	V	X
ContinuousEntropy	X	V
EGreedy	V	V
Greedy	V	V
OUProcess	X	V
ParameterNoise	V	V
TruncatedNormal	X	V
UCB	V	X

ExplorationPolicy

System Message: ERROR/3 (<string>, line 41)

Unknown directive type "autoclass".

.. autoclass:: rl_coach.exploration_policies.ExplorationPolicy
   :members:
   :inherited-members:

AdditiveNoise

System Message: ERROR/3 (<string>, line 47)

Unknown directive type "autoclass".

.. autoclass:: rl_coach.exploration_policies.AdditiveNoise

Boltzmann

System Message: ERROR/3 (<string>, line 51)

Unknown directive type "autoclass".

.. autoclass:: rl_coach.exploration_policies.Boltzmann

Bootstrapped

System Message: ERROR/3 (<string>, line 55)

Unknown directive type "autoclass".

.. autoclass:: rl_coach.exploration_policies.Bootstrapped

Categorical

System Message: ERROR/3 (<string>, line 59)

Unknown directive type "autoclass".

.. autoclass:: rl_coach.exploration_policies.Categorical

ContinuousEntropy

System Message: ERROR/3 (<string>, line 63)

Unknown directive type "autoclass".

.. autoclass:: rl_coach.exploration_policies.ContinuousEntropy

EGreedy

System Message: ERROR/3 (<string>, line 67)

Unknown directive type "autoclass".

.. autoclass:: rl_coach.exploration_policies.EGreedy

Greedy

System Message: ERROR/3 (<string>, line 71)

Unknown directive type "autoclass".

.. autoclass:: rl_coach.exploration_policies.Greedy

OUProcess

System Message: ERROR/3 (<string>, line 75)

Unknown directive type "autoclass".

.. autoclass:: rl_coach.exploration_policies.OUProcess

ParameterNoise

System Message: ERROR/3 (<string>, line 79)

Unknown directive type "autoclass".

.. autoclass:: rl_coach.exploration_policies.ParameterNoise

TruncatedNormal

System Message: ERROR/3 (<string>, line 83)

Unknown directive type "autoclass".

.. autoclass:: rl_coach.exploration_policies.TruncatedNormal

UCB

System Message: ERROR/3 (<string>, line 87)

Unknown directive type "autoclass".

.. autoclass:: rl_coach.exploration_policies.UCB