coach/docs_raw/source/components/exploration_policies/index.rst
Itai Caspi 6d40ad1650 update of api docstrings across coach and tutorials [WIP] (#91)
2018-11-15 15:00:13 +02:00


Exploration policies are components that allow the agent to trade off exploration and exploitation according to a predefined policy. This is one of the most important aspects of reinforcement learning agents, and it can require some tuning to get right. Coach supports several predefined exploration policies and can easily be extended with custom ones. Note that not all exploration policies are expected to work for both discrete and continuous action spaces.
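To make the exploration/exploitation trade-off concrete, here is a minimal, library-independent sketch of an epsilon-greedy rule for a discrete action space. It does not use Coach's classes; the function name ``epsilon_greedy`` and its parameters are hypothetical and chosen only for illustration.

```python
import random


def epsilon_greedy(q_values, epsilon, rng=random):
    """Pick an action index from a list of estimated action values.

    With probability `epsilon` a uniformly random action is taken
    (exploration); otherwise the highest-valued action is taken
    (exploitation). This is an illustrative helper, not Coach's API.
    """
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])


q_values = [0.1, 0.7, 0.2]

# epsilon = 0 never explores, so the best-valued action is always chosen.
greedy_action = epsilon_greedy(q_values, epsilon=0.0)

# epsilon = 1 always explores, so every action shows up over many draws.
rng = random.Random(0)
explored = {epsilon_greedy(q_values, epsilon=1.0, rng=rng) for _ in range(200)}
```

In practice epsilon is usually decayed over training, which is one example of the tuning mentioned above.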

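+--------------------+-----------------------+------------------+
| Exploration Policy | Discrete Action Space | Box Action Space |
+====================+=======================+==================+
| AdditiveNoise      | X                     | V                |
+--------------------+-----------------------+------------------+
| Boltzmann          | V                     | X                |
+--------------------+-----------------------+------------------+
| Bootstrapped       | V                     | X                |
+--------------------+-----------------------+------------------+
| Categorical        | V                     | X                |
+--------------------+-----------------------+------------------+
| ContinuousEntropy  | X                     | V                |
+--------------------+-----------------------+------------------+
| EGreedy            | V                     | V                |
+--------------------+-----------------------+------------------+
| Greedy             | V                     | V                |
+--------------------+-----------------------+------------------+
| OUProcess          | X                     | V                |
+--------------------+-----------------------+------------------+
| ParameterNoise     | V                     | V                |
+--------------------+-----------------------+------------------+
| TruncatedNormal    | X                     | V                |
+--------------------+-----------------------+------------------+
| UCB                | V                     | X                |
+--------------------+-----------------------+------------------+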
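The compatibility table above can also be written down as a small lookup, which is handy when checking a policy against an action space before configuring an agent. This is only a sketch: it reads ``V`` as "supported" and ``X`` as "not supported" (an assumption about the table's notation), and the names ``SUPPORTED_ACTION_SPACES`` and ``policies_for`` are hypothetical helpers, not part of Coach.

```python
# Transcribed from the table above: policy -> (discrete, box).
# Assumption: "V" means supported and "X" means not supported.
SUPPORTED_ACTION_SPACES = {
    "AdditiveNoise": (False, True),
    "Boltzmann": (True, False),
    "Bootstrapped": (True, False),
    "Categorical": (True, False),
    "ContinuousEntropy": (False, True),
    "EGreedy": (True, True),
    "Greedy": (True, True),
    "OUProcess": (False, True),
    "ParameterNoise": (True, True),
    "TruncatedNormal": (False, True),
    "UCB": (True, False),
}


def policies_for(action_space):
    """Return the sorted policy names listed for 'discrete' or 'box'."""
    column = {"discrete": 0, "box": 1}[action_space]
    return sorted(
        name
        for name, support in SUPPORTED_ACTION_SPACES.items()
        if support[column]
    )


discrete_policies = policies_for("discrete")
box_policies = policies_for("box")
```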

ExplorationPolicy

AdditiveNoise

Boltzmann
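As a generic illustration of Boltzmann (softmax) exploration over discrete actions, the sketch below turns action values into sampling probabilities using a temperature. It is not Coach's implementation; the function names and the ``temperature`` parameter are assumptions made for this example.

```python
import math
import random


def boltzmann_probabilities(q_values, temperature=1.0):
    """Softmax over action values; lower temperature is closer to greedy."""
    scaled = [q / temperature for q in q_values]
    highest = max(scaled)
    # Subtract the maximum before exponentiating for numerical stability.
    exps = [math.exp(s - highest) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def boltzmann_sample(q_values, temperature=1.0, rng=random):
    """Draw one action index according to the softmax probabilities."""
    probs = boltzmann_probabilities(q_values, temperature)
    return rng.choices(range(len(q_values)), weights=probs, k=1)[0]


probs = boltzmann_probabilities([1.0, 2.0, 3.0])
cold_probs = boltzmann_probabilities([1.0, 2.0, 3.0], temperature=0.01)
action = boltzmann_sample([1.0, 2.0, 3.0], rng=random.Random(0))
```

A high temperature flattens the distribution toward uniform sampling, while a low one concentrates it on the best-valued action.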

Bootstrapped

Categorical

ContinuousEntropy

EGreedy

Greedy

OUProcess
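For readers unfamiliar with the Ornstein-Uhlenbeck process, here is a minimal standalone sketch of the temporally correlated noise it produces, as commonly added to continuous actions. It is not Coach's ``OUProcess`` class; the class name, parameter names, and default values (``mu``, ``theta``, ``sigma``, ``dt``) are assumptions for illustration.

```python
import math
import random


class OrnsteinUhlenbeckNoise:
    """Discretized OU process: x += theta * (mu - x) * dt + sigma * sqrt(dt) * N(0, 1)."""

    def __init__(self, size, mu=0.0, theta=0.15, sigma=0.2, dt=1.0, seed=None):
        self.size = size
        self.mu = mu
        self.theta = theta
        self.sigma = sigma
        self.dt = dt
        self.rng = random.Random(seed)
        self.reset()

    def reset(self):
        """Restart the process at its long-run mean."""
        self.state = [self.mu] * self.size

    def sample(self):
        """Advance one step and return a copy of the noise vector."""
        self.state = [
            x
            + self.theta * (self.mu - x) * self.dt
            + self.sigma * math.sqrt(self.dt) * self.rng.gauss(0.0, 1.0)
            for x in self.state
        ]
        return list(self.state)


noise = OrnsteinUhlenbeckNoise(size=2, seed=0)
first = noise.sample()

# With sigma = 0 the process simply decays toward mu.
decay = OrnsteinUhlenbeckNoise(size=1, theta=0.5, sigma=0.0)
decay.state = [1.0]
decayed = decay.sample()
```

Because each sample depends on the previous one, consecutive noise values are correlated, unlike independent Gaussian noise.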

ParameterNoise

TruncatedNormal

UCB