mirror of
https://github.com/gryf/coach.git
synced 2025-12-17 19:20:19 +01:00
RL in Large Discrete Action Spaces - Wolpertinger Agent (#394)
* Currently this is specific to the case of discretizing a continuous action space. Can easily be adapted to other case by feeding the kNN otherwise, and removing the usage of a discretizing output action filter
This commit is contained in:
@@ -202,6 +202,7 @@
|
||||
<li><a href="rl_coach/agents/soft_actor_critic_agent.html">rl_coach.agents.soft_actor_critic_agent</a></li>
|
||||
<li><a href="rl_coach/agents/td3_agent.html">rl_coach.agents.td3_agent</a></li>
|
||||
<li><a href="rl_coach/agents/value_optimization_agent.html">rl_coach.agents.value_optimization_agent</a></li>
|
||||
<li><a href="rl_coach/agents/wolpertinger_agent.html">rl_coach.agents.wolpertinger_agent</a></li>
|
||||
<li><a href="rl_coach/architectures/architecture.html">rl_coach.architectures.architecture</a></li>
|
||||
<li><a href="rl_coach/architectures/network_wrapper.html">rl_coach.architectures.network_wrapper</a></li>
|
||||
<li><a href="rl_coach/base_parameters.html">rl_coach.base_parameters</a></li>
|
||||
|
||||
Reference in New Issue
Block a user