mirror of
https://github.com/gryf/coach.git
synced 2025-12-18 11:40:18 +01:00
Docs changes - fixing blogpost links, removing importing all exploration policies (#139)
* updated docs * removing imports for all exploration policies in __init__ + setting the right blog-post link * small cleanups
This commit is contained in:
committed by
Scott Leishman
parent
155b78b995
commit
f12857a8c7
@@ -209,7 +209,7 @@
|
||||
<span class="k">return</span> <span class="s1">'rl_coach.exploration_policies.greedy:Greedy'</span>
|
||||
|
||||
|
||||
<div class="viewcode-block" id="Greedy"><a class="viewcode-back" href="../../../components/exploration_policies/index.html#rl_coach.exploration_policies.Greedy">[docs]</a><span class="k">class</span> <span class="nc">Greedy</span><span class="p">(</span><span class="n">ExplorationPolicy</span><span class="p">):</span>
|
||||
<div class="viewcode-block" id="Greedy"><a class="viewcode-back" href="../../../components/exploration_policies/index.html#rl_coach.exploration_policies.greedy.Greedy">[docs]</a><span class="k">class</span> <span class="nc">Greedy</span><span class="p">(</span><span class="n">ExplorationPolicy</span><span class="p">):</span>
|
||||
<span class="sd">"""</span>
|
||||
<span class="sd"> The Greedy exploration policy is intended for both discrete and continuous action spaces.</span>
|
||||
<span class="sd"> For discrete action spaces, it always selects the action with the maximum value, as given by the agent.</span>
|
||||
|
||||
Reference in New Issue
Block a user