coach/rl_coach/agents/value_optimization_agent.py at 6e08c55ad5cb7a61fa997973d5d8f9cee2f9f864

gryf/coach

mirror of https://github.com/gryf/coach.git synced 2025-12-17 19:20:19 +01:00


Gal Leibovich 6e08c55ad5 Enabling-more-agents-for-Batch-RL-and-cleanup (#258)

Allow the last training batch drawn to be smaller than batch_size; add support for more agents in BatchRL by adding softmax with temperature to the corresponding heads; add a CartPole_QR_DQN preset with a golden test; cleanups.

2019-03-21 16:10:29 +02:00

7.9 KiB
