1
0
mirror of https://github.com/gryf/coach.git synced 2025-12-18 03:30:19 +01:00
Files
coach/benchmarks/dueling_ddqn/README.md
2018-08-19 14:23:20 +03:00

769 B

Dueling DDQN

Each experiment uses 3 seeds and is trained for 10k environment steps. The parameters used for Dueling DDQN are the same parameters as described in the original paper.

Pong Dueling DDQN - single worker

coach -p Atari_Dueling_DDQN -lvl pong
Pong Dueling DDQN

Breakout Dueling DDQN - single worker

coach -p Atari_Dueling_DDQN -lvl breakout
Breakout Dueling DDQN

Space Invaders Dueling DDQN - single worker

coach -p Atari_Dueling_DDQN -lvl space_invaders
Space Invaders Dueling DDQN