1
0
mirror of https://github.com/gryf/coach.git synced 2025-12-17 19:20:19 +01:00
Files
coach/benchmarks/dqn
2018-08-19 11:02:45 +03:00
..
2018-08-19 11:02:45 +03:00

DQN

Each experiment uses 3 seeds. The parameters used for DQN are the same parameters as described in the original paper, except for the optimizer (changed to ADAM) and learning rate (1e-4) used.

Breakout DQN - single worker

python3 coach.py -p Atari_DQN -lvl breakout
Breakout DQN

Pong DQN - single worker

python3 coach.py -p Atari_DQN -lvl pong
Pong DQN

Space Invaders DQN - single worker

python3 coach.py -p Atari_DQN -lvl space_invaders
Space Invaders DQN