mirror of https://github.com/gryf/coach.git synced 2025-12-17 11:10:20 +01:00

Quantile Regression DQN

Each experiment uses 3 seeds and is trained for 10k environment steps. The QR-DQN parameters are the same as those described in the original paper.

Breakout QR-DQN - single worker

python3 coach.py -p Atari_QR_DQN -lvl breakout
[Figure: Breakout QR-DQN]

Pong QR-DQN - single worker

python3 coach.py -p Atari_QR_DQN -lvl pong
[Figure: Pong QR-DQN]
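
Since each experiment uses 3 seeds, the two commands above would be repeated once per seed. The following is a minimal sketch of how those runs could be enumerated. It only prints the command lines and does not launch training. The `--seed` flag and the seed values 0, 1, 2 are assumptions that do not appear in this README, so check `python3 coach.py --help` before relying on them. `build_command` is a hypothetical helper, not part of Coach.

```python
# Levels and preset are taken from the commands in this README.
PRESET = "Atari_QR_DQN"
LEVELS = ["breakout", "pong"]

# Assumption: the README says 3 seeds but does not list their values.
SEEDS = [0, 1, 2]


def build_command(level, seed=None):
    """Return the coach.py command line for one run as a list of arguments."""
    cmd = ["python3", "coach.py", "-p", PRESET, "-lvl", level]
    if seed is not None:
        # Assumption: a --seed flag is available in your Coach version.
        cmd += ["--seed", str(seed)]
    return cmd


if __name__ == "__main__":
    # Dry run: print one command per (level, seed) pair.
    for level in LEVELS:
        for seed in SEEDS:
            print(" ".join(build_command(level, seed)))
```

Calling `build_command(level)` without a seed reproduces the single-worker commands exactly as written in this README.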