1
0
mirror of https://github.com/gryf/coach.git synced 2025-12-17 11:10:20 +01:00
Files
coach/benchmarks/dueling_ddqn/README.md
2018-08-13 17:11:34 +03:00

15 lines
411 B
Markdown

# Dueling DDQN
Each experiment uses 3 seeds and is trained for 10k environment steps.
The parameters used for Dueling DDQN are the same parameters as described in the [original paper](https://arxiv.org/abs/1706.01502).
### Breakout Dueling DDQN - single worker
```bash
python3 coach.py -p Atari_Dueling_DDQN -lvl breakout
```
<img src="breakout_dueling_ddqn.png" alt="Breakout Dueling DDQN" width="800"/>