mirror of
https://github.com/gryf/coach.git
synced 2026-04-20 15:11:24 +02:00
pre-release 0.10.0
This commit is contained in:
@@ -0,0 +1,21 @@
|
||||
# Quantile Regression DQN
|
||||
|
||||
Each experiment uses 3 seeds and is trained for 10k environment steps.
|
||||
The parameters used for QR-DQN are the same parameters as described in the [original paper](https://arxiv.org/abs/1710.10044.pdf).
|
||||
|
||||
### Breakout QR-DQN - single worker
|
||||
|
||||
```bash
|
||||
python3 coach.py -p Atari_QR_DQN -lvl breakout
|
||||
```
|
||||
|
||||
<img src="breakout_qr_dqn.png" alt="Breakout QR-DQN" width="800"/>
|
||||
|
||||
|
||||
### Pong QR-DQN - single worker
|
||||
|
||||
```bash
|
||||
python3 coach.py -p Atari_QR_DQN -lvl pong
|
||||
```
|
||||
|
||||
<img src="pong_qr_dqn.png" alt="Pong QR-DQN" width="800"/>
|
||||
Binary file not shown.
|
After Width: | Height: | Size: 118 KiB |
Binary file not shown.
|
After Width: | Height: | Size: 90 KiB |
Reference in New Issue
Block a user