mirror of
https://github.com/gryf/coach.git
synced 2026-03-01 14:15:46 +01:00
pre-release 0.10.0
benchmarks/qr_dqn/README.md

# Quantile Regression DQN

Each experiment uses 3 seeds and is trained for 10k environment steps.
The QR-DQN parameters are the same as those described in the [original paper](https://arxiv.org/abs/1710.10044.pdf).

### Breakout QR-DQN - single worker

```bash
python3 coach.py -p Atari_QR_DQN -lvl breakout
```

<img src="breakout_qr_dqn.png" alt="Breakout QR-DQN" width="800"/>

### Pong QR-DQN - single worker

```bash
python3 coach.py -p Atari_QR_DQN -lvl pong
```

<img src="pong_qr_dqn.png" alt="Pong QR-DQN" width="800"/>