1
0
mirror of https://github.com/gryf/coach.git synced 2026-03-01 14:15:46 +01:00

pre-release 0.10.0

This commit is contained in:
Gal Novik
2018-08-13 17:11:34 +03:00
parent d44c329bb8
commit 19ca5c24b1
485 changed files with 33292 additions and 16770 deletions

View File

@@ -0,0 +1,21 @@
# Quantile Regression DQN
Each experiment uses 3 seeds and is trained for 10k environment steps.
The parameters used for QR-DQN are the same parameters as described in the [original paper](https://arxiv.org/abs/1710.10044.pdf).
### Breakout QR-DQN - single worker
```bash
python3 coach.py -p Atari_QR_DQN -lvl breakout
```
<img src="breakout_qr_dqn.png" alt="Breakout QR-DQN" width="800"/>
### Pong QR-DQN - single worker
```bash
python3 coach.py -p Atari_QR_DQN -lvl pong
```
<img src="pong_qr_dqn.png" alt="Pong QR-DQN" width="800"/>

Binary file not shown.

After

Width:  |  Height:  |  Size: 118 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 90 KiB