1
0
mirror of https://github.com/gryf/coach.git synced 2026-02-19 16:25:52 +01:00

benchmarks and pip package updates

This commit is contained in:
Itai Caspi
2018-08-19 14:23:20 +03:00
parent 23d2945bf8
commit c5165cd7d6
14 changed files with 69 additions and 48 deletions

View File

@@ -3,12 +3,33 @@
Each experiment uses 3 seeds and is trained for 10k environment steps.
The parameters used for Dueling DDQN are the same parameters as described in the [original paper](https://arxiv.org/abs/1706.01502).
### Pong Dueling DDQN - single worker
```bash
coach -p Atari_Dueling_DDQN -lvl pong
```
<img src="pong_dueling_ddqn.png" alt="Pong Dueling DDQN" width="800"/>
### Breakout Dueling DDQN - single worker
```bash
python3 coach.py -p Atari_Dueling_DDQN -lvl breakout
coach -p Atari_Dueling_DDQN -lvl breakout
```
<img src="breakout_dueling_ddqn.png" alt="Breakout Dueling DDQN" width="800"/>
### Space Invaders Dueling DDQN - single worker
```bash
coach -p Atari_Dueling_DDQN -lvl space_invaders
```
<img src="space_invaders_dueling_ddqn.png" alt="Space Invaders Dueling DDQN" width="800"/>