mirror of https://github.com/gryf/coach.git synced 2026-01-01 03:22:32 +01:00

benchmarks and pip package updates

Itai Caspi
2018-08-19 14:23:20 +03:00
parent 23d2945bf8
commit c5165cd7d6
14 changed files with 69 additions and 48 deletions

@@ -3,12 +3,33 @@
Each experiment uses 3 seeds and is trained for 10k environment steps.
The parameters used for Dueling DDQN are the same parameters as described in the [original paper](https://arxiv.org/abs/1706.01502).
### Pong Dueling DDQN - single worker
```bash
coach -p Atari_Dueling_DDQN -lvl pong
```
<img src="pong_dueling_ddqn.png" alt="Pong Dueling DDQN" width="800"/>
### Breakout Dueling DDQN - single worker
```bash
python3 coach.py -p Atari_Dueling_DDQN -lvl breakout
coach -p Atari_Dueling_DDQN -lvl breakout
```
<img src="breakout_dueling_ddqn.png" alt="Breakout Dueling DDQN" width="800"/>
### Space Invaders Dueling DDQN - single worker
```bash
coach -p Atari_Dueling_DDQN -lvl space_invaders
```
<img src="space_invaders_dueling_ddqn.png" alt="Space Invaders Dueling DDQN" width="800"/>

Binary file not shown. Before: 131 KiB. After: 84 KiB.

Binary file not shown. After: 70 KiB.

Binary file not shown. After: 79 KiB.