1
0
mirror of https://github.com/gryf/coach.git synced 2025-12-18 11:40:18 +01:00

pre-release 0.10.0

This commit is contained in:
Gal Novik
2018-08-13 17:11:34 +03:00
parent d44c329bb8
commit 19ca5c24b1
485 changed files with 33292 additions and 16770 deletions

View File

@@ -0,0 +1,31 @@
# Dueling DDQN with Prioritized Experience Replay
Each experiment uses 3 seeds and is trained for 10k environment steps.
The parameters used for Dueling DDQN with PER are the same parameters as described in the [following paper](https://arxiv.org/abs/1511.05952).
### Breakout Dueling DDQN with PER - single worker
```bash
python3 coach.py -p Atari_Dueling_DDQN_with_PER_OpenAI -lvl breakout
```
<img src="breakout_dueling_ddqn_with_per.png" alt="Breakout Dueling DDQN with PER" width="800"/>
### Pong Dueling DDQN with PER - single worker
```bash
python3 coach.py -p Atari_Dueling_DDQN_with_PER_OpenAI -lvl pong
```
<img src="pong_dueling_ddqn_with_per.png" alt="Pong Dueling DDQN with PER" width="800"/>
### Space Invaders Dueling DDQN with PER - single worker
```bash
python3 coach.py -p Atari_Dueling_DDQN_with_PER_OpenAI -lvl space_invaders
```
<img src="space_invaders_dueling_ddqn_with_per.png" alt="Space Invaders Dueling DDQN with PER" width="800"/>