mirror of https://github.com/gryf/coach.git synced 2025-12-17 11:10:20 +01:00

pre-release 0.10.0

Gal Novik
2018-08-13 17:11:34 +03:00
parent d44c329bb8
commit 19ca5c24b1
485 changed files with 33292 additions and 16770 deletions


@@ -0,0 +1,40 @@
# DDPG with Hindsight Experience Replay
Each experiment uses 3 seeds.
The DDPG HER parameters are the same as those described in [this paper](https://arxiv.org/abs/1802.09464).
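
Since each experiment uses 3 seeds, a small helper that prints one command per seed may be convenient. The sketch below is a dry run only: it echoes the commands rather than launching training. The seed values `0 1 2`, the `seed_commands` helper name, and the `--seed` option are assumptions not stated in this README; check `python3 coach.py -h` before relying on them.

```bash
#!/usr/bin/env bash
# Dry-run sketch: print one benchmark command per seed.
# Assumptions (not from this README): seeds 0, 1, 2 and a --seed option on coach.py.
seed_commands() {
    local level="$1" workers="${2:-}"
    local extra=""
    # Only add -n when a worker count is given (the reach benchmark uses one worker).
    if [ -n "$workers" ]; then extra=" -n ${workers}"; fi
    local seed
    for seed in 0 1 2; do
        echo "python3 coach.py -p Fetch_DDPG_HER_baselines -lvl ${level}${extra} --seed ${seed}"
    done
}

# Example: print the three single-worker reach commands.
seed_commands reach
```

To print the 8-worker variants, pass the worker count as the second argument, for example `seed_commands push 8`.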
### Fetch Reach DDPG HER - single worker
```bash
python3 coach.py -p Fetch_DDPG_HER_baselines -lvl reach
```
<img src="fetch_ddpg_her_reach_1_worker.png" alt="Fetch DDPG HER Reach 1 Worker" width="800"/>
### Fetch Push DDPG HER - 8 workers
```bash
python3 coach.py -p Fetch_DDPG_HER_baselines -lvl push -n 8
```
<img src="fetch_ddpg_her_push_8_workers.png" alt="Fetch DDPG HER Push 8 Workers" width="800"/>
### Fetch Slide DDPG HER - 8 workers
```bash
python3 coach.py -p Fetch_DDPG_HER_baselines -lvl slide -n 8
```
<img src="fetch_ddpg_her_slide_8_workers.png" alt="Fetch DDPG HER Slide 8 Workers" width="800"/>
### Fetch Pick And Place DDPG HER - 8 workers
```bash
python3 coach.py -p Fetch_DDPG_HER_baselines -lvl pick_and_place -n 8
```
<img src="fetch_ddpg_her_pick_and_place_8_workers.png" alt="Fetch DDPG HER Pick And Place 8 Workers" width="800"/>