1
0
mirror of https://github.com/gryf/coach.git synced 2025-12-18 03:30:19 +01:00
This commit is contained in:
Gal Leibovich
2019-06-16 11:11:21 +03:00
committed by GitHub
parent 8df3c46756
commit 7eb884c5b2
107 changed files with 2200 additions and 495 deletions

48
benchmarks/td3/README.md Normal file
View File

@@ -0,0 +1,48 @@
# Twin Delayed DDPG
Each experiment uses 5 seeds and is trained for 1M environment steps.
The parameters used for TD3 are the same parameters as described in the [original paper](https://arxiv.org/pdf/1802.09477.pdf), and [repository](https://github.com/sfujim/TD3).
### Ant TD3 - single worker
```bash
coach -p Mujoco_TD3 -lvl ant
```
<img src="ant.png" alt="Ant TD3" width="800"/>
### Hopper TD3 - single worker
```bash
coach -p Mujoco_TD3 -lvl hopper
```
<img src="hopper.png" alt="Hopper TD3" width="800"/>
### Half Cheetah TD3 - single worker
```bash
coach -p Mujoco_TD3 -lvl half_cheetah
```
<img src="half_cheetah.png" alt="Half Cheetah TD3" width="800"/>
### Reacher TD3 - single worker
```bash
coach -p Mujoco_TD3 -lvl reacher
```
<img src="reacher.png" alt="Reacher TD3" width="800"/>
### Walker2D TD3 - single worker
```bash
coach -p Mujoco_TD3 -lvl walker2d
```
<img src="walker2d.png" alt="Walker2D TD3" width="800"/>

BIN
benchmarks/td3/ant.png Normal file

Binary file not shown.

After

Width:  |  Height:  |  Size: 88 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 72 KiB

BIN
benchmarks/td3/hopper.png Normal file

Binary file not shown.

After

Width:  |  Height:  |  Size: 113 KiB

BIN
benchmarks/td3/reacher.png Normal file

Binary file not shown.

After

Width:  |  Height:  |  Size: 96 KiB

BIN
benchmarks/td3/walker2d.png Normal file

Binary file not shown.

After

Width:  |  Height:  |  Size: 92 KiB