Mirror of https://github.com/gryf/coach.git, synced 2025-12-17 11:10:20 +01:00
Updating PPO references per issue #11
@@ -2,7 +2,7 @@

 **Actions space:** Discrete|Continuous

-**References:** [Emergence of Locomotion Behaviours in Rich Environments](https://arxiv.org/pdf/1707.02286.pdf)
+**References:** [Proximal Policy Optimization Algorithms](https://arxiv.org/pdf/1707.06347.pdf)

 ## Network Structure