Gal Leibovich
|
a112ee69f6
|
Save filters' internal state (#127)
* save filters internal state
* moving the restore to be made from within NumpyRunningStats
|
2018-11-20 17:21:48 +02:00 |
|
Gal Leibovich
|
ce85c8e8c3
|
Removing Egreedy from CartPole_ClippedPPO. ClippedPPO's default exploration policy is to be used instead. (#115)
|
2018-11-18 16:36:34 +02:00 |
|
Gal Leibovich
|
6caf721d1c
|
Numpy shared running stats (#97)
|
2018-11-18 14:46:40 +02:00 |
|
Balaji Subramaniam
|
a849c17e46
|
Enable distributed SharedRunningStats (#81)
- Use Redis pub/sub for updating SharedRunningStats.
|
2018-11-13 19:17:38 +02:00 |
|
Ajay Deshpande
|
875d6ef017
|
Adding target reward and target sucess (#58)
* Adding target reward
* Adding target successs
* Addressing comments
* Using custom_reward_threshold and target_success_rate
* Adding exit message
* Moving success rate to environment
* Making target_success_rate optional
|
2018-11-12 15:03:43 -08:00 |
|
Gal Leibovich
|
49dea39d34
|
N-step returns for rainbow (#67)
* n_step returns for rainbow
* Rename CartPole_PPO -> CartPole_ClippedPPO
|
2018-11-07 18:33:08 +02:00 |
|