Ajay Deshpande
4a6c404070
Adding worker logs and plumbed task_parameters to distributed coach ( #130 )
2018-11-23 15:35:11 -08:00
Gal Leibovich
2b4c9c6774
Removing grarph_manager param ( #141 )
2018-11-23 11:42:54 -08:00
Gal Leibovich
a1c56edd98
Fixes for having NumpySharedRunningStats syncing on multi-node ( #139 )
...
1. Having the standard checkpoint prefix in order for the data store to grab it, and sync it to S3.
2. Removing the reference to Redis so that it won't try to pickle that in.
3. Enable restoring a checkpoint into a single-worker run, which was saved by a single-node-multiple-worker run.
2018-11-23 16:11:47 +02:00
Gal Leibovich
d4d06aaea6
remove kubernetes dependency ( #117 )
2018-11-18 18:10:22 +02:00
Balaji Subramaniam
101c55d37d
Handle both Environment Steps and Episodes on the subscriber side. ( #99 )
2018-11-15 14:42:21 -08:00
Ajay Deshpande
fde73ced13
Simulating the act on the trainer. ( #65 )
...
* Remove the use of daemon threads for Redis subscribe.
* Emulate act and observe on trainer side to update internal vars.
2018-11-15 08:38:58 -08:00
Itai Caspi
6d40ad1650
update of api docstrings across coach and tutorials [WIP] ( #91 )
...
* updating the documentation website
* adding the built docs
* update of api docstrings across coach and tutorials 0-2
* added some missing api documentation
* New Sphinx based documentation
2018-11-15 15:00:13 +02:00
Leo Dirac
2804a7c24f
Refactor launcher to be object-oriented ( #63 )
...
* Import of annoy library uses failed_import mechanism.
2018-11-10 22:10:19 +02:00
Gal Leibovich
49dea39d34
N-step returns for rainbow ( #67 )
...
* n_step returns for rainbow
* Rename CartPole_PPO -> CartPole_ClippedPPO
2018-11-07 18:33:08 +02:00
Ajay Deshpande
fb2721fffa
Removing comments
2018-10-23 19:59:02 -04:00
Ajay Deshpande
9a30c26469
Adding improvements
2018-10-23 19:59:02 -04:00
zach dwiel
787ab42578
remove extra call to super().store_episode
2018-10-23 19:58:17 -04:00
Ajay Deshpande
0e121c5762
Ignoring redis sub if testing
2018-10-23 16:55:37 -04:00
Ajay Deshpande
7f00235ed5
waiting for a new checkpoint if it's available
2018-10-23 16:54:43 -04:00
Ajay Deshpande
6b2de6ba6d
Adding initial interface for backend and redis pubsub ( #19 )
...
* Adding initial interface for backend and redis pubsub
* Addressing comments, adding super in all memories
* Removing distributed experience replay
2018-10-23 16:51:48 -04:00
Ajay Deshpande
98850464cc
Adding nfs pv, pvc, waiting for memory to be full
2018-10-23 16:50:48 -04:00
Ajay Deshpande
ce9838a7d6
Adding kubernetes orchestrator for rollouts, adding requirements for incremental docker builds
2018-10-23 16:46:04 -04:00
Ajay Deshpande
21f8ca3978
Removing comments and pytests
2018-10-23 16:40:33 -04:00
Ajay Deshpande
5a54f67a63
Adding distributed experience replay
2018-10-23 16:40:33 -04:00
Zach Dwiel
5758c2f23e
typo; increased detail in comment
2018-10-23 16:35:06 -04:00
Zach Dwiel
a1295d16b3
first pass that transition collection interface
2018-10-23 16:35:06 -04:00
Zach Dwiel
9f1f9e5ab4
replace ExperienceReplay._num_transitions with len(ExperienceReplay.transitions)
2018-10-23 16:34:38 -04:00
Zach Dwiel
cccfe88f9b
remove unused method: update_last_transition_info
2018-10-23 16:34:38 -04:00
itaicaspi-intel
d3f97cd93b
initial CIL implementation (WIP)
2018-09-13 15:29:29 +03:00
itaicaspi-intel
607ef17431
added a simple progress bar implementation
2018-09-13 14:21:38 +03:00
itaicaspi-intel
a16d724963
removing some of the presets from the trace tests + more robust replay buffer loading
2018-09-12 15:26:16 +03:00
itaicaspi-intel
a9bd1047c4
load and save function for non-episodic replay buffers + carla improvements + network bug fixes
2018-09-12 15:26:16 +03:00
itaicaspi-intel
fd2f4b0852
bug fix in HRL HER memory + some small improvements
2018-08-29 14:36:18 +03:00
itaicaspi-intel
658b437079
removing datasets + imports optimization
2018-08-27 10:54:11 +03:00
Gal Novik
19ca5c24b1
pre-release 0.10.0
2018-08-13 17:11:34 +03:00