Balaji Subramaniam
101c55d37d
Handle both Environment Steps and Episodes on the subscriber side. ( #99 )
2018-11-15 14:42:21 -08:00
Ajay Deshpande
fde73ced13
Simulating the act on the trainer. ( #65 )
...
* Remove the use of daemon threads for Redis subscribe.
* Emulate act and observe on trainer side to update internal vars.
2018-11-15 08:38:58 -08:00
Itai Caspi
6d40ad1650
update of api docstrings across coach and tutorials [WIP] ( #91 )
...
* updating the documentation website
* adding the built docs
* update of api docstrings across coach and tutorials 0-2
* added some missing api documentation
* New Sphinx based documentation
2018-11-15 15:00:13 +02:00
Leo Dirac
2804a7c24f
Refactor launcher to be object-oriented ( #63 )
...
* Import of annoy library uses failed_import mechanism.
2018-11-10 22:10:19 +02:00
Gal Leibovich
49dea39d34
N-step returns for rainbow ( #67 )
...
* n_step returns for rainbow
* Rename CartPole_PPO -> CartPole_ClippedPPO
2018-11-07 18:33:08 +02:00
Ajay Deshpande
fb2721fffa
Removing comments
2018-10-23 19:59:02 -04:00
Ajay Deshpande
9a30c26469
Adding improvements
2018-10-23 19:59:02 -04:00
zach dwiel
787ab42578
remove extra call to super().store_episode
2018-10-23 19:58:17 -04:00
Ajay Deshpande
0e121c5762
Ignoring redis sub if testing
2018-10-23 16:55:37 -04:00
Ajay Deshpande
7f00235ed5
waiting for a new checkpoint if it's available
2018-10-23 16:54:43 -04:00
Ajay Deshpande
6b2de6ba6d
Adding initial interface for backend and redis pubsub ( #19 )
...
* Adding initial interface for backend and redis pubsub
* Addressing comments, adding super in all memories
* Removing distributed experience replay
2018-10-23 16:51:48 -04:00
Ajay Deshpande
98850464cc
Adding nfs pv, pvc, waiting for memory to be full
2018-10-23 16:50:48 -04:00
Ajay Deshpande
ce9838a7d6
Adding kubernetes orchestrator for rollouts, adding requirements for incremental docker builds
2018-10-23 16:46:04 -04:00
Ajay Deshpande
21f8ca3978
Removing comments and pytests
2018-10-23 16:40:33 -04:00
Ajay Deshpande
5a54f67a63
Adding distributed experience replay
2018-10-23 16:40:33 -04:00
Zach Dwiel
5758c2f23e
typo; increased detail in comment
2018-10-23 16:35:06 -04:00
Zach Dwiel
a1295d16b3
first pass that transition collection interface
2018-10-23 16:35:06 -04:00
Zach Dwiel
9f1f9e5ab4
replace ExperienceReplay._num_transitions with len(ExperienceReplay.transitions)
2018-10-23 16:34:38 -04:00
Zach Dwiel
cccfe88f9b
remove unused method: update_last_transition_info
2018-10-23 16:34:38 -04:00
itaicaspi-intel
d3f97cd93b
initial CIL implementation (WIP)
2018-09-13 15:29:29 +03:00
itaicaspi-intel
607ef17431
added a simple progress bar implementation
2018-09-13 14:21:38 +03:00
itaicaspi-intel
a16d724963
removing some of the presets from the trace tests + more robust replay buffer loading
2018-09-12 15:26:16 +03:00
itaicaspi-intel
a9bd1047c4
load and save function for non-episodic replay buffers + carla improvements + network bug fixes
2018-09-12 15:26:16 +03:00
itaicaspi-intel
fd2f4b0852
bug fix in HRL HER memory + some small improvements
2018-08-29 14:36:18 +03:00
itaicaspi-intel
658b437079
removing datasets + imports optimization
2018-08-27 10:54:11 +03:00
Gal Novik
19ca5c24b1
pre-release 0.10.0
2018-08-13 17:11:34 +03:00