coach

gryf/coach

mirror of https://github.com/gryf/coach.git synced 2026-01-31 04:55:50 +01:00

Author	SHA1	Message	Date
Ajay Deshpande	4a6c404070	Adding worker logs and plumbed task_parameters to distributed coach (#130 )	2018-11-23 15:35:11 -08:00
Gal Leibovich	2b4c9c6774	Removing grarph_manager param (#141 )	2018-11-23 11:42:54 -08:00
Gal Leibovich	a1c56edd98	Fixes for having NumpySharedRunningStats syncing on multi-node (#139 ) 1. Having the standard checkpoint prefix in order for the data store to grab it, and sync it to S3. 2. Removing the reference to Redis so that it won't try to pickle that in. 3. Enable restoring a checkpoint into a single-worker run, which was saved by a single-node-multiple-worker run.	2018-11-23 16:11:47 +02:00
Gal Leibovich	d4d06aaea6	remove kubernetes dependency (#117 )	2018-11-18 18:10:22 +02:00
Balaji Subramaniam	101c55d37d	Handle both Environment Steps and Episodes on the subscriber side. (#99 )	2018-11-15 14:42:21 -08:00
Ajay Deshpande	fde73ced13	Simulating the act on the trainer. (#65 ) * Remove the use of daemon threads for Redis subscribe. * Emulate act and observe on trainer side to update internal vars.	2018-11-15 08:38:58 -08:00
Itai Caspi	6d40ad1650	update of api docstrings across coach and tutorials [WIP] (#91 ) * updating the documentation website * adding the built docs * update of api docstrings across coach and tutorials 0-2 * added some missing api documentation * New Sphinx based documentation	2018-11-15 15:00:13 +02:00
Leo Dirac	2804a7c24f	Refactor launcher to be object-oriented (#63 ) * Import of annoy library uses failed_import mechanism.	2018-11-10 22:10:19 +02:00
Gal Leibovich	49dea39d34	N-step returns for rainbow (#67 ) * n_step returns for rainbow * Rename CartPole_PPO -> CartPole_ClippedPPO	2018-11-07 18:33:08 +02:00
Ajay Deshpande	fb2721fffa	Removing comments	2018-10-23 19:59:02 -04:00
Ajay Deshpande	9a30c26469	Adding improvements	2018-10-23 19:59:02 -04:00
zach dwiel	787ab42578	remove extra call to super().store_episode	2018-10-23 19:58:17 -04:00
Ajay Deshpande	0e121c5762	Ignoring redis sub if testing	2018-10-23 16:55:37 -04:00
Ajay Deshpande	7f00235ed5	waiting for a new checkpoint if it's available	2018-10-23 16:54:43 -04:00
Ajay Deshpande	6b2de6ba6d	Adding initial interface for backend and redis pubsub (#19 ) * Adding initial interface for backend and redis pubsub * Addressing comments, adding super in all memories * Removing distributed experience replay	2018-10-23 16:51:48 -04:00
Ajay Deshpande	98850464cc	Adding nfs pv, pvc, waiting for memory to be full	2018-10-23 16:50:48 -04:00
Ajay Deshpande	ce9838a7d6	Adding kubernetes orchestrator for rollouts, adding requirements for incremental docker builds	2018-10-23 16:46:04 -04:00
Ajay Deshpande	21f8ca3978	Removing comments and pytests	2018-10-23 16:40:33 -04:00
Ajay Deshpande	5a54f67a63	Adding distributed experience replay	2018-10-23 16:40:33 -04:00
Zach Dwiel	5758c2f23e	typo; increased detail in comment	2018-10-23 16:35:06 -04:00
Zach Dwiel	a1295d16b3	first pass that transition collection interface	2018-10-23 16:35:06 -04:00
Zach Dwiel	9f1f9e5ab4	replace ExperienceReplay._num_transitions with len(ExperienceReplay.transitions)	2018-10-23 16:34:38 -04:00
Zach Dwiel	cccfe88f9b	remove unused method: update_last_transition_info	2018-10-23 16:34:38 -04:00
itaicaspi-intel	d3f97cd93b	initial CIL implementation (WIP)	2018-09-13 15:29:29 +03:00
itaicaspi-intel	607ef17431	added a simple progress bar implementation	2018-09-13 14:21:38 +03:00
itaicaspi-intel	a16d724963	removing some of the presets from the trace tests + more robust replay buffer loading	2018-09-12 15:26:16 +03:00
itaicaspi-intel	a9bd1047c4	load and save function for non-episodic replay buffers + carla improvements + network bug fixes	2018-09-12 15:26:16 +03:00
itaicaspi-intel	fd2f4b0852	bug fix in HRL HER memory + some small improvements	2018-08-29 14:36:18 +03:00
itaicaspi-intel	658b437079	removing datasets + imports optimization	2018-08-27 10:54:11 +03:00
Gal Novik	19ca5c24b1	pre-release 0.10.0	2018-08-13 17:11:34 +03:00

30 Commits