coach

gryf/coach

mirror of https://github.com/gryf/coach.git synced 2026-02-21 17:25:53 +01:00

Author	SHA1	Message	Date
Gal Leibovich	19ad2d60a7	Batch RL Tutorial (#372 )	2019-07-14 18:43:48 +03:00
Gal Leibovich	acceb03ac0	bug fixes for OPE (#311 )	2019-05-21 16:39:11 +03:00
Gal Leibovich	582921ffe3	OPE: Weighted Importance Sampling (#299 )	2019-05-02 19:25:42 +03:00
Gal Leibovich	6e08c55ad5	Enabling-more-agents-for-Batch-RL-and-cleanup (#258 ) allowing for the last training batch drawn to be smaller than batch_size + adding support for more agents in BatchRL by adding softmax with temperature to the corresponding heads + adding a CartPole_QR_DQN preset with a golden test + cleanups	2019-03-21 16:10:29 +02:00
Gal Leibovich	e3c7e526c7	Batch RL (#238 )	2019-03-19 18:07:09 +02:00
Ajay Deshpande	6b2de6ba6d	Adding initial interface for backend and redis pubsub (#19 ) * Adding initial interface for backend and redis pubsub * Addressing comments, adding super in all memories * Removing distributed experience replay	2018-10-23 16:51:48 -04:00
Zach Dwiel	9f1f9e5ab4	replace ExperienceReplay._num_transitions with len(ExperienceReplay.transitions)	2018-10-23 16:34:38 -04:00
Zach Dwiel	cccfe88f9b	remove unused method: update_last_transition_info	2018-10-23 16:34:38 -04:00
itaicaspi-intel	607ef17431	added a simple progress bar implementation	2018-09-13 14:21:38 +03:00
itaicaspi-intel	a16d724963	removing some of the presets from the trace tests + more robust replay buffer loading	2018-09-12 15:26:16 +03:00
itaicaspi-intel	a9bd1047c4	load and save function for non-episodic replay buffers + carla improvements + network bug fixes	2018-09-12 15:26:16 +03:00
itaicaspi-intel	658b437079	removing datasets + imports optimization	2018-08-27 10:54:11 +03:00
Gal Novik	19ca5c24b1	pre-release 0.10.0	2018-08-13 17:11:34 +03:00

13 Commits