gryf/coach

mirror of https://github.com/gryf/coach.git synced 2026-07-08 10:26:32 +02:00

Author	SHA1	Message	Date
Gal Novik	79b05a8105	Wolpertinger preset failure fix (#434) Numpy 1.18 fails to cast float to int as part of the wolpertinger preset run	2020-01-14 16:26:38 +02:00
Gal Leibovich	138ced23ba	RL in Large Discrete Action Spaces - Wolpertinger Agent (#394) * Currently this is specific to the case of discretizing a continuous action space. Can easily be adapted to other case by feeding the kNN otherwise, and removing the usage of a discretizing output action filter	2019-09-08 12:53:49 +03:00
Gal Leibovich	c1d1fae342	Distiller's AMC induced changes (#359) * override episode rewards with the last transition reward * EWMA normalization filter * allowing control over when the pre_network filter runs	2019-08-05 10:24:58 +03:00
Gal Leibovich	19ad2d60a7	Batch RL Tutorial (#372)	2019-07-14 18:43:48 +03:00
Gal Leibovich	4c996e147e	applying filters for a csv loaded dataset + some bug-fixes in data loading (#319)	2019-05-28 15:44:55 +03:00
Gal Leibovich	30c2b2fc45	moving to skimage.transform.resize (#321)	2019-05-23 13:38:01 +03:00
Zach Dwiel	cd812b0d25	more clear names for methods of Space (#181) * rename Space.val_matches_space_definition -> contains; Space.is_point_in_space_shape -> valid_index * rename valid_index -> is_valid_index	2019-01-14 15:02:53 -05:00
Gal Leibovich	4c914c057c	fix for finding the right filter checkpoint to restore + do not update internal filter state when evaluating + fix SharedRunningStats checkpoint filenames (#147)	2018-12-17 21:36:27 +02:00
Gal Leibovich	f9ee526536	Fix for issue #128 - circular DQN import (#130)	2018-12-16 16:06:44 +02:00
Ryan Peach	28e5b8b612	Minor bugfix on RewardFilter in Readme (#133)	2018-11-30 16:02:08 -08:00
Gal Leibovich	a1c56edd98	Fixes for having NumpySharedRunningStats syncing on multi-node (#139) 1. Having the standard checkpoint prefix in order for the data store to grab it, and sync it to S3. 2. Removing the reference to Redis so that it won't try to pickle that in. 3. Enable restoring a checkpoint into a single-worker run, which was saved by a single-node-multiple-worker run.	2018-11-23 16:11:47 +02:00
Sina Afrooze	87a7848b0a	Moved tf.variable_scope and tf.device calls to framework-specific architecture (#136)	2018-11-22 22:52:21 +02:00
Gal Leibovich	a112ee69f6	Save filters' internal state (#127) * save filters internal state * moving the restore to be made from within NumpyRunningStats	2018-11-20 17:21:48 +02:00
Gal Leibovich	6caf721d1c	Numpy shared running stats (#97)	2018-11-18 14:46:40 +02:00
Ajay Deshpande	fde73ced13	Simulating the act on the trainer. (#65) * Remove the use of daemon threads for Redis subscribe. * Emulate act and observe on trainer side to update internal vars.	2018-11-15 08:38:58 -08:00
Itai Caspi	6d40ad1650	update of api docstrings across coach and tutorials [WIP] (#91) * updating the documentation website * adding the built docs * update of api docstrings across coach and tutorials 0-2 * added some missing api documentation * New Sphinx based documentation	2018-11-15 15:00:13 +02:00
Balaji Subramaniam	a849c17e46	Enable distributed SharedRunningStats (#81) - Use Redis pub/sub for updating SharedRunningStats.	2018-11-13 19:17:38 +02:00
itaicaspi-intel	171fe97a3a	imitation related bug fixes	2018-09-12 15:26:16 +03:00
itaicaspi-intel	658b437079	removing datasets + imports optimization	2018-08-27 10:54:11 +03:00
Gal Novik	19ca5c24b1	pre-release 0.10.0	2018-08-13 17:11:34 +03:00

20 Commits