coach

gryf/coach

mirror of https://github.com/gryf/coach.git synced 2026-02-16 05:55:46 +01:00

Author	SHA1	Message	Date
Gal Leibovich	138ced23ba	RL in Large Discrete Action Spaces - Wolpertinger Agent (#394 ) * Currently this is specific to the case of discretizing a continuous action space. Can easily be adapted to other case by feeding the kNN otherwise, and removing the usage of a discretizing output action filter	2019-09-08 12:53:49 +03:00
Gal Leibovich	a1c56edd98	Fixes for having NumpySharedRunningStats syncing on multi-node (#139 ) 1. Having the standard checkpoint prefix in order for the data store to grab it, and sync it to S3. 2. Removing the reference to Redis so that it won't try to pickle that in. 3. Enable restoring a checkpoint into a single-worker run, which was saved by a single-node-multiple-worker run.	2018-11-23 16:11:47 +02:00
Leo Dirac	2804a7c24f	Refactor launcher to be object-oriented (#63 ) * Import of annoy library uses failed_import mechanism.	2018-11-10 22:10:19 +02:00
Gal Novik	19ca5c24b1	pre-release 0.10.0	2018-08-13 17:11:34 +03:00