gryf/coach

mirror of https://github.com/gryf/coach.git synced 2026-07-07 01:46:31 +02:00

Author	SHA1	Message	Date
galleibo-intel	3c330768f0	Fix for NEC not saving the DND when saving a model	2017-11-09 19:13:23 +02:00
galleibo-intel	f47b8092af	fix for intel optimized tensorflow on distributed runs + adding coach_env to .gitignore	2017-11-06 19:41:32 +02:00
Itai Caspi	a8bce9828c	new feature - implementation of Quantile Regression DQN (https://arxiv.org/pdf/1710.10044v1.pdf); API change - Distributional DQN renamed to Categorical DQN	2017-11-01 15:09:07 +02:00
Itai Caspi	913ab75e8a	bug fix - preventing crashes when the probability of one of the actions is 0 in the policy head	2017-10-31 10:51:48 +02:00
Itai Caspi	1918f16079	improved API for getting / setting variables within the graph	2017-10-31 10:51:48 +02:00
cxx	f43c951c2d	Unify base class using new-style (object).	2017-10-26 12:33:09 +03:00
Itai Caspi	39cf78074c	preventing the evaluation agent from getting stuck in bad policies by updating from the global network during episodes	2017-10-25 10:28:45 +03:00
Gal Leibovich	eb0b57d7fa	Updating PPO references per issue #11	2017-10-24 16:57:44 +03:00
Gal Leibovich	1d4c3455e7	coach v0.8.0	2017-10-19 13:10:15 +03:00