gryf/coach

mirror of https://github.com/gryf/coach.git synced 2026-07-06 17:26:31 +02:00

Author	SHA1	Message	Date
Itai Caspi	d44c329bb8	Update README.md	2018-06-25 17:46:01 +03:00
Itai Caspi	cfd4fe0faf	Update README.md	2018-06-25 17:43:15 +03:00
Gal Leibovich	2807c29f27	fix for measurements in the initial state (fix for DFP)	2018-05-29 16:47:38 +03:00
itaicaspi-intel	7725dabc86	checkpoints bug fix	2018-05-26 17:49:13 +03:00
itaicaspi-intel	462c6e314b	bug fix in nec checkpoint saving	2018-05-24 15:15:33 +03:00
Itai Caspi	d302168c8c	Parallel agents fixes (#95) * Parallel agents related bug fixes: checkpoint restore, tensorboard integration. Adding narrow networks support. Reference code for unlimited number of checkpoints	2018-05-24 14:24:19 +03:00
itaicaspi-intel	6c0b59b4de	constraining gym installation to version 0.9.4	2018-05-22 11:01:58 +03:00
itaicaspi-intel	a57b7004a8	updating dashboard	2018-05-09 09:26:15 +03:00
Gal Novik	dafdb05a7c	bug fixes for clippedppo and checkpoints	2018-04-30 15:13:29 +03:00
Itai Caspi	f31159aad6	bug fixes for carla environment (#93)	2018-04-23 11:13:24 +03:00
Itai Caspi	52eb159f69	multiple bug fixes in dealing with measurements + CartPole_DFP preset (#92)	2018-04-23 10:44:46 +03:00
itaicaspi-intel	5d5562bf62	moving the docs to github	2018-04-23 09:14:20 +03:00
jtoy	cafa152382	update requirements to have valid tornado version (#84)	2018-04-02 14:21:35 +03:00
Itai Caspi	a7206ed702	Multiple improvements and bug fixes (#66) * Multiple improvements and bug fixes: * Using lazy stacking to save on memory when using a replay buffer * Remove step counting for evaluation episodes * Reset game between heatup and training * Major bug fixes in NEC (is reproducing the paper results for pong now) * Image input rescaling to 0-1 is now optional * Change the terminal title to be the experiment name * Observation cropping for atari is now optional * Added random number of noop actions for gym to match the dqn paper * Fixed a bug where the evaluation episodes won't start with the max possible ale lives * Added a script for plotting the results of an experiment over all the atari games	2018-02-26 12:29:07 +02:00
Zach Dwiel	4fe9cba445	remove debug	2018-02-21 10:05:57 -05:00
Zach Dwiel	eba900067c	remove debug	2018-02-21 10:05:57 -05:00
Zach Dwiel	d1bf83047c	remove debug	2018-02-21 10:05:57 -05:00
Zach Dwiel	ef46e194af	remove unused commented code	2018-02-21 10:05:57 -05:00
Zach Dwiel	d9303e731e	remove python2 compatibility	2018-02-21 10:05:57 -05:00
Zach Dwiel	ec68bd4959	make sure that for now observation spaces all include an observation key	2018-02-21 10:05:57 -05:00
Zach Dwiel	0740ebcdac	by default assume state["observation"] is where the image for rendering can be found	2018-02-21 10:05:57 -05:00
Zach Dwiel	f9f92a42fd	cleanup debugging code	2018-02-21 10:05:57 -05:00
Zach Dwiel	86362683b1	comment	2018-02-21 10:05:57 -05:00
Zach Dwiel	8fc24a2bbe	fix bc_agent	2018-02-21 10:05:57 -05:00
Zach Dwiel	d8f5a35013	fix qr_dqn_agent	2018-02-21 10:05:57 -05:00
Zach Dwiel	e1ad86417f	fix n_step_q_agent	2018-02-21 10:05:57 -05:00
Zach Dwiel	5cf10e5f52	fix bug in ddpg	2018-02-21 10:05:57 -05:00
Zach Dwiel	8248caf35e	fix more agents	2018-02-21 10:05:57 -05:00
Zach Dwiel	98f57a0d87	fix ddpg	2018-02-21 10:05:57 -05:00
Zach Dwiel	943e41ba58	fix nec_agent	2018-02-21 10:05:57 -05:00
Zach Dwiel	ee6e0bdc3b	fix keep_dims -> keepdims	2018-02-21 10:05:57 -05:00
Zach Dwiel	39a28aba95	fix clipped ppo	2018-02-21 10:05:57 -05:00
Zach Dwiel	85afb86893	temp commit	2018-02-21 10:05:57 -05:00
Gal Leibovich	16c5032735	fix for tensorboard visualization slowing execution even when it is off apparently tensorflow still collect summary data even when no summary FileWriter is defined.	2018-02-18 16:35:24 +02:00
Itai Caspi	72d34f4063	adding a flag to prevent summary	2018-02-15 13:47:14 +02:00
Itai Caspi	55c8c87afc	allow visualizing the observation + bug fixes to coach summary	2018-02-15 13:47:14 +02:00
Itai Caspi	5d1a2bc392	Adding a summary when exiting coach	2018-02-13 11:11:26 +02:00
Itai Caspi	ba96e585d2	appending csv's from logger instead of rewriting them	2018-02-12 14:52:50 +02:00
Itai Caspi	569ca39ce6	Dashboard color selection + removing old legend	2018-02-09 14:52:58 +02:00
Itai Caspi	8a4383e86f	Added an improved legend to dashboard	2018-02-08 16:48:46 +02:00
Itai Caspi	b071599cb0	updating intel optimized tensorflow to version 1.4	2018-02-08 09:20:29 +02:00
Itai Caspi	462fe9796b	several bug fixes in dashboard	2018-02-07 12:49:34 +02:00
galleibo-intel	4025496783	Setting tensorflow-gpu version to 1.4.1 (1.5.0 is not tested yet)	2018-02-05 15:48:00 +02:00
Gal Leibovich	7c8962c991	adding support in tensorboard (#52) * bug-fix in architecture.py where additional fetches would acquire more entries than it should * change in run_test to allow ignoring some test(s)	2018-02-05 15:21:49 +02:00
Itai Caspi	a8d5fb7bdf	Added a table of contents to the README	2018-01-27 14:31:53 +02:00
Itai Caspi	522c837e76	Update README.md	2018-01-22 12:15:23 +02:00
Itai Caspi	43821c9630	adding the selu activation	2018-01-22 12:05:43 +02:00
Zach Dwiel	fff8c8f568	provide a helpful error message in the event that an exploration policy returns a vector of actions instead of a single action during value optimization agent	2018-01-20 14:11:24 -05:00
Zach Dwiel	40e5c628c6	add options for more verbose test errors	2018-01-16 22:08:46 -05:00
Zach Dwiel	8f026bb46f	Merge pull request #42 from NervanaSystems/print_parameters provide a command line option which prints the tuning_parameters to stdout	2018-01-11 11:28:24 -05:00

113 Commits