coach

gryf/coach

mirror of https://github.com/gryf/coach.git synced 2026-07-06 17:26:31 +02:00

Author	SHA1	Message	Date
itaicaspi-intel	5d5562bf62	moving the docs to github	2018-04-23 09:14:20 +03:00
jtoy	cafa152382	update requirements to have valid tornado version (#84 )	2018-04-02 14:21:35 +03:00
Itai Caspi	a7206ed702	Multiple improvements and bug fixes (#66 ) * Multiple improvements and bug fixes: * Using lazy stacking to save on memory when using a replay buffer * Remove step counting for evaluation episodes * Reset game between heatup and training * Major bug fixes in NEC (is reproducing the paper results for pong now) * Image input rescaling to 0-1 is now optional * Change the terminal title to be the experiment name * Observation cropping for atari is now optional * Added random number of noop actions for gym to match the dqn paper * Fixed a bug where the evaluation episodes won't start with the max possible ale lives * Added a script for plotting the results of an experiment over all the atari games	2018-02-26 12:29:07 +02:00
Zach Dwiel	4fe9cba445	remove debug	2018-02-21 10:05:57 -05:00
Zach Dwiel	eba900067c	remove debug	2018-02-21 10:05:57 -05:00
Zach Dwiel	d1bf83047c	remove debug	2018-02-21 10:05:57 -05:00
Zach Dwiel	ef46e194af	remove unused commented code	2018-02-21 10:05:57 -05:00
Zach Dwiel	d9303e731e	remove python2 compatibility	2018-02-21 10:05:57 -05:00
Zach Dwiel	ec68bd4959	make sure that for now observation spaces all include an observation key	2018-02-21 10:05:57 -05:00
Zach Dwiel	0740ebcdac	by default assume state["observation"] is where the image for rendering can be found	2018-02-21 10:05:57 -05:00
Zach Dwiel	f9f92a42fd	cleanup debugging code	2018-02-21 10:05:57 -05:00
Zach Dwiel	86362683b1	comment	2018-02-21 10:05:57 -05:00
Zach Dwiel	8fc24a2bbe	fix bc_agent	2018-02-21 10:05:57 -05:00
Zach Dwiel	d8f5a35013	fix qr_dqn_agent	2018-02-21 10:05:57 -05:00
Zach Dwiel	e1ad86417f	fix n_step_q_agent	2018-02-21 10:05:57 -05:00
Zach Dwiel	5cf10e5f52	fix bug in ddpg	2018-02-21 10:05:57 -05:00
Zach Dwiel	8248caf35e	fix more agents	2018-02-21 10:05:57 -05:00
Zach Dwiel	98f57a0d87	fix ddpg	2018-02-21 10:05:57 -05:00
Zach Dwiel	943e41ba58	fix nec_agent	2018-02-21 10:05:57 -05:00
Zach Dwiel	ee6e0bdc3b	fix keep_dims -> keepdims	2018-02-21 10:05:57 -05:00
Zach Dwiel	39a28aba95	fix clipped ppo	2018-02-21 10:05:57 -05:00
Zach Dwiel	85afb86893	temp commit	2018-02-21 10:05:57 -05:00
Gal Leibovich	16c5032735	fix for tensorboard visualization slowing execution even when it is off apparently tensorflow still collect summary data even when no summary FileWriter is defined.	2018-02-18 16:35:24 +02:00
Itai Caspi	72d34f4063	adding a flag to prevent summary	2018-02-15 13:47:14 +02:00
Itai Caspi	55c8c87afc	allow visualizing the observation + bug fixes to coach summary	2018-02-15 13:47:14 +02:00
Itai Caspi	5d1a2bc392	Adding a summary when exiting coach	2018-02-13 11:11:26 +02:00
Itai Caspi	ba96e585d2	appending csv's from logger instead of rewriting them	2018-02-12 14:52:50 +02:00
Itai Caspi	569ca39ce6	Dashboard color selection + removing old legend	2018-02-09 14:52:58 +02:00
Itai Caspi	8a4383e86f	Added an improved legend to dashboard	2018-02-08 16:48:46 +02:00
Itai Caspi	b071599cb0	updating intel optimized tensorflow to version 1.4	2018-02-08 09:20:29 +02:00
Itai Caspi	462fe9796b	several bug fixes in dashboard	2018-02-07 12:49:34 +02:00
galleibo-intel	4025496783	Setting tensorflow-gpu version to 1.4.1 (1.5.0 is not tested yet)	2018-02-05 15:48:00 +02:00
Gal Leibovich	7c8962c991	adding support in tensorboard (#52 ) * bug-fix in architecture.py where additional fetches would acquire more entries than it should * change in run_test to allow ignoring some test(s)	2018-02-05 15:21:49 +02:00
Itai Caspi	a8d5fb7bdf	Added a table of contents to the README	2018-01-27 14:31:53 +02:00
Itai Caspi	522c837e76	Update README.md	2018-01-22 12:15:23 +02:00
Itai Caspi	43821c9630	adding the selu activation	2018-01-22 12:05:43 +02:00
Zach Dwiel	fff8c8f568	provide a helpful error message in the event that an exploration policy returns a vector of actions instead of a single action during value optimization agent	2018-01-20 14:11:24 -05:00
Zach Dwiel	40e5c628c6	add options for more verbose test errors	2018-01-16 22:08:46 -05:00
Zach Dwiel	8f026bb46f	Merge pull request #42 from NervanaSystems/print_parameters provide a command line option which prints the tuning_parameters to stdout	2018-01-11 11:28:24 -05:00
Zach Dwiel	c7b11f1e9a	provide a command line option which prints the tuning_parameters to stdout	2018-01-10 16:28:41 -05:00
Zach Dwiel	9b963c86d0	Merge pull request #41 from NervanaSystems/allow_direct_entry_point allow specifying gym environments via entry point syntax: module.package:class	2018-01-10 12:18:12 -05:00
Zach Dwiel	cc76a9ad70	allow specifying gym environments via entry point syntax: module.package:class	2018-01-10 10:14:23 -05:00
Itai Caspi	42f68f2e8a	update the README with contact mail + small reformatting	2018-01-09 13:08:23 +02:00
Itai Caspi	eeb3ec5497	fixed the LSTM middleware initialization	2018-01-09 10:26:15 +02:00
Itai Caspi	b435c6d2d7	updated the links to the new Intel AI website	2018-01-09 10:25:06 +02:00
Zach Dwiel	499e78596a	Merge pull request #38 from NervanaSystems/nec_lstm update nec and value optimization agents to work with recurrent middleware	2018-01-08 14:01:34 -05:00
Justin	29857412b3	Add force flag to library symbolic link Link step fails if continuing installation after interruption, requiring manual deletion of the link. Adding a force flag overrides the existing symbolic link from attempted installation in the newly created virtual environment.	2018-01-08 20:36:54 +02:00
Zach Dwiel	6c79a442f2	update nec and value optimization agents to work with recurrent middleware	2018-01-05 20:16:51 -05:00
Itai Caspi	645d9d47a9	Adding bibtex to the README	2018-01-03 21:11:57 +02:00
Itai Caspi	93a54c7e8e	Added a link to the 2nd blog post	2017-12-20 17:18:49 +02:00

1 2 3

102 Commits