coach

gryf/coach

mirror of https://github.com/gryf/coach.git synced 2026-07-06 17:26:31 +02:00

Author	SHA1	Message	Date
Gal Leibovich	16c5032735	fix for tensorboard visualization slowing execution even when it is off apparently tensorflow still collect summary data even when no summary FileWriter is defined.	2018-02-18 16:35:24 +02:00
Itai Caspi	72d34f4063	adding a flag to prevent summary	2018-02-15 13:47:14 +02:00
Itai Caspi	55c8c87afc	allow visualizing the observation + bug fixes to coach summary	2018-02-15 13:47:14 +02:00
Itai Caspi	5d1a2bc392	Adding a summary when exiting coach	2018-02-13 11:11:26 +02:00
Itai Caspi	ba96e585d2	appending csv's from logger instead of rewriting them	2018-02-12 14:52:50 +02:00
Itai Caspi	569ca39ce6	Dashboard color selection + removing old legend	2018-02-09 14:52:58 +02:00
Itai Caspi	8a4383e86f	Added an improved legend to dashboard	2018-02-08 16:48:46 +02:00
Itai Caspi	b071599cb0	updating intel optimized tensorflow to version 1.4	2018-02-08 09:20:29 +02:00
Itai Caspi	462fe9796b	several bug fixes in dashboard	2018-02-07 12:49:34 +02:00
galleibo-intel	4025496783	Setting tensorflow-gpu version to 1.4.1 (1.5.0 is not tested yet)	2018-02-05 15:48:00 +02:00
Gal Leibovich	7c8962c991	adding support in tensorboard (#52 ) * bug-fix in architecture.py where additional fetches would acquire more entries than it should * change in run_test to allow ignoring some test(s)	2018-02-05 15:21:49 +02:00
Itai Caspi	a8d5fb7bdf	Added a table of contents to the README	2018-01-27 14:31:53 +02:00
Itai Caspi	522c837e76	Update README.md	2018-01-22 12:15:23 +02:00
Itai Caspi	43821c9630	adding the selu activation	2018-01-22 12:05:43 +02:00
Zach Dwiel	fff8c8f568	provide a helpful error message in the event that an exploration policy returns a vector of actions instead of a single action during value optimization agent	2018-01-20 14:11:24 -05:00
Zach Dwiel	40e5c628c6	add options for more verbose test errors	2018-01-16 22:08:46 -05:00
Zach Dwiel	8f026bb46f	Merge pull request #42 from NervanaSystems/print_parameters provide a command line option which prints the tuning_parameters to stdout	2018-01-11 11:28:24 -05:00
Zach Dwiel	c7b11f1e9a	provide a command line option which prints the tuning_parameters to stdout	2018-01-10 16:28:41 -05:00
Zach Dwiel	9b963c86d0	Merge pull request #41 from NervanaSystems/allow_direct_entry_point allow specifying gym environments via entry point syntax: module.package:class	2018-01-10 12:18:12 -05:00
Zach Dwiel	cc76a9ad70	allow specifying gym environments via entry point syntax: module.package:class	2018-01-10 10:14:23 -05:00
Itai Caspi	42f68f2e8a	update the README with contact mail + small reformatting	2018-01-09 13:08:23 +02:00
Itai Caspi	eeb3ec5497	fixed the LSTM middleware initialization	2018-01-09 10:26:15 +02:00
Itai Caspi	b435c6d2d7	updated the links to the new Intel AI website	2018-01-09 10:25:06 +02:00
Zach Dwiel	499e78596a	Merge pull request #38 from NervanaSystems/nec_lstm update nec and value optimization agents to work with recurrent middleware	2018-01-08 14:01:34 -05:00
Justin	29857412b3	Add force flag to library symbolic link Link step fails if continuing installation after interruption, requiring manual deletion of the link. Adding a force flag overrides the existing symbolic link from attempted installation in the newly created virtual environment.	2018-01-08 20:36:54 +02:00
Zach Dwiel	6c79a442f2	update nec and value optimization agents to work with recurrent middleware	2018-01-05 20:16:51 -05:00
Itai Caspi	645d9d47a9	Adding bibtex to the README	2018-01-03 21:11:57 +02:00
Itai Caspi	93a54c7e8e	Added a link to the 2nd blog post	2017-12-20 17:18:49 +02:00
Itai Caspi	9e59d1960e	bug fix for dumping gifs from doom	2017-12-20 13:10:34 +02:00
Zach Dwiel	37e317682b	allow missing carla environment and missing matplotlib package	2017-12-20 11:47:14 +02:00
Itai Caspi	125c7ee38d	Release 0.9 Main changes are detailed below: New features - * CARLA 0.7 simulator integration * Human control of the game play * Recording of human game play and storing / loading the replay buffer * Behavioral cloning agent and presets * Golden tests for several presets * Selecting between deep / shallow image embedders * Rendering through pygame (with some boost in performance) API changes - * Improved environment wrapper API * Added an evaluate flag to allow convenient evaluation of existing checkpoints * Improve frameskip definition in Gym Bug fixes - * Fixed loading of checkpoints for agents with more than one network * Fixed the N Step Q learning agent python3 compatibility v0.9.0	2017-12-19 19:27:16 +02:00
Itai Caspi	11faf19649	QR-DQN bug fix and imporvements (#30 ) * bug fix - QR-DQN using error instead of abs-error in the quantile huber loss * improvement - QR-DQN sorting the quantile only once instead of batch_size times * new feature - adding the Breakout QRDQN preset (verified to achieve good results)	2017-11-29 14:01:59 +02:00
Zach Dwiel	7bdba396d2	Update add_env.md	2017-11-14 17:57:55 +02:00
Zach Dwiel	9ae2905a76	clean up input embeddings setup	2017-11-14 17:39:18 +02:00
Itai Caspi	1ff0da2165	bug fix - fixed an issue with gifs dumping and bumped up Pillow version to 4.3.0	2017-11-13 12:22:42 +02:00
Miguel Morales	acd2b78a9e	Update README.md Fix algorithms list to be consistent with "<full name> (<acronym>)"	2017-11-12 16:00:00 +02:00
Itai Caspi	8d9ee4ea2b	bug fix - fixed C51 presets hyperparameters	2017-11-10 13:22:42 +02:00
galleibo-intel	3c330768f0	Fix for NEC not saving the DND when saving a model	2017-11-09 19:13:23 +02:00
Itai Caspi	f5d645d8a6	resize training curves images	2017-11-09 09:13:12 +02:00
Itai Caspi	8ee9e46083	fixing some typos in the benchmarks README	2017-11-09 08:58:52 +02:00
Itai Caspi	c798be7bfb	added training curves for some of the presets	2017-11-09 08:54:34 +02:00
cxx	84e536d371	Fix std calculation using unbiased estimation in sharing stat mode.	2017-11-07 20:19:54 +02:00
galleibo-intel	f47b8092af	fix for intel optimized tensorflow on distributed runs + adding coach_env to .gitignore	2017-11-06 19:41:32 +02:00
Itai Caspi	b40259c61a	bug fix - remove import warning when everything was imported successfully + changed global step api to match TF 1.4	2017-11-06 17:28:13 +02:00
Itai Caspi	fd103a7b69	updated the algorithms diagram with QR-DQN	2017-11-01 15:24:54 +02:00
Itai Caspi	a8bce9828c	new feature - implementation of Quantile Regression DQN (https://arxiv.org/pdf/1710.10044v1.pdf ) API change - Distributional DQN renamed to Categorical DQN	2017-11-01 15:09:07 +02:00
Itai Caspi	1ad6262307	bug fix - correcting the evaluation exploration control parameter logging	2017-10-31 13:50:40 +02:00
Itai Caspi	e38611b9eb	bug fix - updating Doom_Health_DFP and Breakout_DQN presets	2017-10-31 10:54:14 +02:00
Itai Caspi	913ab75e8a	bug fix - preventing crashes when the probability of one of the actions is 0 in the policy head	2017-10-31 10:51:48 +02:00
Itai Caspi	1918f16079	imporved API for getting / setting variables within the graph	2017-10-31 10:51:48 +02:00

1 2

80 Commits