coach

gryf/coach

mirror of https://github.com/gryf/coach.git synced 2026-02-09 10:05:50 +01:00

Author	SHA1	Message	Date
Itai Caspi	8a4383e86f	Added an improved legend to dashboard	2018-02-08 16:48:46 +02:00
Itai Caspi	b071599cb0	updating intel optimized tensorflow to version 1.4	2018-02-08 09:20:29 +02:00
Itai Caspi	462fe9796b	several bug fixes in dashboard	2018-02-07 12:49:34 +02:00
galleibo-intel	4025496783	Setting tensorflow-gpu version to 1.4.1 (1.5.0 is not tested yet)	2018-02-05 15:48:00 +02:00
Gal Leibovich	7c8962c991	adding support in tensorboard (#52 ) * bug-fix in architecture.py where additional fetches would acquire more entries than it should * change in run_test to allow ignoring some test(s)	2018-02-05 15:21:49 +02:00
Itai Caspi	a8d5fb7bdf	Added a table of contents to the README	2018-01-27 14:31:53 +02:00
Itai Caspi	522c837e76	Update README.md	2018-01-22 12:15:23 +02:00
Itai Caspi	43821c9630	adding the selu activation	2018-01-22 12:05:43 +02:00
Zach Dwiel	fff8c8f568	provide a helpful error message in the event that an exploration policy returns a vector of actions instead of a single action during value optimization agent	2018-01-20 14:11:24 -05:00
Zach Dwiel	40e5c628c6	add options for more verbose test errors	2018-01-16 22:08:46 -05:00
Zach Dwiel	8f026bb46f	Merge pull request #42 from NervanaSystems/print_parameters provide a command line option which prints the tuning_parameters to stdout	2018-01-11 11:28:24 -05:00
Zach Dwiel	c7b11f1e9a	provide a command line option which prints the tuning_parameters to stdout	2018-01-10 16:28:41 -05:00
Zach Dwiel	9b963c86d0	Merge pull request #41 from NervanaSystems/allow_direct_entry_point allow specifying gym environments via entry point syntax: module.package:class	2018-01-10 12:18:12 -05:00
Zach Dwiel	cc76a9ad70	allow specifying gym environments via entry point syntax: module.package:class	2018-01-10 10:14:23 -05:00
Itai Caspi	42f68f2e8a	update the README with contact mail + small reformatting	2018-01-09 13:08:23 +02:00
Itai Caspi	eeb3ec5497	fixed the LSTM middleware initialization	2018-01-09 10:26:15 +02:00
Itai Caspi	b435c6d2d7	updated the links to the new Intel AI website	2018-01-09 10:25:06 +02:00
Zach Dwiel	499e78596a	Merge pull request #38 from NervanaSystems/nec_lstm update nec and value optimization agents to work with recurrent middleware	2018-01-08 14:01:34 -05:00
Justin	29857412b3	Add force flag to library symbolic link Link step fails if continuing installation after interruption, requiring manual deletion of the link. Adding a force flag overrides the existing symbolic link from attempted installation in the newly created virtual environment.	2018-01-08 20:36:54 +02:00
Zach Dwiel	6c79a442f2	update nec and value optimization agents to work with recurrent middleware	2018-01-05 20:16:51 -05:00
Itai Caspi	645d9d47a9	Adding bibtex to the README	2018-01-03 21:11:57 +02:00
Itai Caspi	93a54c7e8e	Added a link to the 2nd blog post	2017-12-20 17:18:49 +02:00
Itai Caspi	9e59d1960e	bug fix for dumping gifs from doom	2017-12-20 13:10:34 +02:00
Zach Dwiel	37e317682b	allow missing carla environment and missing matplotlib package	2017-12-20 11:47:14 +02:00
Itai Caspi	125c7ee38d	Release 0.9 Main changes are detailed below: New features - * CARLA 0.7 simulator integration * Human control of the game play * Recording of human game play and storing / loading the replay buffer * Behavioral cloning agent and presets * Golden tests for several presets * Selecting between deep / shallow image embedders * Rendering through pygame (with some boost in performance) API changes - * Improved environment wrapper API * Added an evaluate flag to allow convenient evaluation of existing checkpoints * Improve frameskip definition in Gym Bug fixes - * Fixed loading of checkpoints for agents with more than one network * Fixed the N Step Q learning agent python3 compatibility v0.9.0	2017-12-19 19:27:16 +02:00
Itai Caspi	11faf19649	QR-DQN bug fix and imporvements (#30 ) * bug fix - QR-DQN using error instead of abs-error in the quantile huber loss * improvement - QR-DQN sorting the quantile only once instead of batch_size times * new feature - adding the Breakout QRDQN preset (verified to achieve good results)	2017-11-29 14:01:59 +02:00
Zach Dwiel	7bdba396d2	Update add_env.md	2017-11-14 17:57:55 +02:00
Zach Dwiel	9ae2905a76	clean up input embeddings setup	2017-11-14 17:39:18 +02:00
Itai Caspi	1ff0da2165	bug fix - fixed an issue with gifs dumping and bumped up Pillow version to 4.3.0	2017-11-13 12:22:42 +02:00
Miguel Morales	acd2b78a9e	Update README.md Fix algorithms list to be consistent with "<full name> (<acronym>)"	2017-11-12 16:00:00 +02:00
Itai Caspi	8d9ee4ea2b	bug fix - fixed C51 presets hyperparameters	2017-11-10 13:22:42 +02:00
galleibo-intel	3c330768f0	Fix for NEC not saving the DND when saving a model	2017-11-09 19:13:23 +02:00
Itai Caspi	f5d645d8a6	resize training curves images	2017-11-09 09:13:12 +02:00
Itai Caspi	8ee9e46083	fixing some typos in the benchmarks README	2017-11-09 08:58:52 +02:00
Itai Caspi	c798be7bfb	added training curves for some of the presets	2017-11-09 08:54:34 +02:00
cxx	84e536d371	Fix std calculation using unbiased estimation in sharing stat mode.	2017-11-07 20:19:54 +02:00
galleibo-intel	f47b8092af	fix for intel optimized tensorflow on distributed runs + adding coach_env to .gitignore	2017-11-06 19:41:32 +02:00
Itai Caspi	b40259c61a	bug fix - remove import warning when everything was imported successfully + changed global step api to match TF 1.4	2017-11-06 17:28:13 +02:00
Itai Caspi	fd103a7b69	updated the algorithms diagram with QR-DQN	2017-11-01 15:24:54 +02:00
Itai Caspi	a8bce9828c	new feature - implementation of Quantile Regression DQN (https://arxiv.org/pdf/1710.10044v1.pdf ) API change - Distributional DQN renamed to Categorical DQN	2017-11-01 15:09:07 +02:00
Itai Caspi	1ad6262307	bug fix - correcting the evaluation exploration control parameter logging	2017-10-31 13:50:40 +02:00
Itai Caspi	e38611b9eb	bug fix - updating Doom_Health_DFP and Breakout_DQN presets	2017-10-31 10:54:14 +02:00
Itai Caspi	913ab75e8a	bug fix - preventing crashes when the probability of one of the actions is 0 in the policy head	2017-10-31 10:51:48 +02:00
Itai Caspi	1918f16079	imporved API for getting / setting variables within the graph	2017-10-31 10:51:48 +02:00
cxx	e33b0e8534	Fix preset mistakes.	2017-10-26 12:37:32 +03:00
cxx	f43c951c2d	Unify base class using new-style (object).	2017-10-26 12:33:09 +03:00
Itai Caspi	39cf78074c	preventing the evaluation agent from getting stuck in bad policies by updating from the global network during episodes	2017-10-25 10:28:45 +03:00
Itai Caspi	d3c6860421	removing some Gym wrapper code that was required for OpenAI Universe in the past	2017-10-25 10:01:10 +03:00
Gal Leibovich	eb0b57d7fa	Updating PPO references per issue #11	2017-10-24 16:57:44 +03:00
Itai Caspi	43bc359166	updated atari presets with v4 environment ids	2017-10-23 14:14:09 +03:00

1 2 3

124 Commits