mirror of https://github.com/gryf/coach.git synced 2026-02-01 13:25:45 +01:00

80 Commits

Author SHA1 Message Date
Gal Leibovich
16c5032735 fix for tensorboard visualization slowing execution even when it is off
apparently tensorflow still collects summary data even when no summary FileWriter is defined.
2018-02-18 16:35:24 +02:00
Itai Caspi
72d34f4063 adding a flag to prevent summary 2018-02-15 13:47:14 +02:00
Itai Caspi
55c8c87afc allow visualizing the observation + bug fixes to coach summary 2018-02-15 13:47:14 +02:00
Itai Caspi
5d1a2bc392 Adding a summary when exiting coach 2018-02-13 11:11:26 +02:00
Itai Caspi
ba96e585d2 appending csv's from logger instead of rewriting them 2018-02-12 14:52:50 +02:00
Itai Caspi
569ca39ce6 Dashboard color selection + removing old legend 2018-02-09 14:52:58 +02:00
Itai Caspi
8a4383e86f Added an improved legend to dashboard 2018-02-08 16:48:46 +02:00
Itai Caspi
b071599cb0 updating intel optimized tensorflow to version 1.4 2018-02-08 09:20:29 +02:00
Itai Caspi
462fe9796b several bug fixes in dashboard 2018-02-07 12:49:34 +02:00
galleibo-intel
4025496783 Setting tensorflow-gpu version to 1.4.1 (1.5.0 is not tested yet) 2018-02-05 15:48:00 +02:00
Gal Leibovich
7c8962c991 adding support for tensorboard (#52)
* bug-fix in architecture.py where additional fetches would acquire more entries than it should
* change in run_test to allow ignoring some test(s)
2018-02-05 15:21:49 +02:00
Itai Caspi
a8d5fb7bdf Added a table of contents to the README 2018-01-27 14:31:53 +02:00
Itai Caspi
522c837e76 Update README.md 2018-01-22 12:15:23 +02:00
Itai Caspi
43821c9630 adding the selu activation 2018-01-22 12:05:43 +02:00
Zach Dwiel
fff8c8f568 provide a helpful error message in the event that an exploration policy returns a vector of actions instead of a single action during value optimization agent 2018-01-20 14:11:24 -05:00
Zach Dwiel
40e5c628c6 add options for more verbose test errors 2018-01-16 22:08:46 -05:00
Zach Dwiel
8f026bb46f Merge pull request #42 from NervanaSystems/print_parameters
provide a command line option which prints the tuning_parameters to stdout
2018-01-11 11:28:24 -05:00
Zach Dwiel
c7b11f1e9a provide a command line option which prints the tuning_parameters to stdout 2018-01-10 16:28:41 -05:00
Zach Dwiel
9b963c86d0 Merge pull request #41 from NervanaSystems/allow_direct_entry_point
allow specifying gym environments via entry point syntax: module.package:class
2018-01-10 12:18:12 -05:00
Zach Dwiel
cc76a9ad70 allow specifying gym environments via entry point syntax: module.package:class 2018-01-10 10:14:23 -05:00
Itai Caspi
42f68f2e8a update the README with contact mail + small reformatting 2018-01-09 13:08:23 +02:00
Itai Caspi
eeb3ec5497 fixed the LSTM middleware initialization 2018-01-09 10:26:15 +02:00
Itai Caspi
b435c6d2d7 updated the links to the new Intel AI website 2018-01-09 10:25:06 +02:00
Zach Dwiel
499e78596a Merge pull request #38 from NervanaSystems/nec_lstm
update nec and value optimization agents to work with recurrent middleware
2018-01-08 14:01:34 -05:00
Justin
29857412b3 Add force flag to library symbolic link
Link step fails if continuing installation after interruption, requiring manual deletion of the link. Adding a force flag overrides the existing symbolic link from attempted installation in the newly created virtual environment.
2018-01-08 20:36:54 +02:00
Zach Dwiel
6c79a442f2 update nec and value optimization agents to work with recurrent middleware 2018-01-05 20:16:51 -05:00
Itai Caspi
645d9d47a9 Adding bibtex to the README 2018-01-03 21:11:57 +02:00
Itai Caspi
93a54c7e8e Added a link to the 2nd blog post 2017-12-20 17:18:49 +02:00
Itai Caspi
9e59d1960e bug fix for dumping gifs from doom 2017-12-20 13:10:34 +02:00
Zach Dwiel
37e317682b allow missing carla environment and missing matplotlib package 2017-12-20 11:47:14 +02:00
Itai Caspi
125c7ee38d Release 0.9
Main changes are detailed below:

New features -
* CARLA 0.7 simulator integration
* Human control of the game play
* Recording of human game play and storing / loading the replay buffer
* Behavioral cloning agent and presets
* Golden tests for several presets
* Selecting between deep / shallow image embedders
* Rendering through pygame (with some boost in performance)

API changes -
* Improved environment wrapper API
* Added an evaluate flag to allow convenient evaluation of existing checkpoints
* Improve frameskip definition in Gym

Bug fixes -
* Fixed loading of checkpoints for agents with more than one network
* Fixed the N Step Q learning agent python3 compatibility
v0.9.0
2017-12-19 19:27:16 +02:00
Itai Caspi
11faf19649 QR-DQN bug fix and improvements (#30)
* bug fix - QR-DQN using error instead of abs-error in the quantile huber loss

* improvement - QR-DQN sorting the quantile only once instead of batch_size times

* new feature - adding the Breakout QRDQN preset (verified to achieve good results)
2017-11-29 14:01:59 +02:00
Zach Dwiel
7bdba396d2 Update add_env.md 2017-11-14 17:57:55 +02:00
Zach Dwiel
9ae2905a76 clean up input embeddings setup 2017-11-14 17:39:18 +02:00
Itai Caspi
1ff0da2165 bug fix - fixed an issue with gifs dumping and bumped up Pillow version to 4.3.0 2017-11-13 12:22:42 +02:00
Miguel Morales
acd2b78a9e Update README.md
Fix algorithms list to be consistent with "<full name> (<acronym>)"
2017-11-12 16:00:00 +02:00
Itai Caspi
8d9ee4ea2b bug fix - fixed C51 presets hyperparameters 2017-11-10 13:22:42 +02:00
galleibo-intel
3c330768f0 Fix for NEC not saving the DND when saving a model 2017-11-09 19:13:23 +02:00
Itai Caspi
f5d645d8a6 resize training curves images 2017-11-09 09:13:12 +02:00
Itai Caspi
8ee9e46083 fixing some typos in the benchmarks README 2017-11-09 08:58:52 +02:00
Itai Caspi
c798be7bfb added training curves for some of the presets 2017-11-09 08:54:34 +02:00
cxx
84e536d371 Fix std calculation using unbiased estimation in sharing stat mode. 2017-11-07 20:19:54 +02:00
galleibo-intel
f47b8092af fix for intel optimized tensorflow on distributed runs + adding coach_env to .gitignore 2017-11-06 19:41:32 +02:00
Itai Caspi
b40259c61a bug fix - remove import warning when everything was imported successfully + changed global step api to match TF 1.4 2017-11-06 17:28:13 +02:00
Itai Caspi
fd103a7b69 updated the algorithms diagram with QR-DQN 2017-11-01 15:24:54 +02:00
Itai Caspi
a8bce9828c new feature - implementation of Quantile Regression DQN (https://arxiv.org/pdf/1710.10044v1.pdf)
API change - Distributional DQN renamed to Categorical DQN
2017-11-01 15:09:07 +02:00
Itai Caspi
1ad6262307 bug fix - correcting the evaluation exploration control parameter logging 2017-10-31 13:50:40 +02:00
Itai Caspi
e38611b9eb bug fix - updating Doom_Health_DFP and Breakout_DQN presets 2017-10-31 10:54:14 +02:00
Itai Caspi
913ab75e8a bug fix - preventing crashes when the probability of one of the actions is 0 in the policy head 2017-10-31 10:51:48 +02:00
Itai Caspi
1918f16079 improved API for getting / setting variables within the graph 2017-10-31 10:51:48 +02:00