Zach Dwiel
ee6e0bdc3b
fix keep_dims -> keepdims
2018-02-21 10:05:57 -05:00
Zach Dwiel
39a28aba95
fix clipped ppo
2018-02-21 10:05:57 -05:00
Zach Dwiel
85afb86893
temp commit
2018-02-21 10:05:57 -05:00
Gal Leibovich
16c5032735
fix for tensorboard visualization slowing execution even when it is off
...
apparently tensorflow still collect summary data even when no summary FileWriter is defined.
2018-02-18 16:35:24 +02:00
Itai Caspi
72d34f4063
adding a flag to prevent summary
2018-02-15 13:47:14 +02:00
Itai Caspi
55c8c87afc
allow visualizing the observation + bug fixes to coach summary
2018-02-15 13:47:14 +02:00
Itai Caspi
5d1a2bc392
Adding a summary when exiting coach
2018-02-13 11:11:26 +02:00
Itai Caspi
ba96e585d2
appending csv's from logger instead of rewriting them
2018-02-12 14:52:50 +02:00
Itai Caspi
569ca39ce6
Dashboard color selection + removing old legend
2018-02-09 14:52:58 +02:00
Itai Caspi
8a4383e86f
Added an improved legend to dashboard
2018-02-08 16:48:46 +02:00
Itai Caspi
b071599cb0
updating intel optimized tensorflow to version 1.4
2018-02-08 09:20:29 +02:00
Itai Caspi
462fe9796b
several bug fixes in dashboard
2018-02-07 12:49:34 +02:00
galleibo-intel
4025496783
Setting tensorflow-gpu version to 1.4.1 (1.5.0 is not tested yet)
2018-02-05 15:48:00 +02:00
Gal Leibovich
7c8962c991
adding support in tensorboard ( #52 )
...
* bug-fix in architecture.py where additional fetches would acquire more entries than it should
* change in run_test to allow ignoring some test(s)
2018-02-05 15:21:49 +02:00
Itai Caspi
a8d5fb7bdf
Added a table of contents to the README
2018-01-27 14:31:53 +02:00
Itai Caspi
522c837e76
Update README.md
2018-01-22 12:15:23 +02:00
Itai Caspi
43821c9630
adding the selu activation
2018-01-22 12:05:43 +02:00
Zach Dwiel
fff8c8f568
provide a helpful error message in the event that an exploration policy returns a vector of actions instead of a single action during value optimization agent
2018-01-20 14:11:24 -05:00
Zach Dwiel
40e5c628c6
add options for more verbose test errors
2018-01-16 22:08:46 -05:00
Zach Dwiel
8f026bb46f
Merge pull request #42 from NervanaSystems/print_parameters
...
provide a command line option which prints the tuning_parameters to stdout
2018-01-11 11:28:24 -05:00
Zach Dwiel
c7b11f1e9a
provide a command line option which prints the tuning_parameters to stdout
2018-01-10 16:28:41 -05:00
Zach Dwiel
9b963c86d0
Merge pull request #41 from NervanaSystems/allow_direct_entry_point
...
allow specifying gym environments via entry point syntax: module.package:class
2018-01-10 12:18:12 -05:00
Zach Dwiel
cc76a9ad70
allow specifying gym environments via entry point syntax: module.package:class
2018-01-10 10:14:23 -05:00
Itai Caspi
42f68f2e8a
update the README with contact mail + small reformatting
2018-01-09 13:08:23 +02:00
Itai Caspi
eeb3ec5497
fixed the LSTM middleware initialization
2018-01-09 10:26:15 +02:00
Itai Caspi
b435c6d2d7
updated the links to the new Intel AI website
2018-01-09 10:25:06 +02:00
Zach Dwiel
499e78596a
Merge pull request #38 from NervanaSystems/nec_lstm
...
update nec and value optimization agents to work with recurrent middleware
2018-01-08 14:01:34 -05:00
Justin
29857412b3
Add force flag to library symbolic link
...
Link step fails if continuing installation after interruption, requiring manual deletion of the link. Adding a force flag overrides the existing symbolic link from attempted installation in the newly created virtual environment.
2018-01-08 20:36:54 +02:00
Zach Dwiel
6c79a442f2
update nec and value optimization agents to work with recurrent middleware
2018-01-05 20:16:51 -05:00
Itai Caspi
645d9d47a9
Adding bibtex to the README
2018-01-03 21:11:57 +02:00
Itai Caspi
93a54c7e8e
Added a link to the 2nd blog post
2017-12-20 17:18:49 +02:00
Itai Caspi
9e59d1960e
bug fix for dumping gifs from doom
2017-12-20 13:10:34 +02:00
Zach Dwiel
37e317682b
allow missing carla environment and missing matplotlib package
2017-12-20 11:47:14 +02:00
Itai Caspi
125c7ee38d
Release 0.9
...
Main changes are detailed below:
New features -
* CARLA 0.7 simulator integration
* Human control of the game play
* Recording of human game play and storing / loading the replay buffer
* Behavioral cloning agent and presets
* Golden tests for several presets
* Selecting between deep / shallow image embedders
* Rendering through pygame (with some boost in performance)
API changes -
* Improved environment wrapper API
* Added an evaluate flag to allow convenient evaluation of existing checkpoints
* Improve frameskip definition in Gym
Bug fixes -
* Fixed loading of checkpoints for agents with more than one network
* Fixed the N Step Q learning agent python3 compatibility
v0.9.0
2017-12-19 19:27:16 +02:00
Itai Caspi
11faf19649
QR-DQN bug fix and imporvements ( #30 )
...
* bug fix - QR-DQN using error instead of abs-error in the quantile huber loss
* improvement - QR-DQN sorting the quantile only once instead of batch_size times
* new feature - adding the Breakout QRDQN preset (verified to achieve good results)
2017-11-29 14:01:59 +02:00
Zach Dwiel
7bdba396d2
Update add_env.md
2017-11-14 17:57:55 +02:00
Zach Dwiel
9ae2905a76
clean up input embeddings setup
2017-11-14 17:39:18 +02:00
Itai Caspi
1ff0da2165
bug fix - fixed an issue with gifs dumping and bumped up Pillow version to 4.3.0
2017-11-13 12:22:42 +02:00
Miguel Morales
acd2b78a9e
Update README.md
...
Fix algorithms list to be consistent with "<full name> (<acronym>)"
2017-11-12 16:00:00 +02:00
Itai Caspi
8d9ee4ea2b
bug fix - fixed C51 presets hyperparameters
2017-11-10 13:22:42 +02:00
galleibo-intel
3c330768f0
Fix for NEC not saving the DND when saving a model
2017-11-09 19:13:23 +02:00
Itai Caspi
f5d645d8a6
resize training curves images
2017-11-09 09:13:12 +02:00
Itai Caspi
8ee9e46083
fixing some typos in the benchmarks README
2017-11-09 08:58:52 +02:00
Itai Caspi
c798be7bfb
added training curves for some of the presets
2017-11-09 08:54:34 +02:00
cxx
84e536d371
Fix std calculation using unbiased estimation in sharing stat mode.
2017-11-07 20:19:54 +02:00
galleibo-intel
f47b8092af
fix for intel optimized tensorflow on distributed runs + adding coach_env to .gitignore
2017-11-06 19:41:32 +02:00
Itai Caspi
b40259c61a
bug fix - remove import warning when everything was imported successfully + changed global step api to match TF 1.4
2017-11-06 17:28:13 +02:00
Itai Caspi
fd103a7b69
updated the algorithms diagram with QR-DQN
2017-11-01 15:24:54 +02:00
Itai Caspi
a8bce9828c
new feature - implementation of Quantile Regression DQN ( https://arxiv.org/pdf/1710.10044v1.pdf )
...
API change - Distributional DQN renamed to Categorical DQN
2017-11-01 15:09:07 +02:00
Itai Caspi
1ad6262307
bug fix - correcting the evaluation exploration control parameter logging
2017-10-31 13:50:40 +02:00