mirror of https://github.com/gryf/coach.git synced 2025-12-18 03:30:19 +01:00

102 Commits

Author SHA1 Message Date
itaicaspi-intel
5d5562bf62 moving the docs to github 2018-04-23 09:14:20 +03:00
jtoy
cafa152382 update requirements to have valid tornado version (#84) 2018-04-02 14:21:35 +03:00
Itai Caspi
a7206ed702 Multiple improvements and bug fixes (#66)
* Multiple improvements and bug fixes:

    * Using lazy stacking to save on memory when using a replay buffer
    * Remove step counting for evaluation episodes
    * Reset game between heatup and training
    * Major bug fixes in NEC (now reproduces the paper results for Pong)
    * Image input rescaling to 0-1 is now optional
    * Change the terminal title to be the experiment name
    * Observation cropping for atari is now optional
    * Added random number of noop actions for gym to match the dqn paper
    * Fixed a bug where the evaluation episodes won't start with the max possible ale lives
    * Added a script for plotting the results of an experiment over all the atari games
2018-02-26 12:29:07 +02:00
Zach Dwiel
4fe9cba445 remove debug 2018-02-21 10:05:57 -05:00
Zach Dwiel
eba900067c remove debug 2018-02-21 10:05:57 -05:00
Zach Dwiel
d1bf83047c remove debug 2018-02-21 10:05:57 -05:00
Zach Dwiel
ef46e194af remove unused commented code 2018-02-21 10:05:57 -05:00
Zach Dwiel
d9303e731e remove python2 compatibility 2018-02-21 10:05:57 -05:00
Zach Dwiel
ec68bd4959 make sure that for now observation spaces all include an observation key 2018-02-21 10:05:57 -05:00
Zach Dwiel
0740ebcdac by default assume state["observation"] is where the image for rendering can be found 2018-02-21 10:05:57 -05:00
Zach Dwiel
f9f92a42fd cleanup debugging code 2018-02-21 10:05:57 -05:00
Zach Dwiel
86362683b1 comment 2018-02-21 10:05:57 -05:00
Zach Dwiel
8fc24a2bbe fix bc_agent 2018-02-21 10:05:57 -05:00
Zach Dwiel
d8f5a35013 fix qr_dqn_agent 2018-02-21 10:05:57 -05:00
Zach Dwiel
e1ad86417f fix n_step_q_agent 2018-02-21 10:05:57 -05:00
Zach Dwiel
5cf10e5f52 fix bug in ddpg 2018-02-21 10:05:57 -05:00
Zach Dwiel
8248caf35e fix more agents 2018-02-21 10:05:57 -05:00
Zach Dwiel
98f57a0d87 fix ddpg 2018-02-21 10:05:57 -05:00
Zach Dwiel
943e41ba58 fix nec_agent 2018-02-21 10:05:57 -05:00
Zach Dwiel
ee6e0bdc3b fix keep_dims -> keepdims 2018-02-21 10:05:57 -05:00
Zach Dwiel
39a28aba95 fix clipped ppo 2018-02-21 10:05:57 -05:00
Zach Dwiel
85afb86893 temp commit 2018-02-21 10:05:57 -05:00
Gal Leibovich
16c5032735 fix for tensorboard visualization slowing execution even when it is off
Apparently TensorFlow still collects summary data even when no summary FileWriter is defined.
2018-02-18 16:35:24 +02:00
Itai Caspi
72d34f4063 adding a flag to prevent summary 2018-02-15 13:47:14 +02:00
Itai Caspi
55c8c87afc allow visualizing the observation + bug fixes to coach summary 2018-02-15 13:47:14 +02:00
Itai Caspi
5d1a2bc392 Adding a summary when exiting coach 2018-02-13 11:11:26 +02:00
Itai Caspi
ba96e585d2 appending csv's from logger instead of rewriting them 2018-02-12 14:52:50 +02:00
Itai Caspi
569ca39ce6 Dashboard color selection + removing old legend 2018-02-09 14:52:58 +02:00
Itai Caspi
8a4383e86f Added an improved legend to dashboard 2018-02-08 16:48:46 +02:00
Itai Caspi
b071599cb0 updating intel optimized tensorflow to version 1.4 2018-02-08 09:20:29 +02:00
Itai Caspi
462fe9796b several bug fixes in dashboard 2018-02-07 12:49:34 +02:00
galleibo-intel
4025496783 Setting tensorflow-gpu version to 1.4.1 (1.5.0 is not tested yet) 2018-02-05 15:48:00 +02:00
Gal Leibovich
7c8962c991 adding support in tensorboard (#52)
* bug-fix in architecture.py where additional fetches would acquire more entries than it should
* change in run_test to allow ignoring some test(s)
2018-02-05 15:21:49 +02:00
Itai Caspi
a8d5fb7bdf Added a table of contents to the README 2018-01-27 14:31:53 +02:00
Itai Caspi
522c837e76 Update README.md 2018-01-22 12:15:23 +02:00
Itai Caspi
43821c9630 adding the selu activation 2018-01-22 12:05:43 +02:00
Zach Dwiel
fff8c8f568 provide a helpful error message in the event that an exploration policy returns a vector of actions instead of a single action during value optimization agent 2018-01-20 14:11:24 -05:00
Zach Dwiel
40e5c628c6 add options for more verbose test errors 2018-01-16 22:08:46 -05:00
Zach Dwiel
8f026bb46f Merge pull request #42 from NervanaSystems/print_parameters
provide a command line option which prints the tuning_parameters to stdout
2018-01-11 11:28:24 -05:00
Zach Dwiel
c7b11f1e9a provide a command line option which prints the tuning_parameters to stdout 2018-01-10 16:28:41 -05:00
Zach Dwiel
9b963c86d0 Merge pull request #41 from NervanaSystems/allow_direct_entry_point
allow specifying gym environments via entry point syntax: module.package:class
2018-01-10 12:18:12 -05:00
Zach Dwiel
cc76a9ad70 allow specifying gym environments via entry point syntax: module.package:class 2018-01-10 10:14:23 -05:00
Itai Caspi
42f68f2e8a update the README with contact mail + small reformatting 2018-01-09 13:08:23 +02:00
Itai Caspi
eeb3ec5497 fixed the LSTM middleware initialization 2018-01-09 10:26:15 +02:00
Itai Caspi
b435c6d2d7 updated the links to the new Intel AI website 2018-01-09 10:25:06 +02:00
Zach Dwiel
499e78596a Merge pull request #38 from NervanaSystems/nec_lstm
update nec and value optimization agents to work with recurrent middleware
2018-01-08 14:01:34 -05:00
Justin
29857412b3 Add force flag to library symbolic link
Link step fails if continuing installation after interruption, requiring manual deletion of the link. Adding a force flag overrides the existing symbolic link from attempted installation in the newly created virtual environment.
2018-01-08 20:36:54 +02:00
Zach Dwiel
6c79a442f2 update nec and value optimization agents to work with recurrent middleware 2018-01-05 20:16:51 -05:00
Itai Caspi
645d9d47a9 Adding bibtex to the README 2018-01-03 21:11:57 +02:00
Itai Caspi
93a54c7e8e Added a link to the 2nd blog post 2017-12-20 17:18:49 +02:00