1
0
mirror of https://github.com/gryf/coach.git synced 2026-04-20 15:11:24 +02:00
Commit Graph

110 Commits

Author SHA1 Message Date
Roman Dobosz 26a2f94f43 Added pyyaml dependecy to setup/requirements 2018-04-27 14:48:43 +02:00
Roman Dobosz 676c69e391 Moved coach to its top level module. 2018-04-27 13:25:58 +02:00
Roman Dobosz 7e61bb5685 Removed unnecessary files 2018-04-25 11:59:05 +02:00
Roman Dobosz 5c53f9be02 Added missing imports, correct usages 2018-04-24 13:33:10 +02:00
Roman Dobosz 42a9ec132d Merge branch 'master' into imports 2018-04-24 07:43:04 +02:00
Itai Caspi f31159aad6 bug fixes for carla environment (#93) 2018-04-23 11:13:24 +03:00
Itai Caspi 52eb159f69 multiple bug fixes in dealing with measurements + CartPole_DFP preset (#92) 2018-04-23 10:44:46 +03:00
itaicaspi-intel 5d5562bf62 moving the docs to github 2018-04-23 09:14:20 +03:00
Roman Dobosz 1b095aeeca Cleanup imports.
Till now, most of the modules were importing all of the module objects
(variables, classes, functions, other imports) into module namespace,
which potentially could (and was) cause of unintentional use of class or
methods, which was indirect imported.

With this patch, all the star imports were substituted with top-level
module, which provides desired class or function.

Besides, all imports where sorted (where possible) in a way pep8[1]
suggests - first are imports from standard library, than goes third
party imports (like numpy, tensorflow etc) and finally coach modules.
All of those sections are separated by one empty line.

[1] https://www.python.org/dev/peps/pep-0008/#imports
2018-04-13 09:58:40 +02:00
jtoy cafa152382 update requirements to have valid tornado version (#84) 2018-04-02 14:21:35 +03:00
Itai Caspi a7206ed702 Multiple improvements and bug fixes (#66)
* Multiple improvements and bug fixes:

    * Using lazy stacking to save on memory when using a replay buffer
    * Remove step counting for evaluation episodes
    * Reset game between heatup and training
    * Major bug fixes in NEC (is reproducing the paper results for pong now)
    * Image input rescaling to 0-1 is now optional
    * Change the terminal title to be the experiment name
    * Observation cropping for atari is now optional
    * Added random number of noop actions for gym to match the dqn paper
    * Fixed a bug where the evaluation episodes won't start with the max possible ale lives
    * Added a script for plotting the results of an experiment over all the atari games
2018-02-26 12:29:07 +02:00
Zach Dwiel 4fe9cba445 remove debug 2018-02-21 10:05:57 -05:00
Zach Dwiel eba900067c remove debug 2018-02-21 10:05:57 -05:00
Zach Dwiel d1bf83047c remove debug 2018-02-21 10:05:57 -05:00
Zach Dwiel ef46e194af remove unused commented code 2018-02-21 10:05:57 -05:00
Zach Dwiel d9303e731e remove python2 compatibility 2018-02-21 10:05:57 -05:00
Zach Dwiel ec68bd4959 make sure that for now observation spaces all include an observation key 2018-02-21 10:05:57 -05:00
Zach Dwiel 0740ebcdac by default assume state["observation"] is where the image for rendering can be found 2018-02-21 10:05:57 -05:00
Zach Dwiel f9f92a42fd cleanup debugging code 2018-02-21 10:05:57 -05:00
Zach Dwiel 86362683b1 comment 2018-02-21 10:05:57 -05:00
Zach Dwiel 8fc24a2bbe fix bc_agent 2018-02-21 10:05:57 -05:00
Zach Dwiel d8f5a35013 fix qr_dqn_agent 2018-02-21 10:05:57 -05:00
Zach Dwiel e1ad86417f fix n_step_q_agent 2018-02-21 10:05:57 -05:00
Zach Dwiel 5cf10e5f52 fix bug in ddpg 2018-02-21 10:05:57 -05:00
Zach Dwiel 8248caf35e fix more agents 2018-02-21 10:05:57 -05:00
Zach Dwiel 98f57a0d87 fix ddpg 2018-02-21 10:05:57 -05:00
Zach Dwiel 943e41ba58 fix nec_agent 2018-02-21 10:05:57 -05:00
Zach Dwiel ee6e0bdc3b fix keep_dims -> keepdims 2018-02-21 10:05:57 -05:00
Zach Dwiel 39a28aba95 fix clipped ppo 2018-02-21 10:05:57 -05:00
Zach Dwiel 85afb86893 temp commit 2018-02-21 10:05:57 -05:00
Gal Leibovich 16c5032735 fix for tensorboard visualization slowing execution even when it is off
apparently tensorflow still collect summary data even when no summary FileWriter is defined.
2018-02-18 16:35:24 +02:00
Itai Caspi 72d34f4063 adding a flag to prevent summary 2018-02-15 13:47:14 +02:00
Itai Caspi 55c8c87afc allow visualizing the observation + bug fixes to coach summary 2018-02-15 13:47:14 +02:00
Itai Caspi 5d1a2bc392 Adding a summary when exiting coach 2018-02-13 11:11:26 +02:00
Itai Caspi ba96e585d2 appending csv's from logger instead of rewriting them 2018-02-12 14:52:50 +02:00
Itai Caspi 569ca39ce6 Dashboard color selection + removing old legend 2018-02-09 14:52:58 +02:00
Itai Caspi 8a4383e86f Added an improved legend to dashboard 2018-02-08 16:48:46 +02:00
Itai Caspi b071599cb0 updating intel optimized tensorflow to version 1.4 2018-02-08 09:20:29 +02:00
Itai Caspi 462fe9796b several bug fixes in dashboard 2018-02-07 12:49:34 +02:00
galleibo-intel 4025496783 Setting tensorflow-gpu version to 1.4.1 (1.5.0 is not tested yet) 2018-02-05 15:48:00 +02:00
Gal Leibovich 7c8962c991 adding support in tensorboard (#52)
* bug-fix in architecture.py where additional fetches would acquire more entries than it should
* change in run_test to allow ignoring some test(s)
2018-02-05 15:21:49 +02:00
Itai Caspi a8d5fb7bdf Added a table of contents to the README 2018-01-27 14:31:53 +02:00
Itai Caspi 522c837e76 Update README.md 2018-01-22 12:15:23 +02:00
Itai Caspi 43821c9630 adding the selu activation 2018-01-22 12:05:43 +02:00
Zach Dwiel fff8c8f568 provide a helpful error message in the event that an exploration policy returns a vector of actions instead of a single action during value optimization agent 2018-01-20 14:11:24 -05:00
Zach Dwiel 40e5c628c6 add options for more verbose test errors 2018-01-16 22:08:46 -05:00
Zach Dwiel 8f026bb46f Merge pull request #42 from NervanaSystems/print_parameters
provide a command line option which prints the tuning_parameters to stdout
2018-01-11 11:28:24 -05:00
Zach Dwiel c7b11f1e9a provide a command line option which prints the tuning_parameters to stdout 2018-01-10 16:28:41 -05:00
Zach Dwiel 9b963c86d0 Merge pull request #41 from NervanaSystems/allow_direct_entry_point
allow specifying gym environments via entry point syntax: module.package:class
2018-01-10 12:18:12 -05:00
Zach Dwiel cc76a9ad70 allow specifying gym environments via entry point syntax: module.package:class 2018-01-10 10:14:23 -05:00