1
0
mirror of https://github.com/gryf/coach.git synced 2026-04-20 23:41:24 +02:00
Commit Graph

223 Commits

Author SHA1 Message Date
Itai Caspi 0be4a42701 updates needed for the pip package 2018-08-19 10:39:03 +03:00
Itai Caspi e2e8143b94 additional benchmarks for dqn and a3c 2018-08-18 15:21:50 +03:00
Itai Caspi 2d5688c737 additional benchmarks for a3c and dqn 2018-08-16 20:01:35 +03:00
Itai Caspi 1de04d6fee updated gifs in README + fix for multiworker crashes + improved Atari DQN and Dueling DDQN presets 2018-08-16 18:23:32 +03:00
Gal Leibovich 8f99409387 updating algorithms.png for README 2018-08-16 16:46:26 +03:00
Gal Leibovich ab5a81c7ee fix for dumping movies, without rendering, for pendulum_with_goals 2018-08-14 18:13:44 +03:00
Gal Leibovich e783157b15 Update README.md 2018-08-14 16:16:41 +03:00
Itai Caspi 824fdeee59 Update README with new coach aliases 2018-08-14 14:36:41 +03:00
Gal Leibovich 7a76d63da4 Update README.md 2018-08-13 17:19:47 +03:00
Gal Novik 19ca5c24b1 pre-release 0.10.0 2018-08-13 17:11:34 +03:00
Itai Caspi d44c329bb8 Update README.md 2018-06-25 17:46:01 +03:00
Itai Caspi cfd4fe0faf Update README.md 2018-06-25 17:43:15 +03:00
Gal Leibovich 2807c29f27 fix for measurements in the initial state (fix for DFP) 2018-05-29 16:47:38 +03:00
itaicaspi-intel 7725dabc86 checkpoints bug fix 2018-05-26 17:49:13 +03:00
itaicaspi-intel 462c6e314b bug fix in nec checkpoint saving 2018-05-24 15:15:33 +03:00
Itai Caspi d302168c8c Parallel agents fixes (#95)
* Parallel agents related bug fixes: checkpoint restore, tensorboard integration.
Adding narrow networks support.
Reference code for unlimited number of checkpoints
2018-05-24 14:24:19 +03:00
itaicaspi-intel 6c0b59b4de constraining gym installation to version 0.9.4 2018-05-22 11:01:58 +03:00
itaicaspi-intel a57b7004a8 updating dashboard 2018-05-09 09:26:15 +03:00
Gal Novik dafdb05a7c bug fixes for clippedppo and checkpoints 2018-04-30 15:13:29 +03:00
Itai Caspi f31159aad6 bug fixes for carla environment (#93) 2018-04-23 11:13:24 +03:00
Itai Caspi 52eb159f69 multiple bug fixes in dealing with measurements + CartPole_DFP preset (#92) 2018-04-23 10:44:46 +03:00
itaicaspi-intel 5d5562bf62 moving the docs to github 2018-04-23 09:14:20 +03:00
jtoy cafa152382 update requirements to have valid tornado version (#84) 2018-04-02 14:21:35 +03:00
Itai Caspi a7206ed702 Multiple improvements and bug fixes (#66)
* Multiple improvements and bug fixes:

    * Using lazy stacking to save on memory when using a replay buffer
    * Remove step counting for evaluation episodes
    * Reset game between heatup and training
    * Major bug fixes in NEC (is reproducing the paper results for pong now)
    * Image input rescaling to 0-1 is now optional
    * Change the terminal title to be the experiment name
    * Observation cropping for atari is now optional
    * Added random number of noop actions for gym to match the dqn paper
    * Fixed a bug where the evaluation episodes won't start with the max possible ale lives
    * Added a script for plotting the results of an experiment over all the atari games
2018-02-26 12:29:07 +02:00
Zach Dwiel 4fe9cba445 remove debug 2018-02-21 10:05:57 -05:00
Zach Dwiel eba900067c remove debug 2018-02-21 10:05:57 -05:00
Zach Dwiel d1bf83047c remove debug 2018-02-21 10:05:57 -05:00
Zach Dwiel ef46e194af remove unused commented code 2018-02-21 10:05:57 -05:00
Zach Dwiel d9303e731e remove python2 compatibility 2018-02-21 10:05:57 -05:00
Zach Dwiel ec68bd4959 make sure that for now observation spaces all include an observation key 2018-02-21 10:05:57 -05:00
Zach Dwiel 0740ebcdac by default assume state["observation"] is where the image for rendering can be found 2018-02-21 10:05:57 -05:00
Zach Dwiel f9f92a42fd cleanup debugging code 2018-02-21 10:05:57 -05:00
Zach Dwiel 86362683b1 comment 2018-02-21 10:05:57 -05:00
Zach Dwiel 8fc24a2bbe fix bc_agent 2018-02-21 10:05:57 -05:00
Zach Dwiel d8f5a35013 fix qr_dqn_agent 2018-02-21 10:05:57 -05:00
Zach Dwiel e1ad86417f fix n_step_q_agent 2018-02-21 10:05:57 -05:00
Zach Dwiel 5cf10e5f52 fix bug in ddpg 2018-02-21 10:05:57 -05:00
Zach Dwiel 8248caf35e fix more agents 2018-02-21 10:05:57 -05:00
Zach Dwiel 98f57a0d87 fix ddpg 2018-02-21 10:05:57 -05:00
Zach Dwiel 943e41ba58 fix nec_agent 2018-02-21 10:05:57 -05:00
Zach Dwiel ee6e0bdc3b fix keep_dims -> keepdims 2018-02-21 10:05:57 -05:00
Zach Dwiel 39a28aba95 fix clipped ppo 2018-02-21 10:05:57 -05:00
Zach Dwiel 85afb86893 temp commit 2018-02-21 10:05:57 -05:00
Gal Leibovich 16c5032735 fix for tensorboard visualization slowing execution even when it is off
apparently tensorflow still collect summary data even when no summary FileWriter is defined.
2018-02-18 16:35:24 +02:00
Itai Caspi 72d34f4063 adding a flag to prevent summary 2018-02-15 13:47:14 +02:00
Itai Caspi 55c8c87afc allow visualizing the observation + bug fixes to coach summary 2018-02-15 13:47:14 +02:00
Itai Caspi 5d1a2bc392 Adding a summary when exiting coach 2018-02-13 11:11:26 +02:00
Itai Caspi ba96e585d2 appending csv's from logger instead of rewriting them 2018-02-12 14:52:50 +02:00
Itai Caspi 569ca39ce6 Dashboard color selection + removing old legend 2018-02-09 14:52:58 +02:00
Itai Caspi 8a4383e86f Added an improved legend to dashboard 2018-02-08 16:48:46 +02:00