1
0
mirror of https://github.com/gryf/coach.git synced 2025-12-17 19:20:19 +01:00
Commit Graph

10 Commits

Author SHA1 Message Date
Itai Caspi
a7206ed702 Multiple improvements and bug fixes (#66)
* Multiple improvements and bug fixes:

    * Using lazy stacking to save on memory when using a replay buffer
    * Remove step counting for evaluation episodes
    * Reset game between heatup and training
    * Major bug fixes in NEC (is reproducing the paper results for pong now)
    * Image input rescaling to 0-1 is now optional
    * Change the terminal title to be the experiment name
    * Observation cropping for atari is now optional
    * Added random number of noop actions for gym to match the dqn paper
    * Fixed a bug where the evaluation episodes won't start with the max possible ale lives
    * Added a script for plotting the results of an experiment over all the atari games
2018-02-26 12:29:07 +02:00
Zach Dwiel
f9f92a42fd cleanup debugging code 2018-02-21 10:05:57 -05:00
Zach Dwiel
85afb86893 temp commit 2018-02-21 10:05:57 -05:00
Itai Caspi
55c8c87afc allow visualizing the observation + bug fixes to coach summary 2018-02-15 13:47:14 +02:00
Itai Caspi
5d1a2bc392 Adding a summary when exiting coach 2018-02-13 11:11:26 +02:00
Itai Caspi
ba96e585d2 appending csv's from logger instead of rewriting them 2018-02-12 14:52:50 +02:00
Itai Caspi
125c7ee38d Release 0.9
Main changes are detailed below:

New features -
* CARLA 0.7 simulator integration
* Human control of the game play
* Recording of human game play and storing / loading the replay buffer
* Behavioral cloning agent and presets
* Golden tests for several presets
* Selecting between deep / shallow image embedders
* Rendering through pygame (with some boost in performance)

API changes -
* Improved environment wrapper API
* Added an evaluate flag to allow convenient evaluation of existing checkpoints
* Improve frameskip definition in Gym

Bug fixes -
* Fixed loading of checkpoints for agents with more than one network
* Fixed the N Step Q learning agent python3 compatibility
2017-12-19 19:27:16 +02:00
Itai Caspi
1ff0da2165 bug fix - fixed an issue with gifs dumping and bumped up Pillow version to 4.3.0 2017-11-13 12:22:42 +02:00
cxx
f43c951c2d Unify base class using new-style (object). 2017-10-26 12:33:09 +03:00
Gal Leibovich
1d4c3455e7 coach v0.8.0 2017-10-19 13:10:15 +03:00