1
0
mirror of https://github.com/gryf/coach.git synced 2025-12-17 19:20:19 +01:00
Commit Graph

14 Commits

Author SHA1 Message Date
Itai Caspi
d59a700248 updated benchmarks for pong and breakout for dueling ddqn with PER 2018-09-06 14:05:46 +03:00
Gal Leibovich
08a557bfd1 updated the benchmarks for space invaders with dueling ddqn variants 2018-09-06 12:13:49 +03:00
Itai Caspi
72a1d9d426 Itaicaspi/episode reset refactoring (#105)
* reordering of the episode reset operation and allowing to store episodes only when they are terminated

* reordering of the episode reset operation and allowing to store episodes only when they are terminated

* revert tensorflow-gpu to 1.9.0 + bug fix in should_train()

* tests readme file and refactoring of policy optimization agent train function

* Update README.md

* Update README.md

* additional policy optimization train function simplifications

* Updated the traces after the reordering of the environment reset

* docker and jenkins files

* updated the traces to the ones from within the docker container

* updated traces and added control suite to the docker

* updated jenkins file with the intel proxy + updated doom basic a3c test params

* updated line breaks in jenkins file

* added a missing line break in jenkins file

* refining trace tests ignored presets + adding a configurable beta entropy value

* switch the order of trace and golden tests in jenkins + fix golden tests processes not killed issue

* updated benchmarks for dueling ddqn breakout and pong

* allowing dynamic updates to the loss weights + bug fix in episode.update_returns

* remove docker and jenkins file
2018-09-04 15:07:54 +03:00
Shadi Endrawis
07db625987 Running trace tests in parallel + other small fixes 2018-08-30 19:35:10 +03:00
Itai Caspi
c5165cd7d6 benchmarks and pip package updates 2018-08-19 14:23:20 +03:00
Gal Leibovich
23d2945bf8 Update README.md 2018-08-19 11:02:45 +03:00
Itai Caspi
e2e8143b94 additional benchmarks for dqn and a3c 2018-08-18 15:21:50 +03:00
Itai Caspi
2d5688c737 additional benchmarks for a3c and dqn 2018-08-16 20:01:35 +03:00
Itai Caspi
1de04d6fee updated gifs in README + fix for multiworker crashes + improved Atari DQN and Dueling DDQN presets 2018-08-16 18:23:32 +03:00
Gal Novik
19ca5c24b1 pre-release 0.10.0 2018-08-13 17:11:34 +03:00
Itai Caspi
a7206ed702 Multiple improvements and bug fixes (#66)
* Multiple improvements and bug fixes:

    * Using lazy stacking to save on memory when using a replay buffer
    * Remove step counting for evaluation episodes
    * Reset game between heatup and training
    * Major bug fixes in NEC (is reproducing the paper results for pong now)
    * Image input rescaling to 0-1 is now optional
    * Change the terminal title to be the experiment name
    * Observation cropping for atari is now optional
    * Added random number of noop actions for gym to match the dqn paper
    * Fixed a bug where the evaluation episodes won't start with the max possible ale lives
    * Added a script for plotting the results of an experiment over all the atari games
2018-02-26 12:29:07 +02:00
Itai Caspi
f5d645d8a6 resize training curves images 2017-11-09 09:13:12 +02:00
Itai Caspi
8ee9e46083 fixing some typos in the benchmarks README 2017-11-09 08:58:52 +02:00
Itai Caspi
c798be7bfb added training curves for some of the presets 2017-11-09 08:54:34 +02:00