* reordering of the episode reset operation and allowing episodes to be stored only once they terminate
* revert tensorflow-gpu to 1.9.0 + bug fix in should_train()
* added a README file for the tests and refactored the policy optimization agent's train function
* Update README.md
* additional policy optimization train function simplifications
* Updated the traces after the reordering of the environment reset
* added Docker and Jenkins files
* updated the traces to the ones generated from within the Docker container
* updated traces and added the control suite to the Docker image
* updated the Jenkins file with the Intel proxy + updated the Doom basic A3C test params
* updated line breaks in the Jenkins file
* added a missing line break in the Jenkins file
* refining the trace tests' ignored presets + adding a configurable entropy beta value (see the entropy sketch after this list)
* switched the order of trace and golden tests in Jenkins + fixed an issue where golden test processes were not killed
* updated benchmarks for Dueling DDQN on Breakout and Pong
* allowing dynamic updates to the loss weights + bug fix in episode.update_returns
* removed the Docker and Jenkins files
* Multiple improvements and bug fixes:
  * Using lazy stacking to save memory when using a replay buffer (see the sketch after this list)
  * Removed step counting for evaluation episodes
  * Reset the game between heatup and training
  * Major bug fixes in NEC (now reproduces the paper results for Pong)
  * Image input rescaling to 0-1 is now optional
  * Changed the terminal title to be the experiment name
  * Observation cropping for Atari is now optional
  * Added a random number of no-op actions on reset for Gym, to match the DQN paper (see the sketch after this list)
  * Fixed a bug where evaluation episodes wouldn't start with the max possible ALE lives
  * Added a script for plotting the results of an experiment over all the Atari games
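
For context on the configurable entropy beta: in policy optimization methods such as A3C, an entropy bonus scaled by a coefficient (the "beta" above) is subtracted from the policy loss to discourage premature collapse to a deterministic policy. A minimal sketch in TensorFlow 1.x, matching the tensorflow-gpu 1.9.0 pin above; the function and tensor names are illustrative, not Coach's actual API:

```python
import tensorflow as tf

def policy_loss_with_entropy(log_probs, advantages, probs, beta=0.01):
    """Illustrative policy-gradient loss with a configurable entropy bonus.

    log_probs:  log-probabilities of the actions that were taken
    advantages: estimated advantages for those actions
    probs:      full action distribution, used to compute the entropy
    beta:       entropy coefficient; 0.01 is a common default (assumed here)
    """
    # Standard policy-gradient surrogate objective (maximized, so negated)
    pg_loss = -tf.reduce_mean(log_probs * tf.stop_gradient(advantages))
    # Policy entropy; a larger beta pushes toward more exploratory policies
    entropy = -tf.reduce_sum(probs * tf.log(probs + 1e-10), axis=-1)
    return pg_loss - beta * tf.reduce_mean(entropy)
```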
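
The lazy stacking item refers to the common trick of keeping references to individual frames in the replay buffer and materializing the stacked observation only when it is consumed, so each raw frame is stored once rather than once per stack it appears in. A rough sketch of the idea, assuming NumPy observations; `LazyStack` is a hypothetical name, not Coach's actual class:

```python
import numpy as np

class LazyStack(object):
    """Holds references to k frames and stacks them only on demand."""

    def __init__(self, frames):
        self._frames = list(frames)  # references only, no copies

    def __array__(self, dtype=None):
        # np.asarray(lazy_stack) triggers this, materializing the stack
        stacked = np.stack(self._frames, axis=-1)
        if dtype is not None:
            stacked = stacked.astype(dtype)
        return stacked

# Usage sketch: store LazyStack(last_4_frames) in the replay buffer and call
# np.asarray(...) only when sampling a batch to feed the network.
```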
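
The random no-op starts item follows the protocol from the DQN paper, where each episode begins with up to 30 no-op actions so the agent sees varied initial states. A sketch as a Gym wrapper using the classic (pre-0.26) Gym step API; the wrapper name and defaults are assumptions about the change, not the exact code added:

```python
import random
import gym

class NoopStartEnv(gym.Wrapper):
    """On reset, performs a random number of no-op actions (DQN paper style)."""

    def __init__(self, env, noop_max=30, noop_action=0):
        super(NoopStartEnv, self).__init__(env)
        self.noop_max = noop_max
        self.noop_action = noop_action

    def reset(self, **kwargs):
        obs = self.env.reset(**kwargs)
        for _ in range(random.randint(1, self.noop_max)):
            obs, _, done, _ = self.env.step(self.noop_action)
            if done:  # rare, but restart if the no-ops end the episode
                obs = self.env.reset(**kwargs)
        return obs
```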