coach

gryf/coach

mirror of https://github.com/gryf/coach.git synced 2026-02-01 21:35:45 +01:00

Author	SHA1	Message	Date
Gal Leibovich	d6795bd524	batchnorm fixes + disabling batchnorm in DDPG (#353 ) Co-authored-by: James Casbon <casbon+gh@gmail.com>	2019-06-23 11:28:22 +03:00
Gal Leibovich	7eb884c5b2	TD3 (#338 )	2019-06-16 11:11:21 +03:00
guyk1971	74db141d5e	SAC algorithm (#282 ) * SAC algorithm * SAC - updates to agent (learn_from_batch), sac_head and sac_q_head to fix problem in gradient calculation. Now SAC agents is able to train. gym_environment - fixing an error in access to gym.spaces * Soft Actor Critic - code cleanup * code cleanup * V-head initialization fix * SAC benchmarks * SAC Documentation * typo fix * documentation fixes * documentation and version update * README typo	2019-05-01 18:37:49 +03:00
shadiendrawis	f03bd7ad93	benchmark update (#250 )	2019-03-17 15:33:28 +02:00
shadiendrawis	2b5d1dabe6	ACER algorithm (#184 ) * initial ACER commit * Code cleanup + several fixes * Q-retrace bug fix + small clean-ups * added documentation for acer * ACER benchmarks * update benchmarks table * Add nightly running of golden and trace tests. (#202) Resolves #200 * comment out nightly trace tests until values reset. * remove redundant observe ignore (#168) * ensure nightly test env containers exist. (#205) Also bump integration test timeout * wxPython removal (#207) Replacing wxPython with Python's Tkinter. Also removing the option to choose multiple files as it is unused and causes errors, and fixing the load file/directory spinner. * Create CONTRIBUTING.md (#210) * Create CONTRIBUTING.md. Resolves #188 * run nightly golden tests sequentially. (#217) Should reduce resource requirements and potential CPU contention but increases overall execution time. * tests: added new setup configuration + test args (#211) - added utils for future tests and conftest - added test args * new docs build * golden test update	2019-02-20 23:52:34 +02:00
Gal Novik	0fa9d8e602	Update README.md (#182 )	2019-01-08 13:48:17 +02:00
Itai Caspi	d59a700248	updated benchmarks for pong and breakout for dueling ddqn with PER	2018-09-06 14:05:46 +03:00
Gal Leibovich	08a557bfd1	updated the benchmarks for space invaders with dueling ddqn variants	2018-09-06 12:13:49 +03:00
Itai Caspi	72a1d9d426	Itaicaspi/episode reset refactoring (#105 ) * reordering of the episode reset operation and allowing to store episodes only when they are terminated * reordering of the episode reset operation and allowing to store episodes only when they are terminated * revert tensorflow-gpu to 1.9.0 + bug fix in should_train() * tests readme file and refactoring of policy optimization agent train function * Update README.md * Update README.md * additional policy optimization train function simplifications * Updated the traces after the reordering of the environment reset * docker and jenkins files * updated the traces to the ones from within the docker container * updated traces and added control suite to the docker * updated jenkins file with the intel proxy + updated doom basic a3c test params * updated line breaks in jenkins file * added a missing line break in jenkins file * refining trace tests ignored presets + adding a configurable beta entropy value * switch the order of trace and golden tests in jenkins + fix golden tests processes not killed issue * updated benchmarks for dueling ddqn breakout and pong * allowing dynamic updates to the loss weights + bug fix in episode.update_returns * remove docker and jenkins file	2018-09-04 15:07:54 +03:00
Shadi Endrawis	07db625987	Running trace tests in parallel + other small fixes	2018-08-30 19:35:10 +03:00
Itai Caspi	c5165cd7d6	benchmarks and pip package updates	2018-08-19 14:23:20 +03:00
Gal Leibovich	23d2945bf8	Update README.md	2018-08-19 11:02:45 +03:00
Itai Caspi	e2e8143b94	additional benchmarks for dqn and a3c	2018-08-18 15:21:50 +03:00
Itai Caspi	2d5688c737	additional benchmarks for a3c and dqn	2018-08-16 20:01:35 +03:00
Itai Caspi	1de04d6fee	updated gifs in README + fix for multiworker crashes + improved Atari DQN and Dueling DDQN presets	2018-08-16 18:23:32 +03:00
Gal Novik	19ca5c24b1	pre-release 0.10.0	2018-08-13 17:11:34 +03:00
Itai Caspi	a7206ed702	Multiple improvements and bug fixes (#66 ) * Multiple improvements and bug fixes: * Using lazy stacking to save on memory when using a replay buffer * Remove step counting for evaluation episodes * Reset game between heatup and training * Major bug fixes in NEC (is reproducing the paper results for pong now) * Image input rescaling to 0-1 is now optional * Change the terminal title to be the experiment name * Observation cropping for atari is now optional * Added random number of noop actions for gym to match the dqn paper * Fixed a bug where the evaluation episodes won't start with the max possible ale lives * Added a script for plotting the results of an experiment over all the atari games	2018-02-26 12:29:07 +02:00
Itai Caspi	f5d645d8a6	resize training curves images	2017-11-09 09:13:12 +02:00
Itai Caspi	8ee9e46083	fixing some typos in the benchmarks README	2017-11-09 08:58:52 +02:00
Itai Caspi	c798be7bfb	added training curves for some of the presets	2017-11-09 08:54:34 +02:00

20 Commits