coach

gryf/coach

mirror of https://github.com/gryf/coach.git synced 2026-03-19 08:23:33 +01:00

Author	SHA1	Message	Date
Gal Leibovich	7eb884c5b2	TD3 (#338 )	2019-06-16 11:11:21 +03:00
guyk1971	74db141d5e	SAC algorithm (#282 ) * SAC algorithm * SAC - updates to agent (learn_from_batch), sac_head and sac_q_head to fix problem in gradient calculation. Now SAC agents is able to train. gym_environment - fixing an error in access to gym.spaces * Soft Actor Critic - code cleanup * code cleanup * V-head initialization fix * SAC benchmarks * SAC Documentation * typo fix * documentation fixes * documentation and version update * README typo	2019-05-01 18:37:49 +03:00
shadiendrawis	2b5d1dabe6	ACER algorithm (#184 ) * initial ACER commit * Code cleanup + several fixes * Q-retrace bug fix + small clean-ups * added documentation for acer * ACER benchmarks * update benchmarks table * Add nightly running of golden and trace tests. (#202) Resolves #200 * comment out nightly trace tests until values reset. * remove redundant observe ignore (#168) * ensure nightly test env containers exist. (#205) Also bump integration test timeout * wxPython removal (#207) Replacing wxPython with Python's Tkinter. Also removing the option to choose multiple files as it is unused and causes errors, and fixing the load file/directory spinner. * Create CONTRIBUTING.md (#210) * Create CONTRIBUTING.md. Resolves #188 * run nightly golden tests sequentially. (#217) Should reduce resource requirements and potential CPU contention but increases overall execution time. * tests: added new setup configuration + test args (#211) - added utils for future tests and conftest - added test args * new docs build * golden test update	2019-02-20 23:52:34 +02:00
Gal Novik	4e0d018d5f	updated algorithms image in README	2018-11-27 23:12:13 +02:00
Gal Leibovich	a4471389a4	brightened starcraft.gif	2018-08-20 13:50:09 +03:00
Gal Leibovich	5e275e9795	update starcraft gif	2018-08-20 11:49:19 +03:00
Itai Caspi	1de04d6fee	updated gifs in README + fix for multiworker crashes + improved Atari DQN and Dueling DDQN presets	2018-08-16 18:23:32 +03:00
Gal Leibovich	8f99409387	updating algorithms.png for README	2018-08-16 16:46:26 +03:00
Gal Novik	19ca5c24b1	pre-release 0.10.0	2018-08-13 17:11:34 +03:00
Itai Caspi	125c7ee38d	Release 0.9 Main changes are detailed below: New features - * CARLA 0.7 simulator integration * Human control of the game play * Recording of human game play and storing / loading the replay buffer * Behavioral cloning agent and presets * Golden tests for several presets * Selecting between deep / shallow image embedders * Rendering through pygame (with some boost in performance) API changes - * Improved environment wrapper API * Added an evaluate flag to allow convenient evaluation of existing checkpoints * Improve frameskip definition in Gym Bug fixes - * Fixed loading of checkpoints for agents with more than one network * Fixed the N Step Q learning agent python3 compatibility	2017-12-19 19:27:16 +02:00
Itai Caspi	fd103a7b69	updated the algorithms diagram with QR-DQN	2017-11-01 15:24:54 +02:00
Gal Leibovich	1d4c3455e7	coach v0.8.0	2017-10-19 13:10:15 +03:00

12 Commits