gryf/coach

mirror of https://github.com/gryf/coach.git synced 2025-12-17 19:20:19 +01:00

Author	SHA1	Message	Date
Gal Novik	2697142d5a	Release 1.0.0 (#382) * Updating README * Shortening test cycles	2019-07-24 16:10:58 +03:00
Gal Novik	6e7e7f6d3d	Update setup.py to 0.12.1 (#337)	2019-05-30 10:13:36 +03:00
guyk1971	74db141d5e	SAC algorithm (#282) * SAC algorithm * SAC - updates to agent (learn_from_batch), sac_head and sac_q_head to fix problem in gradient calculation. Now SAC agents is able to train. gym_environment - fixing an error in access to gym.spaces * Soft Actor Critic - code cleanup * code cleanup * V-head initialization fix * SAC benchmarks * SAC Documentation * typo fix * documentation fixes * documentation and version update * README typo	2019-05-01 18:37:49 +03:00
shadiendrawis	a543f10c1a	fix Intel tensorflow installation issue (#281) * fix intel tensorflow installation issue * update version	2019-04-03 13:03:30 +03:00
Gal Leibovich	8be9ea5dc9	Update setup.py (#245)	2019-03-12 11:08:10 +02:00
Gal Novik	135f02fb46	wxPython removal (#207) Replacing wxPython with Python's Tkinter. Also removing the option to choose multiple files as it is unused and causes errors, and fixing the load file/directory spinner.	2019-01-23 20:49:37 +02:00
Scott Leishman	aa1dfd7599	Bump intel optimized tensorflow to 1.12.0	2018-12-14 10:15:19 -05:00
shadiendrawis	ff816b347d	aws pip package (#118) Added support for a rl-coach-slim package.	2018-11-19 14:00:16 +02:00
Scott Leishman	fe6857eabd	broaden supported package versions (#50) * broaden supported package versions. * fix mxnet variants. Also back-out tuple deprecation change introduced in prior commit. * correct CI image deployment on master branch merge.	2018-11-15 15:29:49 +02:00
Scott Leishman	524f8436a2	create per environment Dockerfiles. (#70) * create per environment Dockerfiles. Adjust CI setup to better parallelize runs. Fix a couple of issues in golden and trace tests. Update a few of the docs. * bugfix in mmc agent. Also install kubectl for CI, update badge branch. * remove integration test parallelism.	2018-11-14 07:40:22 -08:00
Sina Afrooze	5fadb9c18e	Adding mxnet components to rl_coach/architectures (#60) Adding mxnet components to rl_coach architectures. - Supports PPO and DQN - Tested with CartPole_PPO and CarPole_DQN - Normalizing filters don't work right now (see #49) and are disabled in CartPole_PPO preset - Checkpointing is disabled for MXNet	2018-11-07 17:07:15 +02:00
Ajay Deshpande	ce9838a7d6	Adding kubernetes orchestrator for rollouts, adding requirements for incremental docker builds	2018-10-23 16:46:04 -04:00
Itai Caspi	72a1d9d426	Itaicaspi/episode reset refactoring (#105) * reordering of the episode reset operation and allowing to store episodes only when they are terminated * reordering of the episode reset operation and allowing to store episodes only when they are terminated * revert tensorflow-gpu to 1.9.0 + bug fix in should_train() * tests readme file and refactoring of policy optimization agent train function * Update README.md * Update README.md * additional policy optimization train function simplifications * Updated the traces after the reordering of the environment reset * docker and jenkins files * updated the traces to the ones from within the docker container * updated traces and added control suite to the docker * updated jenkins file with the intel proxy + updated doom basic a3c test params * updated line breaks in jenkins file * added a missing line break in jenkins file * refining trace tests ignored presets + adding a configurable beta entropy value * switch the order of trace and golden tests in jenkins + fix golden tests processes not killed issue * updated benchmarks for dueling ddqn breakout and pong * allowing dynamic updates to the loss weights + bug fix in episode.update_returns * remove docker and jenkins file	2018-09-04 15:07:54 +03:00
itaicaspi-intel	2c62a40466	bug fix in dueling network + revert to TF 1.6 for CPU due to requirements compatibility issues	2018-09-02 13:38:16 +03:00
Itai Caspi	3a399d1361	Tensorflow 1.10 and python 3.6 (#104) * updating setup.py to install tensorflow 1.10 both on cpu and on gpu * allow python 3.6	2018-09-02 10:12:00 +03:00
Itai Caspi	c5165cd7d6	benchmarks and pip package updates	2018-08-19 14:23:20 +03:00
Itai Caspi	0be4a42701	updates needed for the pip package	2018-08-19 10:39:03 +03:00
Gal Novik	19ca5c24b1	pre-release 0.10.0	2018-08-13 17:11:34 +03:00

18 Commits