coach

gryf/coach

mirror of https://github.com/gryf/coach.git synced 2025-12-17 11:10:20 +01:00

Author	SHA1	Message	Date
shadiendrawis	ff816b347d	aws pip package (#118 ) Added support for a rl-coach-slim package.	2018-11-19 14:00:16 +02:00
Scott Leishman	fe6857eabd	broaden supported package versions (#50 ) * broaden supported package versions. * fix mxnet variants. Also back-out tuple deprecation change introduced in prior commit. * correct CI image deployment on master branch merge.	2018-11-15 15:29:49 +02:00
Scott Leishman	524f8436a2	create per environment Dockerfiles. (#70 ) * create per environment Dockerfiles. Adjust CI setup to better parallelize runs. Fix a couple of issues in golden and trace tests. Update a few of the docs. * bugfix in mmc agent. Also install kubectl for CI, update badge branch. * remove integration test parallelism.	2018-11-14 07:40:22 -08:00
Sina Afrooze	5fadb9c18e	Adding mxnet components to rl_coach/architectures (#60 ) Adding mxnet components to rl_coach architectures. - Supports PPO and DQN - Tested with CartPole_PPO and CarPole_DQN - Normalizing filters don't work right now (see #49) and are disabled in CartPole_PPO preset - Checkpointing is disabled for MXNet	2018-11-07 17:07:15 +02:00
Ajay Deshpande	ce9838a7d6	Adding kubernetes orchestrator for rollouts, adding requirements for incremental docker builds	2018-10-23 16:46:04 -04:00
Itai Caspi	72a1d9d426	Itaicaspi/episode reset refactoring (#105 ) * reordering of the episode reset operation and allowing to store episodes only when they are terminated * reordering of the episode reset operation and allowing to store episodes only when they are terminated * revert tensorflow-gpu to 1.9.0 + bug fix in should_train() * tests readme file and refactoring of policy optimization agent train function * Update README.md * Update README.md * additional policy optimization train function simplifications * Updated the traces after the reordering of the environment reset * docker and jenkins files * updated the traces to the ones from within the docker container * updated traces and added control suite to the docker * updated jenkins file with the intel proxy + updated doom basic a3c test params * updated line breaks in jenkins file * added a missing line break in jenkins file * refining trace tests ignored presets + adding a configurable beta entropy value * switch the order of trace and golden tests in jenkins + fix golden tests processes not killed issue * updated benchmarks for dueling ddqn breakout and pong * allowing dynamic updates to the loss weights + bug fix in episode.update_returns * remove docker and jenkins file	2018-09-04 15:07:54 +03:00
itaicaspi-intel	2c62a40466	bug fix in dueling network + revert to TF 1.6 for CPU due to requirements compatibility issues	2018-09-02 13:38:16 +03:00
Itai Caspi	3a399d1361	Tensorflow 1.10 and python 3.6 (#104 ) * updating setup.py to install tensorflow 1.10 both on cpu and on gpu * allow python 3.6	2018-09-02 10:12:00 +03:00
Itai Caspi	c5165cd7d6	benchmarks and pip package updates	2018-08-19 14:23:20 +03:00
Itai Caspi	0be4a42701	updates needed for the pip package	2018-08-19 10:39:03 +03:00
Gal Novik	19ca5c24b1	pre-release 0.10.0	2018-08-13 17:11:34 +03:00

11 Commits