1
0
mirror of https://github.com/gryf/coach.git synced 2025-12-17 19:20:19 +01:00
Commit Graph

18 Commits

Author SHA1 Message Date
Gal Novik
2697142d5a Release 1.0.0 (#382)
* Updating README
* Shortening test cycles
2019-07-24 16:10:58 +03:00
Gal Novik
6e7e7f6d3d Update setup.py to 0.12.1 (#337) 2019-05-30 10:13:36 +03:00
guyk1971
74db141d5e SAC algorithm (#282)
* SAC algorithm

* SAC - updates to agent (learn_from_batch), sac_head and sac_q_head to fix problem in gradient calculation. Now SAC agents is able to train.
gym_environment - fixing an error in access to gym.spaces

* Soft Actor Critic - code cleanup

* code cleanup

* V-head initialization fix

* SAC benchmarks

* SAC Documentation

* typo fix

* documentation fixes

* documentation and version update

* README typo
2019-05-01 18:37:49 +03:00
shadiendrawis
a543f10c1a fix Intel tensorflow installation issue (#281)
* fix intel tensorflow installation issue

* update version
2019-04-03 13:03:30 +03:00
Gal Leibovich
8be9ea5dc9 Update setup.py (#245) 2019-03-12 11:08:10 +02:00
Gal Novik
135f02fb46 wxPython removal (#207)
Replacing wxPython with Python's Tkinter.
Also removing the option to choose multiple files as it is unused and causes errors, and fixing the load file/directory spinner.
2019-01-23 20:49:37 +02:00
Scott Leishman
aa1dfd7599 Bump intel optimized tensorflow to 1.12.0 2018-12-14 10:15:19 -05:00
shadiendrawis
ff816b347d aws pip package (#118)
Added support for a rl-coach-slim package.
2018-11-19 14:00:16 +02:00
Scott Leishman
fe6857eabd broaden supported package versions (#50)
* broaden supported package versions.
* fix mxnet variants.
Also back-out tuple deprecation change introduced in prior commit.
* correct CI image deployment on master branch merge.
2018-11-15 15:29:49 +02:00
Scott Leishman
524f8436a2 create per environment Dockerfiles. (#70)
* create per environment Dockerfiles.

Adjust CI setup to better parallelize runs.
Fix a couple of issues in golden and trace tests.
Update a few of the docs.

* bugfix in mmc agent.

Also install kubectl for CI, update badge branch.

* remove integration test parallelism.
2018-11-14 07:40:22 -08:00
Sina Afrooze
5fadb9c18e Adding mxnet components to rl_coach/architectures (#60)
Adding mxnet components to rl_coach architectures.

- Supports PPO and DQN
- Tested with CartPole_PPO and CarPole_DQN
- Normalizing filters don't work right now (see #49) and are disabled in CartPole_PPO preset
- Checkpointing is disabled for MXNet
2018-11-07 17:07:15 +02:00
Ajay Deshpande
ce9838a7d6 Adding kubernetes orchestrator for rollouts, adding requirements for incremental docker builds 2018-10-23 16:46:04 -04:00
Itai Caspi
72a1d9d426 Itaicaspi/episode reset refactoring (#105)
* reordering of the episode reset operation and allowing to store episodes only when they are terminated

* reordering of the episode reset operation and allowing to store episodes only when they are terminated

* revert tensorflow-gpu to 1.9.0 + bug fix in should_train()

* tests readme file and refactoring of policy optimization agent train function

* Update README.md

* Update README.md

* additional policy optimization train function simplifications

* Updated the traces after the reordering of the environment reset

* docker and jenkins files

* updated the traces to the ones from within the docker container

* updated traces and added control suite to the docker

* updated jenkins file with the intel proxy + updated doom basic a3c test params

* updated line breaks in jenkins file

* added a missing line break in jenkins file

* refining trace tests ignored presets + adding a configurable beta entropy value

* switch the order of trace and golden tests in jenkins + fix golden tests processes not killed issue

* updated benchmarks for dueling ddqn breakout and pong

* allowing dynamic updates to the loss weights + bug fix in episode.update_returns

* remove docker and jenkins file
2018-09-04 15:07:54 +03:00
itaicaspi-intel
2c62a40466 bug fix in dueling network + revert to TF 1.6 for CPU due to requirements compatibility issues 2018-09-02 13:38:16 +03:00
Itai Caspi
3a399d1361 Tensorflow 1.10 and python 3.6 (#104)
* updating setup.py to install tensorflow 1.10 both on cpu and on gpu

* allow python 3.6
2018-09-02 10:12:00 +03:00
Itai Caspi
c5165cd7d6 benchmarks and pip package updates 2018-08-19 14:23:20 +03:00
Itai Caspi
0be4a42701 updates needed for the pip package 2018-08-19 10:39:03 +03:00
Gal Novik
19ca5c24b1 pre-release 0.10.0 2018-08-13 17:11:34 +03:00