Guy Jacob
ba20396f63
Update Pillow version ( #481 )
2021-05-09 09:29:48 +03:00
Guy Jacob
a1a2e67fbd
logging screen output to file ( #479 )
...
Co-authored-by: Gal Leibovich <gal.leibovich@intel.com>
2021-05-06 18:02:27 +03:00
Guy Jacob
9106b69227
Add is_on_policy property to agents ( #480 )
2021-05-06 18:02:02 +03:00
Guy Jacob
06bacd9de0
Fix Rust compiler build error (Kubernetes dependency) ( #471 )
...
Update pip version during CircleCI setup stage to resolve Rust compiler build error (as suggested in https://cryptography.io/en/latest/faq.html#installing-cryptography-fails-with-error-can-not-find-rust-compiler)
2021-02-09 15:54:44 +02:00
Guy Jacob
f52ff1784d
Fix breaking change from minio update ( #469 )
...
`ResponseError` replaced by `S3Error` in new minio version
2020-12-15 10:02:16 +02:00
Gal Novik
59e08034c6
Update README.md
2020-11-09 10:25:05 +02:00
Gal Novik
57e809c094
Docs updates following github repo change
2020-11-08 11:54:38 +02:00
Guy Jacob
bc65f1f5fb
Pin Vizdoom version - one more location ( #468 )
2020-11-04 11:37:35 +02:00
Gal Novik
4318fea436
Update requirements.txt ( #466 )
2020-11-04 09:44:30 +02:00
Guy Jacob
fd765e7e38
Pin Vizdoom version ( #467 )
2020-11-03 21:28:25 +02:00
Guy Jacob
103d4477eb
Disable NumPy and TF2 related warnings ( #463 )
2020-09-24 15:11:45 +03:00
Gal Novik
c9738280fd
Require Python 3.6 + Changes to CI configuration ( #452 )
...
* Change build_*_env jobs to pull the base image of the current "tag" instead of the "master" image
* Change nightly flow so build_*_env jobs are now gated by build_base (so the change in the previous bullet works in nightly)
* Bugfix in CheckpointDataStore: call to object.__init__ with parameters
* Disabling unstable Doom A3C and ACER golden tests
2020-07-26 16:11:22 +03:00
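The CheckpointDataStore bullet in the commit above names a general Python pitfall: forwarding constructor arguments to `object.__init__` raises a `TypeError`. The following is a minimal sketch of that pitfall only. The class names `BrokenStore` and `FixedStore` and the helper `raises_type_error` are hypothetical and are not taken from the Coach code base, and the actual fix in #452 may look different.

```python
class BrokenStore:
    """Hypothetical stand-in; not the real CheckpointDataStore."""

    def __init__(self, params):
        # The only base class is `object`, whose __init__ accepts no extra arguments,
        # so this call raises TypeError.
        super().__init__(params)
        self.params = params


class FixedStore:
    """Same class, with the arguments no longer forwarded to object.__init__."""

    def __init__(self, params):
        super().__init__()
        self.params = params


def raises_type_error(cls, *args):
    """Return True if constructing cls(*args) raises TypeError."""
    try:
        cls(*args)
    except TypeError:
        return True
    return False
```

Calling `raises_type_error(BrokenStore, {})` reports the failure, while `FixedStore({})` constructs normally.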
Guy Jacob
a6689b6036
Update cluster name in .circleci/config.yml (now all locations)
2020-06-24 16:18:49 +03:00
Guy Jacob
6658bfa429
Update cluster name in .circleci/config.yml
2020-06-24 15:24:41 +03:00
Gal Novik
f3ce685cb1
Upgrading Pillow version due to security vulnerability ( #444 )
2020-04-22 20:52:24 +03:00
Gal Novik
79b05a8105
Wolpertinger preset failure fix ( #434 )
...
NumPy 1.18 fails to cast float to int as part of the Wolpertinger preset run
2020-01-14 16:26:38 +02:00
Dan Elbaz
525a22cb5b
Roll-back bokeh to version 1.0.4 ( #431 )
2019-12-23 09:33:53 +02:00
Brian Broll
0867d8d0fb
Fixed typo: Nerual -> Neural ( #425 )
2019-11-16 21:13:24 +02:00
shadiendrawis
188b86369a
fix e-greedy in case action values were equal ( #423 )
2019-11-10 17:20:44 +02:00
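The commit above concerns e-greedy action selection when several action values are equal. The log does not say how the fix works, so the following is only a sketch of one common approach, random tie-breaking among the best actions. The function name `epsilon_greedy` and its parameters are assumptions for illustration, not the Coach API.

```python
import random


def epsilon_greedy(action_values, epsilon, rng=random):
    """Pick an action index; ties among the best values are broken uniformly at random."""
    if rng.random() < epsilon:
        # Explore: any action, uniformly.
        return rng.randrange(len(action_values))
    best = max(action_values)
    # Exploit: collect every index that attains the maximum, then pick one at random.
    # A plain argmax would always return the first such index.
    best_actions = [i for i, v in enumerate(action_values) if v == best]
    return rng.choice(best_actions)
```

With `epsilon=0.0` and values such as `[1.0, 1.0, 0.5]`, repeated calls return both index 0 and index 1 instead of always returning 0.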
shadiendrawis
6ca91b9090
add reset internal state to rollout worker ( #421 )
2019-11-03 14:42:51 +02:00
Gal Leibovich
e288a552dd
Update requirements.txt ( #422 )
2019-10-28 18:30:48 +02:00
Gal Leibovich
66fada7f78
Remove assertion from BatchRLGraphManager
2019-10-22 11:54:14 +03:00
shadiendrawis
6db695ad8a
freeze tensorflow version to <= 1.14.0 ( #416 )
2019-10-10 17:47:25 +03:00
shadiendrawis
5ad5a58350
fix atari stack overflow ( #412 )
2019-10-06 18:14:21 +03:00
shadiendrawis
0a712ecc94
Fix numpy shared running stats to support images ( #411 )
2019-10-06 12:16:38 +03:00
Gal Leibovich
79a4161eca
Workaround for dumping gifs through the Python API ( #405 )
2019-09-26 12:21:25 +03:00
Pi Esposito
9e82c06be3
importing heads parameters from the correct file on tutorial #1 ( #403 )
2019-09-24 20:44:49 +03:00
Gal Novik
34bc292e60
Limiting intel-tensorflow version to 1.13.1 to re-enable CI; Updating nightly schedule to run on Saturdays as well
2019-09-23 12:52:00 +03:00
Gal Novik
0704260b5d
Updating EKS cluster name
2019-09-20 16:12:35 +03:00
Gal Novik
b5d66c0942
Removing CARLA docker file from README ( #402 )
2019-09-16 07:17:58 +03:00
Gal Leibovich
c7949d7011
Fix Atari Schedule Heatup
2019-09-08 16:57:38 +03:00
Gal Novik
13a4a09f72
removing weekly tests ( #398 )
2019-09-08 14:04:24 +03:00
Gal Leibovich
138ced23ba
RL in Large Discrete Action Spaces - Wolpertinger Agent ( #394 )
...
* Currently this is specific to the case of discretizing a continuous action space. It can easily be adapted to other cases by feeding the kNN differently and removing the discretizing output action filter
2019-09-08 12:53:49 +03:00
shadiendrawis
fc50398544
typo fix ( #396 )
2019-09-04 12:40:23 +03:00
Zach Dwiel
7b0fccb041
Add RedisDataStore ( #295 )
...
* GraphManager.set_session also sets self.sess
* make sure that GraphManager.fetch_from_worker uses training phase
* remove unnecessary phase setting in training worker
* reorganize rollout worker
* provide default name to GlobalVariableSaver.__init__ since it isn't really used anyway
* allow dividing TrainingSteps and EnvironmentSteps
* add timestamps to the log
* added redis data store
* conflict merge fix
2019-08-28 21:15:58 +03:00
Scott Leishman
34e1c04f29
further CI cluster name updates. ( #387 )
2019-08-06 10:18:07 +03:00
Gal Novik
92460736bc
Updated tutorial and docs ( #386 )
...
Improved getting started tutorial, and updated docs to point to version 1.0.0
2019-08-05 16:46:15 +03:00
Gal Leibovich
c1d1fae342
Distiller's AMC induced changes ( #359 )
...
* override episode rewards with the last transition reward
* EWMA normalization filter
* allowing control over when the pre_network filter runs
2019-08-05 10:24:58 +03:00
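The commit above lists an EWMA normalization filter without describing it. As a rough illustration of the general technique (exponentially weighted running mean and variance used to normalize incoming values), here is a minimal sketch. The class `EwmaNormalizer`, its parameters `alpha` and `eps`, and the exact update rule are assumptions and do not reflect the filter added in #359.

```python
import math


class EwmaNormalizer:
    """Normalize scalars using an exponentially weighted running mean and variance."""

    def __init__(self, alpha=0.1, eps=1e-8):
        self.alpha = alpha  # weight given to the newest sample (assumed name)
        self.eps = eps      # guards against division by zero
        self.mean = None
        self.var = 0.0

    def update(self, x):
        if self.mean is None:
            # The first sample initializes the running mean.
            self.mean = float(x)
            return
        delta = x - self.mean
        self.mean += self.alpha * delta
        self.var = (1.0 - self.alpha) * (self.var + self.alpha * delta * delta)

    def normalize(self, x):
        # Must be called after at least one update().
        return (x - self.mean) / math.sqrt(self.var + self.eps)
```

A larger `alpha` makes the statistics track recent values more closely, while a smaller one smooths over a longer history.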
Scott Leishman
7df67dafa3
update to point at new CI cluster. ( #385 )
2019-08-04 13:55:04 +03:00
Gal Novik
2697142d5a
Release 1.0.0 ( #382 )
...
* Updating README
* Shortening test cycles
2019-07-24 16:10:58 +03:00
Gal Leibovich
718597ce9a
Fixes to Batch RL tutorial ( #378 )
2019-07-16 11:22:42 +03:00
Gal Novik
0a4cc7e081
Additional cmd line examples ( #377 )
...
Adding command line examples to the Quick Start Guide tutorial
2019-07-15 12:32:59 +03:00
Gal Leibovich
19ad2d60a7
Batch RL Tutorial ( #372 )
2019-07-14 18:43:48 +03:00
Gal Novik
b82414138d
Workaround the OSError due to bad address failure on the CI runs ( #370 )
2019-07-07 17:11:19 +03:00
Gal Leibovich
587b74e04a
Remove double call to reset_internal_state() on gym environments ( #364 )
2019-07-02 13:43:23 +03:00
anabwan
a576ab5659
tests: Removed mxnet from functional tests + minor fix on rewards ( #362 )
...
* ci: change workflow
* changed timeout
* fix function reach reward
* print logs
* removing mxnet
* res'
2019-06-27 18:52:29 +03:00
anabwan
30c64d0656
using gym=0.12.5 instead of latest ( #360 )
...
* using gym=0.12.5 instead of latest
* changing docker gym version
* changing dockerfile gym version
2019-06-24 10:34:28 +03:00
Gal Leibovich
d6795bd524
batchnorm fixes + disabling batchnorm in DDPG ( #353 )
...
Co-authored-by: James Casbon <casbon+gh@gmail.com>
2019-06-23 11:28:22 +03:00
anabwan
7b5d6a3f03
tests: stabilizing functional tests ( #355 )
...
* tests: stabilizing functional tests
* functional removed
2019-06-20 15:30:47 +03:00
shadiendrawis
8e812ef82f
Coach as a library ( #348 )
...
* CoachInterface + tutorial
* Some improvements and typo fixes
* merge tutorial 0 and 4
* typo fix + additional tutorial changes
* tutorial changes
* added reading signals and experiment path argument
2019-06-19 18:05:03 +03:00