Gal Novik
f3ce685cb1
Upgrading Pillow version due to security vulnerability ( #444 )
2020-04-22 20:52:24 +03:00
Gal Novik
79b05a8105
Wolpertinger preset failure fix ( #434 )
...
Numpy 1.18 fails to cast float to int as part of the wolpertinger preset run
2020-01-14 16:26:38 +02:00
Dan Elbaz
525a22cb5b
Roll-back bokeh to version 1.0.4 ( #431 )
...
Roll back bokeh to version 1.0.4
2019-12-23 09:33:53 +02:00
Brian Broll
0867d8d0fb
Fixed typo: Nerual -> Neural ( #425 )
2019-11-16 21:13:24 +02:00
shadiendrawis
188b86369a
fix e-greedy in case action values were equal ( #423 )
2019-11-10 17:20:44 +02:00
shadiendrawis
6ca91b9090
add reset internal state to rollout worker ( #421 )
2019-11-03 14:42:51 +02:00
Gal Leibovich
e288a552dd
Update requirements.txt ( #422 )
2019-10-28 18:30:48 +02:00
Gal Leibovich
66fada7f78
Remove assertion from BatchRLGraphManager
2019-10-22 11:54:14 +03:00
shadiendrawis
6db695ad8a
freeze tensorflow version to <= 1.14.0 ( #416 )
2019-10-10 17:47:25 +03:00
shadiendrawis
5ad5a58350
fix atari stack overflow ( #412 )
2019-10-06 18:14:21 +03:00
shadiendrawis
0a712ecc94
Fix numpy shared running stats to support images ( #411 )
2019-10-06 12:16:38 +03:00
Gal Leibovich
79a4161eca
Workaround for dumping gifs through the Python API ( #405 )
2019-09-26 12:21:25 +03:00
Pi Esposito
9e82c06be3
importing heads parameters from the correct file on tutorial #1 ( #403 )
2019-09-24 20:44:49 +03:00
Gal Novik
34bc292e60
Limiting intel-tensorflow version to 1.13.1 to re-enable CI; Updating nightly schedule to run on Saturdays as well
2019-09-23 12:52:00 +03:00
Gal Novik
0704260b5d
Updating EKS cluster name
2019-09-20 16:12:35 +03:00
Gal Novik
b5d66c0942
Removing CARLA docker file from README ( #402 )
2019-09-16 07:17:58 +03:00
Gal Leibovich
c7949d7011
Fix Atari Schedule Heatup
2019-09-08 16:57:38 +03:00
Gal Novik
13a4a09f72
removing weekly tests ( #398 )
2019-09-08 14:04:24 +03:00
Gal Leibovich
138ced23ba
RL in Large Discrete Action Spaces - Wolpertinger Agent ( #394 )
...
* Currently this is specific to the case of discretizing a continuous action space. Can easily be adapted to other case by feeding the kNN otherwise, and removing the usage of a discretizing output action filter
2019-09-08 12:53:49 +03:00
shadiendrawis
fc50398544
typo fix ( #396 )
2019-09-04 12:40:23 +03:00
Zach Dwiel
7b0fccb041
Add RedisDataStore ( #295 )
...
* GraphManager.set_session also sets self.sess
* make sure that GraphManager.fetch_from_worker uses training phase
* remove unnecessary phase setting in training worker
* reorganize rollout worker
* provide default name to GlobalVariableSaver.__init__ since it isn't really used anyway
* allow dividing TrainingSteps and EnvironmentSteps
* add timestamps to the log
* added redis data store
* conflict merge fix
2019-08-28 21:15:58 +03:00
Scott Leishman
34e1c04f29
further CI cluster name updates. ( #387 )
2019-08-06 10:18:07 +03:00
Gal Novik
92460736bc
Updated tutorial and docs ( #386 )
...
Improved getting started tutorial, and updated docs to point to version 1.0.0
2019-08-05 16:46:15 +03:00
Gal Leibovich
c1d1fae342
Distiller's AMC induced changes ( #359 )
...
* override episode rewards with the last transition reward
* EWMA normalization filter
* allowing control over when the pre_network filter runs
2019-08-05 10:24:58 +03:00
Scott Leishman
7df67dafa3
update to point at new CI cluster. ( #385 )
2019-08-04 13:55:04 +03:00
Gal Novik
2697142d5a
Release 1.0.0 ( #382 )
...
* Updating README
* Shortening test cycles
2019-07-24 16:10:58 +03:00
Gal Leibovich
718597ce9a
Fixes to Batch RL tutorial ( #378 )
2019-07-16 11:22:42 +03:00
Gal Novik
0a4cc7e081
Additional cmd line examples ( #377 )
...
Adding command line examples to the Quick Start Guide tutorial
2019-07-15 12:32:59 +03:00
Gal Leibovich
19ad2d60a7
Batch RL Tutorial ( #372 )
2019-07-14 18:43:48 +03:00
Gal Novik
b82414138d
Workaround the OSError due to bad address failure on the CI runs ( #370 )
...
workaround the OSError due to bad address failure on the CI runs
2019-07-07 17:11:19 +03:00
Gal Leibovich
587b74e04a
Remove double call to reset_internal_state() on gym environments ( #364 )
2019-07-02 13:43:23 +03:00
anabwan
a576ab5659
tests: Removed mxnet from functional tests + minor fix on rewards ( #362 )
...
* ci: change workflow
* changed timeout
* fix function reach reward
* print logs
* removing mxnet
* res'
2019-06-27 18:52:29 +03:00
anabwan
30c64d0656
using gym=0.12.5 instead of latest ( #360 )
...
* using gym=0.12.5 instead of latest
* changing docker gym version
* changing dockingfile gym version
2019-06-24 10:34:28 +03:00
Gal Leibovich
d6795bd524
batchnorm fixes + disabling batchnorm in DDPG ( #353 )
...
Co-authored-by: James Casbon <casbon+gh@gmail.com >
2019-06-23 11:28:22 +03:00
anabwan
7b5d6a3f03
tests: stabling functional tests ( #355 )
...
* tests: stabling functional tests
* functional removed
2019-06-20 15:30:47 +03:00
shadiendrawis
8e812ef82f
Coach as a library ( #348 )
...
* CoachInterface + tutorial
* Some improvements and typo fixes
* merge tutorial 0 and 4
* typo fix + additional tutorial changes
* tutorial changes
* added reading signals and experiment path argument
2019-06-19 18:05:03 +03:00
anabwan
1c90bc22a1
ci: using serial jobs in nightly ( #350 )
2019-06-17 10:53:36 +03:00
Gal Leibovich
7eb884c5b2
TD3 ( #338 )
2019-06-16 11:11:21 +03:00
Timo Kaufmann
8df3c46756
Do not hardcode path to bash ( #332 )
2019-06-10 20:10:28 +03:00
Gal Leibovich
a1bb8eef89
DDPG Critic Head Bug Fix ( #344 )
...
* A bug fix for DDPG, where the update to the policy network was based on the sum of the critic's Q predictions on the batch instead of their mean
2019-06-05 17:47:56 +03:00
anabwan
0aa5359d63
tests: added assert for cp param and changing test args order ( #342 )
2019-06-05 00:16:50 +03:00
Gal Novik
e49aac05aa
Update README.md ( #341 )
...
Adding some links to the tutorials from the README
2019-06-04 11:35:34 +03:00
anabwan
f6d5e60eff
Added build base for nightly ( #340 )
...
* Added build base for nightly
* fix requires
* remove commetted code
2019-06-03 23:04:34 +03:00
Gal Novik
6e7e7f6d3d
Update setup.py to 0.12.1 ( #337 )
2019-05-30 10:13:36 +03:00
anabwan
23df868d32
Removed unnecessary futures dependency ( #336 )
2019-05-29 14:34:48 +03:00
Gal Leibovich
4c996e147e
applying filters for a csv loaded dataset + some bug-fixes in data loading ( #319 )
2019-05-28 15:44:55 +03:00
anabwan
6319387357
increase timeout for golden tests ( #335 )
2019-05-28 14:19:11 +03:00
anabwan
f5ba14575c
tests: print logs on failure + fix -cp param ( #327 )
...
* tests: pring logs on failure
* fix import
* added job to circleci
* fix functional
* removed debug job
2019-05-28 13:45:43 +03:00
Gal Leibovich
251dc9ccc0
Preset dependent number of csv read attempts in golden testing ( #334 )
2019-05-28 12:19:57 +03:00
anabwan
ddffac8570
fixed release version ( #333 )
...
* fixed release version
* update docs
2019-05-28 11:11:15 +03:00