Gal Novik
79b05a8105
Wolpertinger preset failure fix ( #434 )
Numpy 1.18 fails to cast float to int as part of the wolpertinger preset run
2020-01-14 16:26:38 +02:00
Gal Leibovich
138ced23ba
RL in Large Discrete Action Spaces - Wolpertinger Agent ( #394 )
* Currently this is specific to the case of discretizing a continuous action space. It can easily be adapted to other cases by feeding the kNN differently and removing the discretizing output action filter
2019-09-08 12:53:49 +03:00
Gal Leibovich
c1d1fae342
Distiller's AMC induced changes ( #359 )
* override episode rewards with the last transition reward
* EWMA normalization filter
* allowing control over when the pre_network filter runs
2019-08-05 10:24:58 +03:00
Gal Leibovich
19ad2d60a7
Batch RL Tutorial ( #372 )
2019-07-14 18:43:48 +03:00
Gal Leibovich
4c996e147e
applying filters for a csv loaded dataset + some bug-fixes in data loading ( #319 )
2019-05-28 15:44:55 +03:00
Gal Leibovich
30c2b2fc45
moving to skimage.transform.resize ( #321 )
2019-05-23 13:38:01 +03:00
Zach Dwiel
cd812b0d25
more clear names for methods of Space ( #181 )
* rename Space.val_matches_space_definition -> contains; Space.is_point_in_space_shape -> valid_index
* rename valid_index -> is_valid_index
2019-01-14 15:02:53 -05:00
Gal Leibovich
4c914c057c
fix for finding the right filter checkpoint to restore + do not update internal filter state when evaluating + fix SharedRunningStats checkpoint filenames ( #147 )
2018-12-17 21:36:27 +02:00
Gal Leibovich
f9ee526536
Fix for issue #128 - circular DQN import ( #130 )
2018-12-16 16:06:44 +02:00
Ryan Peach
28e5b8b612
Minor bugfix on RewardFilter in Readme ( #133 )
2018-11-30 16:02:08 -08:00
Gal Leibovich
a1c56edd98
Fixes for having NumpySharedRunningStats syncing on multi-node ( #139 )
1. Having the standard checkpoint prefix in order for the data store to grab it, and sync it to S3.
2. Removing the reference to Redis so that it won't try to pickle that in.
3. Enable restoring a checkpoint into a single-worker run, which was saved by a single-node-multiple-worker run.
2018-11-23 16:11:47 +02:00
Sina Afrooze
87a7848b0a
Moved tf.variable_scope and tf.device calls to framework-specific architecture ( #136 )
2018-11-22 22:52:21 +02:00
Gal Leibovich
a112ee69f6
Save filters' internal state ( #127 )
* save filters' internal state
* moving the restore to be made from within NumpyRunningStats
2018-11-20 17:21:48 +02:00
Gal Leibovich
6caf721d1c
Numpy shared running stats ( #97 )
2018-11-18 14:46:40 +02:00
Ajay Deshpande
fde73ced13
Simulating the act on the trainer. ( #65 )
* Remove the use of daemon threads for Redis subscribe.
* Emulate act and observe on trainer side to update internal vars.
2018-11-15 08:38:58 -08:00
Itai Caspi
6d40ad1650
update of api docstrings across coach and tutorials [WIP] ( #91 )
* updating the documentation website
* adding the built docs
* update of api docstrings across coach and tutorials 0-2
* added some missing api documentation
* New Sphinx based documentation
2018-11-15 15:00:13 +02:00
Balaji Subramaniam
a849c17e46
Enable distributed SharedRunningStats ( #81 )
- Use Redis pub/sub for updating SharedRunningStats.
2018-11-13 19:17:38 +02:00
itaicaspi-intel
171fe97a3a
imitation related bug fixes
2018-09-12 15:26:16 +03:00
itaicaspi-intel
658b437079
removing datasets + imports optimization
2018-08-27 10:54:11 +03:00
Gal Novik
19ca5c24b1
pre-release 0.10.0
2018-08-13 17:11:34 +03:00