Gal Leibovich
9e9c4fd332
Create a dataset using an agent (#306)
Generate a dataset using an agent (with an option to choose between this and a random dataset)
2019-05-28 09:34:49 +03:00
Gal Leibovich
acceb03ac0
bug fixes for OPE (#311)
2019-05-21 16:39:11 +03:00
Gal Leibovich
582921ffe3
OPE: Weighted Importance Sampling (#299)
2019-05-02 19:25:42 +03:00
Gal Leibovich
4741b0b916
BCQ variant on top of DDQN (#276)
* kNN based model for predicting which actions to drop
* fix for seeds with batch rl
2019-04-16 17:06:23 +03:00
Gal Leibovich
6e08c55ad5
Enabling-more-agents-for-Batch-RL-and-cleanup (#258)
* allowing the last training batch drawn to be smaller than batch_size
* adding support for more agents in BatchRL by adding softmax with temperature to the corresponding heads
* adding a CartPole_QR_DQN preset with a golden test
* cleanups
2019-03-21 16:10:29 +02:00
Gal Leibovich
e3c7e526c7
Batch RL (#238)
2019-03-19 18:07:09 +02:00
itaicaspi-intel
658b437079
removing datasets + imports optimization
2018-08-27 10:54:11 +03:00
Gal Novik
19ca5c24b1
pre-release 0.10.0
2018-08-13 17:11:34 +03:00