Gal Leibovich
19ad2d60a7
Batch RL Tutorial ( #372 )
2019-07-14 18:43:48 +03:00
Gal Leibovich
acceb03ac0
bug fixes for OPE ( #311 )
2019-05-21 16:39:11 +03:00
Gal Leibovich
582921ffe3
OPE: Weighted Importance Sampling ( #299 )
2019-05-02 19:25:42 +03:00
Gal Leibovich
6e08c55ad5
Enabling-more-agents-for-Batch-RL-and-cleanup ( #258 )
...
allowing for the last training batch drawn to be smaller than batch_size + adding support for more agents in BatchRL by adding softmax with temperature to the corresponding heads + adding a CartPole_QR_DQN preset with a golden test + cleanups
2019-03-21 16:10:29 +02:00
Gal Leibovich
e3c7e526c7
Batch RL ( #238 )
2019-03-19 18:07:09 +02:00
Ajay Deshpande
6b2de6ba6d
Adding initial interface for backend and redis pubsub ( #19 )
...
* Adding initial interface for backend and redis pubsub
* Addressing comments, adding super in all memories
* Removing distributed experience replay
2018-10-23 16:51:48 -04:00
Zach Dwiel
9f1f9e5ab4
replace ExperienceReplay._num_transitions with len(ExperienceReplay.transitions)
2018-10-23 16:34:38 -04:00
Zach Dwiel
cccfe88f9b
remove unused method: update_last_transition_info
2018-10-23 16:34:38 -04:00
itaicaspi-intel
607ef17431
added a simple progress bar implementation
2018-09-13 14:21:38 +03:00
itaicaspi-intel
a16d724963
removing some of the presets from the trace tests + more robust replay buffer loading
2018-09-12 15:26:16 +03:00
itaicaspi-intel
a9bd1047c4
load and save function for non-episodic replay buffers + carla improvements + network bug fixes
2018-09-12 15:26:16 +03:00
itaicaspi-intel
658b437079
removing datasets + imports optimization
2018-08-27 10:54:11 +03:00
Gal Novik
19ca5c24b1
pre-release 0.10.0
2018-08-13 17:11:34 +03:00