Itai Caspi
11faf19649
QR-DQN bug fix and improvements ( #30 )
* bug fix - QR-DQN was using the error instead of the absolute error in the quantile Huber loss
* improvement - QR-DQN now sorts the quantiles only once instead of batch_size times
* new feature - adding the Breakout QRDQN preset (verified to achieve good results)
2017-11-29 14:01:59 +02:00
galleibo-intel
3c330768f0
Fix for NEC not saving the DND when saving a model
2017-11-09 19:13:23 +02:00
Itai Caspi
a8bce9828c
new feature - implementation of Quantile Regression DQN ( https://arxiv.org/pdf/1710.10044v1.pdf )
API change - Distributional DQN renamed to Categorical DQN
2017-11-01 15:09:07 +02:00
Itai Caspi
913ab75e8a
bug fix - preventing crashes when the probability of one of the actions is 0 in the policy head
2017-10-31 10:51:48 +02:00
Itai Caspi
1918f16079
improved API for getting / setting variables within the graph
2017-10-31 10:51:48 +02:00
cxx
f43c951c2d
Unify base class using new-style (object).
2017-10-26 12:33:09 +03:00
Gal Leibovich
1d4c3455e7
coach v0.8.0
2017-10-19 13:10:15 +03:00