cxx
|
84e536d371
|
Fix std calculation using unbiased estimation in sharing stat mode.
|
2017-11-07 20:19:54 +02:00 |
|
Itai Caspi
|
b40259c61a
|
bug fix - remove import warning when everything was imported successfully + changed global step api to match TF 1.4
|
2017-11-06 17:28:13 +02:00 |
|
Itai Caspi
|
a8bce9828c
|
new feature - implementation of Quantile Regression DQN (https://arxiv.org/pdf/1710.10044v1.pdf)
API change - Distributional DQN renamed to Categorical DQN
|
2017-11-01 15:09:07 +02:00 |
|
Itai Caspi
|
913ab75e8a
|
bug fix - preventing crashes when the probability of one of the actions is 0 in the policy head
|
2017-10-31 10:51:48 +02:00 |
|
Itai Caspi
|
1918f16079
|
imporved API for getting / setting variables within the graph
|
2017-10-31 10:51:48 +02:00 |
|
cxx
|
f43c951c2d
|
Unify base class using new-style (object).
|
2017-10-26 12:33:09 +03:00 |
|
Gal Leibovich
|
1d4c3455e7
|
coach v0.8.0
|
2017-10-19 13:10:15 +03:00 |
|