galleibo-intel | 3c330768f0 | Fix for NEC not saving the DND when saving a model | 2017-11-09 19:13:23 +02:00
galleibo-intel | f47b8092af | fix for intel optimized tensorflow on distributed runs + adding coach_env to .gitignore | 2017-11-06 19:41:32 +02:00
Itai Caspi | a8bce9828c | new feature - implementation of Quantile Regression DQN (https://arxiv.org/pdf/1710.10044v1.pdf); API change - Distributional DQN renamed to Categorical DQN | 2017-11-01 15:09:07 +02:00
Itai Caspi | 913ab75e8a | bug fix - preventing crashes when the probability of one of the actions is 0 in the policy head | 2017-10-31 10:51:48 +02:00
Itai Caspi | 1918f16079 | improved API for getting / setting variables within the graph | 2017-10-31 10:51:48 +02:00
cxx | f43c951c2d | Unify base class using new-style (object). | 2017-10-26 12:33:09 +03:00
Itai Caspi | 39cf78074c | preventing the evaluation agent from getting stuck in bad policies by updating from the global network during episodes | 2017-10-25 10:28:45 +03:00
Gal Leibovich | eb0b57d7fa | Updating PPO references per issue #11 | 2017-10-24 16:57:44 +03:00
Gal Leibovich | 1d4c3455e7 | coach v0.8.0 | 2017-10-19 13:10:15 +03:00