Itai Caspi
|
913ab75e8a
|
bug fix - preventing crashes when the probability of one of the actions is 0 in the policy head
|
2017-10-31 10:51:48 +02:00 |
|
Itai Caspi
|
1918f16079
|
imporved API for getting / setting variables within the graph
|
2017-10-31 10:51:48 +02:00 |
|
cxx
|
f43c951c2d
|
Unify base class using new-style (object).
|
2017-10-26 12:33:09 +03:00 |
|
Itai Caspi
|
39cf78074c
|
preventing the evaluation agent from getting stuck in bad policies by updating from the global network during episodes
|
2017-10-25 10:28:45 +03:00 |
|
Gal Leibovich
|
eb0b57d7fa
|
Updating PPO references per issue #11
|
2017-10-24 16:57:44 +03:00 |
|
Gal Leibovich
|
1d4c3455e7
|
coach v0.8.0
|
2017-10-19 13:10:15 +03:00 |
|