1
0
mirror of https://github.com/gryf/coach.git synced 2025-12-18 11:40:18 +01:00
Files
coach/rl_coach
Gal Leibovich a1bb8eef89 DDPG Critic Head Bug Fix (#344)
* A bug fix for DDPG, where the update to the policy network was based on the sum of the critic's Q predictions on the batch instead of their mean
2019-06-05 17:47:56 +03:00
..
2019-06-05 17:47:56 +03:00
2019-03-19 18:07:09 +02:00
2018-08-13 17:11:34 +03:00
2019-03-19 18:07:09 +02:00
2018-08-13 17:11:34 +03:00
2018-11-27 22:43:40 +02:00
2018-08-13 17:11:34 +03:00
2019-03-19 18:07:09 +02:00