1
0
mirror of https://github.com/gryf/coach.git synced 2025-12-18 03:30:19 +01:00
Files
coach/rl_coach/agents
Gal Leibovich a1bb8eef89 DDPG Critic Head Bug Fix (#344)
* A bug fix for DDPG, where the update to the policy network was based on the sum of the critic's Q predictions on the batch instead of their mean
2019-06-05 17:47:56 +03:00
..
2018-08-13 17:11:34 +03:00
2019-03-17 15:33:28 +02:00
2019-03-19 18:07:09 +02:00
2019-06-05 17:47:56 +03:00
2019-05-21 16:39:11 +03:00
2018-09-12 15:26:16 +03:00
2019-03-19 18:07:09 +02:00