Gal Leibovich
a1bb8eef89
DDPG Critic Head Bug Fix (#344)
* A bug fix for DDPG, where the update to the policy network was based on the sum of the critic's Q predictions on the batch instead of their mean
2019-06-05 17:47:56 +03:00
..
2019-05-12 15:40:02 +03:00
2019-06-05 17:47:56 +03:00
2019-03-21 12:57:56 +02:00
2018-08-13 17:11:34 +03:00
2019-03-03 15:11:06 +02:00
2018-11-21 16:09:04 +02:00
2019-03-03 15:11:06 +02:00
2019-03-21 12:57:56 +02:00
2019-04-04 11:09:19 -04:00
2018-11-23 16:11:47 +02:00
2018-11-06 17:39:29 +02:00