1
0
mirror of https://github.com/gryf/coach.git synced 2026-01-10 15:54:12 +01:00
Files
coach/rl_coach/architectures/tensorflow_components/heads
Gal Leibovich a1bb8eef89 DDPG Critic Head Bug Fix (#344)
* A bug fix for DDPG, where the update to the policy network was based on the sum of the critic's Q predictions on the batch instead of their mean
2019-06-05 17:47:56 +03:00
..
2019-06-05 17:47:56 +03:00
2018-11-09 08:17:04 -08:00
2019-06-05 17:47:56 +03:00
2019-05-01 18:37:49 +03:00
2019-05-01 18:37:49 +03:00
2019-05-01 18:37:49 +03:00