coach

gryf/coach

mirror of https://github.com/gryf/coach.git synced 2026-07-07 01:46:31 +02:00

Files

T

Gal Leibovich a1bb8eef89 DDPG Critic Head Bug Fix (#344 )

* A bug fix for DDPG, where the update to the policy network was based on the sum of the critic's Q predictions on the batch instead of their mean

2019-06-05 17:47:56 +03:00

2019-05-12 15:40:02 +03:00

2019-06-05 17:47:56 +03:00

2019-03-21 12:57:56 +02:00

__init__.py

pre-release 0.10.0

2018-08-13 17:11:34 +03:00

architecture.py

2019-03-03 15:11:06 +02:00

distributed_tf_utils.py

2018-11-21 16:09:04 +02:00

general_network.py

2019-03-03 15:11:06 +02:00

layers.py

2019-03-21 12:57:56 +02:00

savers.py

2019-04-04 11:09:19 -04:00

shared_variables.py

2018-11-23 16:11:47 +02:00

utils.py

2018-11-06 17:39:29 +02:00