Mirror of https://github.com/gryf/coach.git
Synced 2026-03-20 00:43:34 +01:00
* override episode rewards with the last transition reward
* EWMA normalization filter
* allowing control over when the pre_network filter runs