mirror of https://github.com/gryf/coach.git synced 2025-12-18 11:40:18 +01:00

Corrected MXNet's PPO Head for Continuous Action Spaces (#84)

* Changes required for Continuous PPO Head with MXNet. Used in MountainCarContinuous_ClippedPPO.

* Simplified changes for continuous ppo.

* Cleaned up to avoid duplicate code, and simplified covariance creation.
Thom Lane
2018-11-15 13:27:54 -08:00
committed by Scott Leishman
parent fde73ced13
commit 3358e04a6a
3 changed files with 25 additions and 19 deletions


@@ -299,8 +299,7 @@ class MxnetArchitecture(Architecture):
         assert outputs is None, "outputs must be None"
         output = self._predict(inputs)
-        output = tuple(o.asnumpy() for o in output)
+        output = list(o.asnumpy() for o in output)
         if squeeze_output:
             output = squeeze_list(output)
         return output
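
The commit message does not say why the container changed from `tuple` to `list`. One plausible reason, sketched below, is that the squeeze step only unwraps a single-element `list`, so a one-element `tuple` would pass through unchanged. This is an assumption: `squeeze_list_sketch`, `FakeNDArray`, and `postprocess` are hypothetical stand-ins written for illustration, not Coach's or MXNet's actual code.

```python
def squeeze_list_sketch(var):
    # Hypothetical stand-in for squeeze_list: ASSUMED to unwrap
    # only a list that holds exactly one element.
    if isinstance(var, list) and len(var) == 1:
        return var[0]
    return var


class FakeNDArray:
    """Hypothetical stand-in for an object exposing asnumpy()."""

    def __init__(self, values):
        self._values = tuple(values)

    def asnumpy(self):
        # A real NDArray would return a NumPy array; a plain tuple
        # keeps this sketch dependency-free.
        return self._values


def postprocess(raw_outputs, squeeze_output=True, container=list):
    # Mirrors the shape of the hunk above: convert each output,
    # collect into `container`, then optionally squeeze.
    output = container(o.asnumpy() for o in raw_outputs)
    if squeeze_output:
        output = squeeze_list_sketch(output)
    return output


single_head = [FakeNDArray([0.1, 0.9])]
as_list = postprocess(single_head, container=list)    # unwrapped to the single output
as_tuple = postprocess(single_head, container=tuple)  # still wrapped in a 1-tuple
```

Under that assumption, callers that request `squeeze_output` for a single-output network get the bare array only when the outputs are collected into a `list`.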