Mirror of https://github.com/gryf/coach.git, synced 2025-12-18 11:40:18 +01:00
Corrected MXNet's PPO Head for Continuous Action Spaces (#84)
* Changes required for Continuous PPO Head with MXNet. Used in MountainCarContinuous_ClippedPPO.
* Simplified changes for continuous ppo.
* Cleaned up to avoid duplicate code, and simplified covariance creation.
committed by Scott Leishman
parent fde73ced13
commit 3358e04a6a
@@ -299,8 +299,7 @@ class MxnetArchitecture(Architecture):
         assert outputs is None, "outputs must be None"
 
         output = self._predict(inputs)
-
-        output = tuple(o.asnumpy() for o in output)
+        output = list(o.asnumpy() for o in output)
         if squeeze_output:
             output = squeeze_list(output)
         return output
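The hunk above swaps a tuple for a list just before the call to squeeze_list. The sketch below illustrates one plausible reason the container type could matter. It assumes that squeeze_list unwraps only one-element list instances; its real definition is not part of this diff, so the helper here is a hypothetical stand-in, as are FakeNDArray (which mimics only the asnumpy() method) and postprocess.

```python
# Hypothetical stand-ins for illustration; none of this is copied from the repository.


class FakeNDArray:
    """Mimics the single method the diff relies on: asnumpy()."""

    def __init__(self, values):
        self._values = values

    def asnumpy(self):
        # A real NDArray would return a NumPy array; a plain list is enough here.
        return list(self._values)


def squeeze_list(var):
    # ASSUMPTION: only a one-element *list* is unwrapped; anything else passes through.
    if isinstance(var, list) and len(var) == 1:
        return var[0]
    return var


def postprocess(raw_outputs, as_list, squeeze_output=True):
    # as_list=False follows the removed line, as_list=True the added line.
    container = list if as_list else tuple
    output = container(o.asnumpy() for o in raw_outputs)
    if squeeze_output:
        output = squeeze_list(output)
    return output


single_head = [FakeNDArray([0.1, 0.9])]
old_result = postprocess(single_head, as_list=False)  # stays wrapped in a tuple
new_result = postprocess(single_head, as_list=True)   # unwrapped to the head's values
```

Under that assumption, a network with a single output head would return a one-element tuple before the change and the bare array after it, which is what callers passing squeeze_output=True would expect.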