1
0
mirror of https://github.com/gryf/coach.git synced 2026-02-01 21:35:45 +01:00
Files
coach/rl_coach/traces/ControlSuite_DDPG_hopper_hop/trace.csv
2018-08-20 13:01:30 +03:00

2.5 KiB

1Episode #Training IterIn HeatupER #TransitionsER #EpisodesEpisode LengthTotal stepsEpsilonShaped Training RewardTraining RewardUpdate Target NetworkEvaluation RewardShaped Evaluation RewardSuccess RateLoss/MeanLoss/StdevLoss/MaxLoss/MinLearning Rate/MeanLearning Rate/StdevLearning Rate/MaxLearning Rate/MinGrads (unclipped)/MeanGrads (unclipped)/StdevGrads (unclipped)/MaxGrads (unclipped)/MinEntropy/MeanEntropy/StdevEntropy/MaxEntropy/MinAdvantages/MeanAdvantages/StdevAdvantages/MaxAdvantages/MinValues/MeanValues/StdevValues/MaxValues/MinValue Loss/MeanValue Loss/StdevValue Loss/MaxValue Loss/MinPolicy Loss/MeanPolicy Loss/StdevPolicy Loss/MaxPolicy Loss/MinQ/MeanQ/StdevQ/MaxQ/MinTD targets/MeanTD targets/StdevTD targets/MaxTD targets/Minactions/Meanactions/Stdevactions/Maxactions/Min
210.01.01000.01.01000.01000.00.00.0
320.01.02000.02.01000.02000.00.01.0
43999.00.03000.03.01000.03000.0-0.0176668301791740030.00.01.00.00381915728663895230.00376196069690404140.0265004057437181470.00053517712512984870.000100000000000000034.0657581468206416e-200.00010.00010.149636330.125574921.02960460.022291046000000002-0.060529970.073191170.09228404-0.81788486-0.28787821193209240.182902948768485670.15647711277008056-1.59495520591735840.0019951464768561350.70601229897264141.2513357156871705-1.2238670209506697
541999.00.04000.04.01000.04000.0-0.0399993624787529160.00.01.00.00059998245706810850.00052003968417942510.0073923217132687560.00011556806566659360.000100000000000000032.7105054312137605e-200.00010.00010.0350744460.0239165070000000040.273766640.007415438000000001-0.0558661520.035571980.048398294-0.20761846-0.12281116110059960.100642955238249360.10938632614910604-0.93856373906135560.38130506269092470.85864559889355261.9421025144380244-1.2856749345811207
652999.00.05000.05.01000.05000.00.171456014834037050.00.00.00.000168602313864539628.468335419663504e-050.00088604446500539774.115650153835304e-050.000100000000000000032.7105054312137605e-200.00010.00010.0133166130.00648336349999999950.0550229660.0038480079-0.046383090.028830105-0.0011850878-0.36706513-0.0559440479874419650.0617612079842930.16325088679790498-0.63330494284629820.26290623194621160.76535809796087341.6618100999356682-1.1699750612176198