1
0
mirror of https://github.com/gryf/coach.git synced 2026-01-07 22:34:23 +01:00
Files
coach/rl_coach/traces/ControlSuite_DDPG_cartpole_swingup/trace.csv
2018-08-20 13:01:30 +03:00

2.6 KiB

1Episode #Training IterIn HeatupER #TransitionsER #EpisodesEpisode LengthTotal stepsEpsilonShaped Training RewardTraining RewardUpdate Target NetworkEvaluation RewardShaped Evaluation RewardSuccess RateLoss/MeanLoss/StdevLoss/MaxLoss/MinLearning Rate/MeanLearning Rate/StdevLearning Rate/MaxLearning Rate/MinGrads (unclipped)/MeanGrads (unclipped)/StdevGrads (unclipped)/MaxGrads (unclipped)/MinEntropy/MeanEntropy/StdevEntropy/MaxEntropy/MinAdvantages/MeanAdvantages/StdevAdvantages/MaxAdvantages/MinValues/MeanValues/StdevValues/MaxValues/MinValue Loss/MeanValue Loss/StdevValue Loss/MaxValue Loss/MinPolicy Loss/MeanPolicy Loss/StdevPolicy Loss/MaxPolicy Loss/MinQ/MeanQ/StdevQ/MaxQ/MinTD targets/MeanTD targets/StdevTD targets/MaxTD targets/Minactions/Meanactions/Stdevactions/Maxactions/Min
210.01.01001.01.01001.01001.00.00.0
320.01.02002.02.01001.02002.00.01.0
431000.00.03003.03.01001.03003.0-0.11853024927717788.6270455159129486.27045515912961.01.0509011072599606e-054.393642656353033e-050.00085354025941342131.1514939615153708e-060.000100000000000000032.7105054312137605e-200.00010.00010.0040003890.004471830.0622341860.000479692969999999960.084647050.160140870.45386302-0.260372580.012471605706650260.021538576948446530.08672064238048882-0.049626097812413830.33593499885145770.63680939446047761.3638484370927098-1.3839266445045957
542001.00.04004.04.01001.04004.0-0.204851026059867617.580070175231974175.800701752319981.00.00055093438152050710.00184911375784827920.0237590149044990545.607626462733606e-060.000100000000000000032.7105054312137605e-200.00010.00010.0455379970000000040.091403241.22103210000000020.00102739100000000010.19226570.162435280.44480476-0.25324150.039935824136090730.117287329609084780.5736919507147175-0.264106365010934650.69240213475238650.58927312290232251.3749280698542792-1.507436630113174
653002.00.05005.05.01001.05005.0-0.0213477253549832813.124325999088368131.243259990883640.00.00017039162293968020.0005686761026118580.0048017264343798162.6488642106414773e-060.000100000000000000032.7105054312137605e-200.00010.00010.0142446370.0141740690.107485950.000696061470.387348380.234984190.6344281-0.106788420.098459669998792960.170177267147563950.6471482681083021-0.232085314994695150.85832681631589880.54933965640557961.4005169604031336-1.084873489999208