1
0
mirror of https://github.com/gryf/coach.git synced 2026-01-07 14:24:16 +01:00
Files
coach/rl_coach/traces/ControlSuite_DDPG_cartpole_swingup/trace.csv
itaicaspi-intel fa4895f840 new traces
2018-09-13 11:47:36 +03:00

3.1 KiB

1Episode #Training IterIn HeatupER #TransitionsER #EpisodesEpisode LengthTotal stepsEpsilonShaped Training RewardTraining RewardUpdate Target NetworkEvaluation RewardShaped Evaluation RewardSuccess RateLoss/MeanLoss/StdevLoss/MaxLoss/MinLearning Rate/MeanLearning Rate/StdevLearning Rate/MaxLearning Rate/MinGrads (unclipped)/MeanGrads (unclipped)/StdevGrads (unclipped)/MaxGrads (unclipped)/MinDiscounted Return/MeanDiscounted Return/StdevDiscounted Return/MaxDiscounted Return/MinEntropy/MeanEntropy/StdevEntropy/MaxEntropy/MinAdvantages/MeanAdvantages/StdevAdvantages/MaxAdvantages/MinValues/MeanValues/StdevValues/MaxValues/MinValue Loss/MeanValue Loss/StdevValue Loss/MaxValue Loss/MinPolicy Loss/MeanPolicy Loss/StdevPolicy Loss/MaxPolicy Loss/MinQ/MeanQ/StdevQ/MaxQ/MinTD targets/MeanTD targets/StdevTD targets/MaxTD targets/Minactions/Meanactions/Stdevactions/Maxactions/Min
210.01.01001.01.01001.01001.00.00.00.18105494375849880.083426124582043740.36571557275900550.012114535848885052
320.01.02002.02.01001.02002.00.01.00.105143695483955470.050430657389200540.215244303476182260.0011643643789458708
431000.00.03003.03.01001.03003.0-0.11853024927717786.7460043343246367.460043343246331.01.0688050798876248e-051.766983092613708e-050.00025916425511240961.702799409031286e-060.000100000000000000032.7105054312137605e-200.00010.00010.00455208260.00300557560.0232467429999999970.000617062730.63038985981110530.141915770529554340.80020506708056120.0129179202341290090.000101499150000000010.178386550.24224899999999996-0.37426380.0317110480670170160.066630389050797130.16946774334467746-0.1425395913783225-1.13300653080411350.2760152561198482-0.11754712369609001-1.564363479104263
542001.00.04004.04.01001.04004.0-0.20485102605986767.66855149099738976.685514909973851.03.408961573131819e-056.750553695485743e-050.00070102675817906871.4872452993586194e-060.000100000000000000032.7105054312137605e-200.00010.00010.0109602710.0130244580000000010.095230780.00070279310.69659295267338990.16229943359430640.94533085685046940.0133246654580325370.00277303530.187881280.250535-0.34311860.0295577905996573450.117113639263771750.2386419371743873-0.3346987397531008-1.1132694119600270.1980527596764508-0.6249308459515377-1.5458447591062914
653002.00.05005.05.01001.05005.0-0.021347725354983287.36812275387001173.68122753870010.01.2014473524686764e-051.2546101794472948e-050.000154118795762769881.0827171763594379e-060.000100000000000000032.7105054312137605e-200.00010.00010.0078430230000000010.00510952950.0380925130.00067158364000000010.669961389322160.157035847183748080.89089326103690350.00603125196745417460.0151756610000000020.181278110.24180134-0.315607850.030807471005781050.136849403890572260.2399127439483701-0.3243346170643305-0.93307138334767590.12577776040520755-0.5994658138545264-1.21464647257472