1
0
mirror of https://github.com/gryf/coach.git synced 2025-12-19 04:00:18 +01:00
Files
coach/rl_coach/traces/ControlSuite_DDPG_hopper_hop/trace.csv
2018-10-02 17:55:16 +03:00

2.7 KiB

1Episode #Training IterIn HeatupER #TransitionsER #EpisodesEpisode LengthTotal stepsEpsilonShaped Training RewardTraining RewardUpdate Target NetworkEvaluation RewardShaped Evaluation RewardSuccess RateLoss/MeanLoss/StdevLoss/MaxLoss/MinLearning Rate/MeanLearning Rate/StdevLearning Rate/MaxLearning Rate/MinGrads (unclipped)/MeanGrads (unclipped)/StdevGrads (unclipped)/MaxGrads (unclipped)/MinDiscounted Return/MeanDiscounted Return/StdevDiscounted Return/MaxDiscounted Return/MinEntropy/MeanEntropy/StdevEntropy/MaxEntropy/MinAdvantages/MeanAdvantages/StdevAdvantages/MaxAdvantages/MinValues/MeanValues/StdevValues/MaxValues/MinValue Loss/MeanValue Loss/StdevValue Loss/MaxValue Loss/MinPolicy Loss/MeanPolicy Loss/StdevPolicy Loss/MaxPolicy Loss/MinQ/MeanQ/StdevQ/MaxQ/MinTD targets/MeanTD targets/StdevTD targets/MaxTD targets/Minactions/Meanactions/Stdevactions/Maxactions/Min
210.01.01000.01.01000.01000.00.00.00.00.00.00.0
320.01.02000.02.01000.02000.00.01.00.00.00.00.0
43999.00.03000.03.01000.03000.0-0.0176668301791740030.00.01.00.00294645502918620870.00257013777505706420.027887180447578430.00063944933935999870.000100000000000000034.0657581468206416e-200.00010.00010.133577690.1170934441.29910680000000010.0267595050.00.00.00.00.156860340.06273050.39373034-0.3585922-0.00339002041407130770.158757718750687140.5218342781066895-0.58291207194328310.0185592661454462920.83796396528731711.3133153498255412-1.2431993702510542
541999.00.04000.04.01000.04000.0-0.0399993624787529160.00217800763234963230.021780076323496321.00.00069784114267050120.000349566893308957830.0029745472129434350.00013498334737960250.000100000000000000034.0657581468206416e-200.00010.00010.0414898320000000040.021036760.19020870.0102601642.3785131604782346e-050.000217863015494314030.0021669534470054960.00.103485370.0378285270.58358335-0.202459440.15381982773518620.121149660519220520.6158200460672378-0.37445216059684760.081439894078023250.80943441752634351.2337204414679308-1.3201327969582874
652999.00.05000.05.01000.05000.00.171456014834037050.00.00.00.00048586211955439090.00057127903514315130.007098264992237099.616250463295728e-050.000100000000000000034.0657581468206416e-200.00010.00010.0287589080.021000750.236957710.0077611350.00.00.00.0-0.0825600250.0420220760.32945228-0.196822880.205848643777147340.12826416446043440.6959143048524856-0.15129065528511998-0.226365025950535950.77166596781216031.6171153782369072-1.244061515013705