mirror of https://github.com/gryf/coach.git synced 2026-01-28 02:55:46 +01:00
coach/rl_coach/traces/Atari_NStepQ_pong/trace.csv
2018-08-20 13:01:30 +03:00

1.4 KiB

Episode #,Training Iter,In Heatup,ER #Transitions,ER #Episodes,Episode Length,Total steps,Epsilon,Shaped Training Reward,Training Reward,Update Target Network,Evaluation Reward,Shaped Evaluation Reward,Success Rate,Loss/Mean,Loss/Stdev,Loss/Max,Loss/Min,Learning Rate/Mean,Learning Rate/Stdev,Learning Rate/Max,Learning Rate/Min,Grads (unclipped)/Mean,Grads (unclipped)/Stdev,Grads (unclipped)/Max,Grads (unclipped)/Min,Entropy/Mean,Entropy/Stdev,Entropy/Max,Entropy/Min,Q/Mean,Q/Stdev,Q/Max,Q/Min,Q Values/Mean,Q Values/Stdev,Q Values/Max,Q Values/Min,Value Loss/Mean,Value Loss/Stdev,Value Loss/Max,Value Loss/Min
1,0.0,1.0,1117.0,1.0,1117.0,1117.0,0.5,,,0.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,
2,164.0,0.0,819.0,1.0,819.0,1936.0,0.4919737999999965,-21.0,-21.0,0.0,,,,,,,,,,,,,,,,,,,,,,,,0.08973327,0.10308977,0.35256127,-0.28368974,0.09622141,0.28172967,2.938948,3.398415e-05
3,348.0,0.0,920.0,1.0,920.0,2856.0,0.4829577999999926,-21.0,-21.0,0.0,,,,,,,,,,,,,,,,,,,,,,,,0.085431345,0.038046483,0.26499447,-0.00268347,0.102118276,0.23871702,1.2644383999999997,2.8306908e-05
4,517.0,0.0,843.0,1.0,843.0,3699.0,0.474696399999989,-21.0,-21.0,0.0,,,,,,,,,,,,,,,,,,,,,,,,0.12869294,0.02720576,0.2113828,0.057547163,0.07727617,0.19602461,1.3975663,0.0008099895
5,700.0,0.0,913.0,1.0,913.0,4612.0,0.4657489999999852,-20.0,-20.0,0.0,,,,,,,,,,,,,,,,,,,,,,,,0.16959693,0.053117845,0.3296745,0.055682946,0.04773866400000001,0.14551932,1.2484678999999999,7.813723e-06