1
0
mirror of https://github.com/gryf/coach.git synced 2026-01-31 13:05:55 +01:00
Files
coach/rl_coach/traces/Atari_Bootstrapped_DQN_pong/trace.csv
2018-08-20 13:01:30 +03:00

1.4 KiB

1Episode #Training IterIn HeatupER #TransitionsER #EpisodesEpisode LengthTotal stepsEpsilonShaped Training RewardTraining RewardUpdate Target NetworkEvaluation RewardShaped Evaluation RewardSuccess RateLoss/MeanLoss/StdevLoss/MaxLoss/MinLearning Rate/MeanLearning Rate/StdevLearning Rate/MaxLearning Rate/MinGrads (unclipped)/MeanGrads (unclipped)/StdevGrads (unclipped)/MaxGrads (unclipped)/MinQ/MeanQ/StdevQ/MaxQ/Min
210.01.0986.0986.0986.0986.07.00.0
320.01.01806.01806.0820.01806.04.00.0
43207.00.02634.02634.0828.02634.01.0-21.0-21.00.00.0134306944822915050.0127741175140245730.064679197967052460.00050548737635835990.00025000000000000011.0842021724855042e-190.000250.000250.0134625090.0050100040.0321693050.0046610474
54433.00.03538.03538.0904.03538.01.0-21.0-21.00.00.0132142944559939120.0122437767594937710.0485503040254116060.000307276000967249330.00025000000000000011.0842021724855042e-190.000250.000250.0122833485000000010.0046444970.0328481160000000040.0047284905
65664.00.04462.04462.0924.04462.02.0-20.0-20.00.00.0133853601115388850.0139047877204619070.060799419879913320.00050985632697120310.00025000000000000011.0842021724855042e-190.000250.000250.0109436410.00433489540.032608310.00450900480.000665305650.01291220450.024260167000000003-0.034502137