1
0
mirror of https://github.com/gryf/coach.git synced 2026-01-06 13:54:21 +01:00
Files
coach/rl_coach/traces/Atari_QR_DQN_pong/trace.csv
2018-10-02 17:55:16 +03:00

2.2 KiB

1Episode #Training IterIn HeatupER #TransitionsER #EpisodesEpisode LengthTotal stepsEpsilonShaped Training RewardTraining RewardUpdate Target NetworkEvaluation RewardShaped Evaluation RewardSuccess RateLoss/MeanLoss/StdevLoss/MaxLoss/MinLearning Rate/MeanLearning Rate/StdevLearning Rate/MaxLearning Rate/MinGrads (unclipped)/MeanGrads (unclipped)/StdevGrads (unclipped)/MaxGrads (unclipped)/MinDiscounted Return/MeanDiscounted Return/StdevDiscounted Return/MaxDiscounted Return/MinQ/MeanQ/StdevQ/MaxQ/Min
210.01.01117.01117.01117.01117.01.00.0-1.51802298949955670.6998808293377133-0.08930329112720292-3.148474706421977
32205.00.01937.01937.0820.01937.00.9991882000000176-21.0-21.00.036.3444552456460341.993139542609136200.135070800781252.5736601352691655.000000000000001e-056.776263578034403e-215e-055e-0512.743299510.76646600000000181.033633.42582-2.33613429220885040.784322378590693-0.38878391807422696-3.369599601005491
43413.00.02768.02768.0831.02768.00.9983655100000356-21.0-21.00.037.54344541178341.50272860065348266.242584228515572.5528073310852055.0000000000000016e-051.3552527156068802e-205e-055e-0538.6763134.089546285.271338.399524000000001-2.3203942011818890.6047235028955231-0.7105532272722921-3.350537576335216-0.0186426229352461420.009556171435240226-0.004845094565243928-0.03212798437627499
54667.00.03783.03783.01015.03783.00.9973606600000572-20.0-20.00.035.14584457780432633.76140476143935133.999099731445283.208557128906255.000000000000001e-056.776263578034403e-215e-055e-0551.62308499999999626.665253000000003171.2061625.264019-1.75313578374496770.7448577440634202-0.1288331810939122-3.2971074888190803-0.048965960125072650.012271451729615605-0.028249878564020038-0.0641258182382444
65867.00.04585.04585.0802.04585.00.9965666800000744-21.0-21.00.033.2186135673522933.802230749750514171.116485595703123.19883966445922855.000000000000001e-056.776263578034403e-215e-055e-0551.51096729.626279999999998225.3259627.11263-2.4064658374132590.5636980823469648-0.7105532272722921-3.36383697254212-0.036165844109424760.011201886243875388-0.015056172198383141-0.05481453027576209