Episode #,Training Iter,In Heatup,ER #Transitions,ER #Episodes,Episode Length,Total steps,Epsilon,Shaped Training Reward,Training Reward,Update Target Network,Evaluation Reward,Shaped Evaluation Reward,Success Rate,Loss/Mean,Loss/Stdev,Loss/Max,Loss/Min,Learning Rate/Mean,Learning Rate/Stdev,Learning Rate/Max,Learning Rate/Min,Grads (unclipped)/Mean,Grads (unclipped)/Stdev,Grads (unclipped)/Max,Grads (unclipped)/Min,Discounted Return/Mean,Discounted Return/Stdev,Discounted Return/Max,Discounted Return/Min,Q/Mean,Q/Stdev,Q/Max,Q/Min 1,0.0,1.0,1117.0,1117.0,1117.0,1117.0,1.0,,,0.0,,,,,,,,,,,,,,,,-1.5180229894995567,0.6998808293377133,-0.08930329112720292,-3.148474706421977,,,, 2,205.0,0.0,1937.0,1937.0,820.0,1937.0,0.9991882000000176,-21.0,-21.0,0.0,,,,36.34445524564603,41.993139542609136,200.13507080078125,2.573660135269165,5.000000000000001e-05,6.776263578034403e-21,5e-05,5e-05,12.7432995,10.766466000000001,81.03363,3.42582,-2.3361342922088504,0.784322378590693,-0.38878391807422696,-3.369599601005491,,,, 3,413.0,0.0,2768.0,2768.0,831.0,2768.0,0.9983655100000356,-21.0,-21.0,0.0,,,,37.543445411783,41.50272860065348,266.24258422851557,2.552807331085205,5.0000000000000016e-05,1.3552527156068802e-20,5e-05,5e-05,38.67631,34.089546,285.27133,8.399524000000001,-2.320394201181889,0.6047235028955231,-0.7105532272722921,-3.350537576335216,-0.018642622935246142,0.009556171435240226,-0.004845094565243928,-0.03212798437627499 4,667.0,0.0,3783.0,3783.0,1015.0,3783.0,0.9973606600000572,-20.0,-20.0,0.0,,,,35.145844577804326,33.76140476143935,133.99909973144528,3.20855712890625,5.000000000000001e-05,6.776263578034403e-21,5e-05,5e-05,51.623084999999996,26.665253000000003,171.20616,25.264019,-1.7531357837449677,0.7448577440634202,-0.1288331810939122,-3.2971074888190803,-0.04896596012507265,0.012271451729615605,-0.028249878564020038,-0.0641258182382444 5,867.0,0.0,4585.0,4585.0,802.0,4585.0,0.9965666800000744,-21.0,-21.0,0.0,,,,33.21861356735229,33.802230749750514,171.11648559570312,3.1988396644592285,5.000000000000001e-05,6.776263578034403e-21,5e-05,5e-05,51.510967,29.626279999999998,225.32596,27.11263,-2.406465837413259,0.5636980823469648,-0.7105532272722921,-3.36383697254212,-0.03616584410942476,0.011201886243875388,-0.015056172198383141,-0.05481453027576209