mirror of
https://github.com/gryf/coach.git
synced 2026-03-05 00:15:50 +01:00
* reordering of the episode reset operation and allowing to store episodes only when they are terminated
* reordering of the episode reset operation and allowing to store episodes only when they are terminated
* revert tensorflow-gpu to 1.9.0 + bug fix in should_train()
* tests readme file and refactoring of policy optimization agent train function
* Update README.md
* Update README.md
* additional policy optimization train function simplifications
* Updated the traces after the reordering of the environment reset
* docker and jenkins files
* updated the traces to the ones from within the docker container
* updated traces and added control suite to the docker
* updated jenkins file with the intel proxy + updated doom basic a3c test params
* updated line breaks in jenkins file
* added a missing line break in jenkins file
* refining trace tests ignored presets + adding a configurable beta entropy value
* switch the order of trace and golden tests in jenkins + fix golden tests processes not killed issue
* updated benchmarks for dueling ddqn breakout and pong
* allowing dynamic updates to the loss weights + bug fix in episode.update_returns
* remove docker and jenkins file
7 lines
1.8 KiB
CSV
Episode #,Training Iter,In Heatup,ER #Transitions,ER #Episodes,Episode Length,Total steps,Epsilon,Shaped Training Reward,Training Reward,Update Target Network,Evaluation Reward,Shaped Evaluation Reward,Success Rate,Loss/Mean,Loss/Stdev,Loss/Max,Loss/Min,Learning Rate/Mean,Learning Rate/Stdev,Learning Rate/Max,Learning Rate/Min,Grads (unclipped)/Mean,Grads (unclipped)/Stdev,Grads (unclipped)/Max,Grads (unclipped)/Min,Q/Mean,Q/Stdev,Q/Max,Q/Min
1,0.0,1.0,1117.0,1117.0,1117.0,1117.0,1.0,,,0.0,,,,,,,,,,,,,,,,,,,
2,205.0,0.0,1937.0,1937.0,820.0,1937.0,0.9991882000000176,-21.0,-21.0,0.0,,,,0.013271789207150194,0.014381215654183937,0.08661144971847534,7.284892490133643e-05,0.0002500000000000001,1.0842021724855042e-19,0.00025,0.00025,0.09793413,0.109029554,1.2459028,0.010081228000000001,,,,
3,413.0,0.0,2768.0,2768.0,831.0,2768.0,0.9983655100000356,-21.0,-21.0,0.0,,,,0.013095782662258687,0.014563835652836424,0.09017306566238403,4.85398450109642e-05,0.0002500000000000001,1.0842021724855042e-19,0.00025,0.00025,0.06699568,0.10204898,0.9738844000000001,0.005621953000000001,-0.06337769,0.006071376999999999,-0.05691424,-0.07540042
4,667.0,0.0,3783.0,3783.0,1015.0,3783.0,0.9973606600000572,-20.0,-20.0,0.0,,,,0.014243900448040163,0.012460161619208224,0.05600857362151146,8.375291145057417e-06,0.0002500000000000001,5.421010862427521e-20,0.00025,0.00025,0.08014218,0.05026457,0.24418142,0.0018464670999999999,-0.08484802400000001,0.007937772,-0.07532068,-0.09821871
5,867.0,0.0,4585.0,4585.0,802.0,4585.0,0.9965666800000744,-21.0,-21.0,0.0,,,,0.0149451127843804,0.012661744241431476,0.057885006070137024,2.08603323699208e-05,0.0002500000000000001,5.421010862427521e-20,0.00025,0.00025,0.084665276,0.07432766,0.39534,0.0034519034000000002,-0.09767585,0.029707237999999997,-0.061746947,-0.13731477
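
Many cells in this trace are empty (for example, the reward columns during heatup and the Q columns before any values are logged), so a reader that converts every cell to a float will fail on them. The following is a minimal sketch of one way to load such a trace with Python's standard `csv` module, mapping empty cells to `None`. The `parse_trace` helper and the `SAMPLE` constant are hypothetical names introduced here for illustration, not part of the repository; `SAMPLE` is an excerpt using a subset of the columns from episodes 1 and 2 above.

```python
import csv
import io

# Excerpt of the trace above: a subset of columns from episodes 1 and 2.
# SAMPLE and parse_trace are illustrative names, not part of the repository.
SAMPLE = """Episode #,Training Iter,In Heatup,Episode Length,Epsilon,Training Reward
1,0.0,1.0,1117.0,1.0,
2,205.0,0.0,820.0,0.9991882000000176,-21.0
"""


def parse_trace(text):
    """Parse trace CSV text into a list of dicts; empty cells become None."""
    rows = []
    for record in csv.DictReader(io.StringIO(text)):
        rows.append({
            name: (float(value) if value != "" else None)
            for name, value in record.items()
        })
    return rows


rows = parse_trace(SAMPLE)
for row in rows:
    print(row["Episode #"], row["In Heatup"], row["Training Reward"])
```

Keeping missing values as `None` rather than `0.0` preserves the distinction between "not logged yet" and a genuine zero, which matters when comparing a new run against a stored trace.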