1
0
mirror of https://github.com/gryf/coach.git synced 2026-01-28 11:05:46 +01:00
Files
coach/rl_coach/traces/Atari_C51_space_invaders/trace.csv
Itai Caspi 72a1d9d426 Itaicaspi/episode reset refactoring (#105)
* reordering of the episode reset operation and allowing to store episodes only when they are terminated

* reordering of the episode reset operation and allowing to store episodes only when they are terminated

* revert tensorflow-gpu to 1.9.0 + bug fix in should_train()

* tests readme file and refactoring of policy optimization agent train function

* Update README.md

* Update README.md

* additional policy optimization train function simplifications

* Updated the traces after the reordering of the environment reset

* docker and jenkins files

* updated the traces to the ones from within the docker container

* updated traces and added control suite to the docker

* updated jenkins file with the intel proxy + updated doom basic a3c test params

* updated line breaks in jenkins file

* added a missing line break in jenkins file

* refining trace tests ignored presets + adding a configurable beta entropy value

* switch the order of trace and golden tests in jenkins + fix golden tests processes not killed issue

* updated benchmarks for dueling ddqn breakout and pong

* allowing dynamic updates to the loss weights + bug fix in episode.update_returns

* remove docker and jenkins file
2018-09-04 15:07:54 +03:00

6.6 KiB

1Episode #Training IterIn HeatupER #TransitionsER #EpisodesEpisode LengthTotal stepsEpsilonShaped Training RewardTraining RewardUpdate Target NetworkEvaluation RewardShaped Evaluation RewardSuccess RateLoss/MeanLoss/StdevLoss/MaxLoss/MinLearning Rate/MeanLearning Rate/StdevLearning Rate/MaxLearning Rate/MinGrads (unclipped)/MeanGrads (unclipped)/StdevGrads (unclipped)/MaxGrads (unclipped)/MinQ/MeanQ/StdevQ/MaxQ/Min
210.01.0486.0486.0486.0486.01.00.0
320.01.0573.0573.087.0573.01.00.0
430.01.0722.0722.0149.0722.01.00.0
540.01.01057.01057.0335.01057.01.00.0
6551.00.01260.01260.0203.01260.00.99979903000000445.055.00.03.9317455478743011.1304143308005733e-053.93177008628845173.93168902397155760.000250.00.000250.000250.00158666370.00231474960.0097205160.0007565144
7670.00.01335.01335.075.01335.00.9997247800000062.015.00.03.9317359296899097.235250669185713e-063.93174552917480473.9317119121551520.00025000000000000015.421010862427521e-200.000250.000250.0017559270.00263736370.0096410580000000010.0007148395500000001
8791.00.01422.01422.087.01422.00.99963865000000781.015.00.03.9317012060256235.099185273705093e-053.931736230850223.9315536022186280.00025000000000000015.421010862427521e-200.000250.000250.002825160.00359935640000000030.0094188969999999990.00070036505
98159.00.01693.01693.0271.01693.00.99937036000001365.055.00.03.9316568766067289.226522325134514e-053.9317162036895753.93115997314453120.000250.00.000250.000250.00171650480.00277710540.0125504289999999980.00063557597
109201.00.01861.01861.0168.01861.00.99920404000001723.050.00.03.931581372306460.000104747463166580293.931651353836063.9311587810516360.000250.00.000250.000250.00177995920.0028529030.0098899380000000010.0006321495
1110279.00.02172.02172.0311.02172.00.9988961500000244.065.00.03.93141543253874160.000222790762736763613.9315812587738043.9300706386566160.00025000000000000015.421010862427521e-200.000250.000250.00264446250.0041013680.0194486470.00062298455
1211407.00.02683.02683.0511.02683.00.99839026000003519.0320.00.03.9312407672405240.000158545534500509323.9313738346099863.93056941032409670.00025000000000000011.0842021724855042e-190.000250.000250.00173628890000000020.00291911630.0112329110000000020.00055188160.0241290949285036250.009043202075280250.041695734858513410.012993395328522341
1312424.00.02754.02754.071.02754.00.99831997000003641.015.00.03.93101345791536170.0003588224460653623.93126201629638633.930197000503540.000250.00.000250.000250.00321913040.00456803659999999950.0118465250.0005631294
1413457.00.02886.02886.0132.02886.00.99818929000003922.025.00.03.9310209304094320.000183485204364128243.931163072586063.93019318580627350.000250.00.000250.000250.00160749660000000010.0027081080.0121899389999999990.00064319023
1514504.00.03074.03074.0188.03074.00.99800317000004323.035.00.03.9310257383014850.000153990330151454693.93110084533691363.93000340461730960.00025000000000000015.421010862427521e-200.000250.000250.00098363070000000010.00155281549999999990.0113359160000000010.00052818370.0218905318528420.0144661960294047970.03762449622154298-0.00010702610015811408
1615528.00.03167.03167.093.03167.00.99791110000004521.015.00.03.9310343680174456.681329033129075e-053.93109107017517133.93081307411193850.00025000000000000015.421010862427521e-200.000250.000250.00162580460.00311570030000000040.0117248289999999990.0004990724
1716598.00.03449.03449.0282.03449.00.99763192000005142.055.00.03.9308654376438690.00028316838142691273.93109250068664553.92969322204589750.000250.00.000250.000250.00216391920000000030.00367563030.0124808250.000475806820.03053268765409850.009230450733712620.04385616704821660.021990178897977525
1817668.00.03729.03729.0280.03729.00.99735472000005744.045.00.03.9306016445159910.00034809498308884733.93086528778076173.92922401428222660.000250.00.000250.000250.00270471400000000040.00421215270.0139037480.00065150360.0296864540005729250.019022406610795210.058909925818443724-0.002814809605478502
1918738.00.04008.04008.0279.04008.00.99707851000006343.045.00.03.9302778141839170.00051108645445392173.9306361675262453.92811989784240720.000250.00.000250.000250.00368878659999999970.00511016560.020865260.0008114038499999999
2019792.00.04223.04223.0215.04223.00.99686566000006794.090.00.03.9301340933199280.000473994902433286243.9305229187011723.92849421501159670.000250.00.000250.000250.00375531429999999970.0048913960.01519082950.00101293089999999990.0304843132073686870.0128614651719134040.043398038670421220.011251759901643564
2120893.00.04630.04630.0407.04630.00.996462730000076810.0115.00.03.9297368502852940.0007663112930023253.9303686618804933.9264266490936280.00025000000000000015.421010862427521e-200.000250.000250.0041803770.0060447417000000010.0312098530.00094574010.03553039214263420.0142153041872774780.051354527473450260.012931596487761264
2221937.00.04805.04805.0175.04805.00.99628948000008061.05.00.03.9304791772088340.000236895371534610933.9306843280792243.9294891357421880.000250.00.000250.000250.00217773230.00346561429999999980.0144145350.00058991817
23221022.00.05146.05146.0341.05146.00.9959518900000883.065.00.03.930464410781860.000357905790758338153.93072628974914553.9290759563446050.00025000000000000015.421010862427521e-200.000250.000250.00245916519999999980.00414252750.0149869680.00047587089999999996
24231120.00.05538.05538.0392.05538.00.99556381000009646.080.00.03.93008188119868640.00049555481556748633.9304697513580323.9282214641571050.00025000000000000015.421010862427521e-200.000250.000250.00271197690.00428048470.0200225450.000579324670.0354768233373767140.0140144760775346260.049271919205785470.007030519843102157
25241165.00.05718.05718.0180.05718.00.99538561000010023.055.00.03.93005857142535130.000534086620968113.93051528930664063.9284012317657470.000250.00.000250.000250.00331727760.0048634772999999990.0160103530.0006072133999999999
26251190.00.05815.05815.097.05815.00.99528958000010240.00.00.03.92998563249905870.00040476713930241333.93024563789367683.92859673500060950.000250.00.000250.000250.00305631760.0044018159999999990.0161883760.0011075826