mirror of https://github.com/gryf/coach.git synced 2026-01-09 07:14:19 +01:00
coach/rl_coach/traces/Atari_Bootstrapped_DQN_space_invaders/trace.csv
Itai Caspi 72a1d9d426 Itaicaspi/episode reset refactoring (#105)
* reordering of the episode reset operation and allowing to store episodes only when they are terminated

* reordering of the episode reset operation and allowing to store episodes only when they are terminated

* revert tensorflow-gpu to 1.9.0 + bug fix in should_train()

* tests readme file and refactoring of policy optimization agent train function

* Update README.md

* Update README.md

* additional policy optimization train function simplifications

* Updated the traces after the reordering of the environment reset

* docker and jenkins files

* updated the traces to the ones from within the docker container

* updated traces and added control suite to the docker

* updated jenkins file with the intel proxy + updated doom basic a3c test params

* updated line breaks in jenkins file

* added a missing line break in jenkins file

* refining trace tests ignored presets + adding a configurable beta entropy value

* switch the order of trace and golden tests in jenkins + fix golden tests processes not killed issue

* updated benchmarks for dueling ddqn breakout and pong

* allowing dynamic updates to the loss weights + bug fix in episode.update_returns

* remove docker and jenkins file
2018-09-04 15:07:54 +03:00

7.4 KiB

Episode #,Training Iter,In Heatup,ER #Transitions,ER #Episodes,Episode Length,Total steps,Epsilon,Shaped Training Reward,Training Reward,Update Target Network,Evaluation Reward,Shaped Evaluation Reward,Success Rate,Loss/Mean,Loss/Stdev,Loss/Max,Loss/Min,Learning Rate/Mean,Learning Rate/Stdev,Learning Rate/Max,Learning Rate/Min,Grads (unclipped)/Mean,Grads (unclipped)/Stdev,Grads (unclipped)/Max,Grads (unclipped)/Min,Q/Mean,Q/Stdev,Q/Max,Q/Min
210.01.0269.0269.0269.0269.07.00.0
320.01.0531.0531.0262.0531.08.00.0
430.01.0654.0654.0123.0654.00.00.0
540.01.01173.01173.0519.01173.02.00.0
65100.00.01572.01572.0399.01572.08.010.0310.00.00.0062238444185641130.0082940426630874630.031558956950902949.275647607864812e-050.00025000000000000015.421010862427521e-200.000250.000250.0055928960.00434588640.0150063360.0010827626
76160.00.01812.01812.0240.01812.04.07.0130.00.00.0085648035762763660.0109858524473895520.046010125428438190.000102473866718355550.000250.00.000250.000250.00601276939999999950.0050014010.0208677110.0012939627
87185.00.01914.01914.0102.01914.08.02.015.00.00.0057070021558320150.0095758078550948080.031125554814934730.000170166065800003680.000250.00.000250.000250.00473622930.0040794030.0140574110.0019318176000000002
98229.00.02090.02090.0176.02090.07.02.015.00.00.008642160067020720.0131559377019582980.0611718185245990750.000183603362529538540.000250.00.000250.000250.00630047360.00534490360.024874760.0019890803
109244.00.02149.02149.059.02149.05.02.015.00.00.0066693820585247270.00739840375717085940.0154990153387188910.00023972424969542770.00025000000000000011.0842021724855042e-190.000250.000250.00565973760.00357188539999999960.01010931750.0022593176
1110267.00.02239.02239.090.02239.03.02.035.00.00.00621636606735157140.0087159433386411990.0308718122541904450.000193017229321412770.00025000000000000015.421010862427521e-200.000250.000250.00524482970.0039331815000000010.0137182000000000020.0018610907000000002
1211314.00.02430.02430.0191.02430.08.03.030.00.00.0050649844516227220.0094642198312124490.045666810125112530.000172650543390773240.00025000000000000015.421010862427521e-200.000250.000250.00425122450.00388185330000000030.0165949990.00169678950000000020.0270180310.0104413470.04749950.016540313
1312334.00.02508.02508.078.02508.00.02.045.00.00.0097931049439418870.0112492564802045210.0307167228311300240.00016814749687910080.00025000000000000015.421010862427521e-200.000250.000250.0063306530.00470961960.0140987210000000020.0016581904999999999
1413378.00.02684.02684.0176.02684.05.00.00.00.00.010116173028191870.0131932022680659320.046078924089670180.000146700724144466220.000250.00.000250.000250.00678022900000000050.0053444810.0222798640.00162534299999999980.0272868350.00738640550.033297190.016612418
1514425.00.02872.02872.0188.02872.00.00.00.00.00.007576202671538960.0101811290448463130.030559709295630450.000199503760086372520.00025000000000000015.421010862427521e-200.000250.000250.0058391230000000010.0044261010.018615910.0021889664000000002
1615449.00.02967.02967.095.02967.06.03.030.00.00.00700003486538965450.0112261647649852540.044561173766851430.000211073580430820560.000250.00.000250.000250.0056591590.00463243060.0218399070000000020.0021402768
1716469.00.03049.03049.082.03049.03.00.00.00.00.0129402723294333570.0097187073218165130.0306847691535949670.00030000176047906280.00025000000000000015.421010862427521e-200.000250.000250.0082735070.00358522430.0135935270.0028905305
1817533.00.03306.03306.0257.03306.01.05.055.00.00.0075396916704448810.0091217515162340460.030608570203185080.000221191789023578170.000250.00.000250.000250.006098030.00358674860.0133205650.002192039
1918585.00.03511.03511.0205.03511.07.00.00.00.00.0058381829902132530.0072845208986224850.015736436471343040.000177558831637725230.000250.00.000250.000250.00501612530.00351438140.0102001719999999990.0018448817
2019632.00.03701.03701.0190.03701.05.02.025.00.00.0060307754710207670.0086242054683932420.0308197047561407120.000146195292472839360.00025000000000000015.421010862427521e-200.000250.000250.00494608240.0040091634999999990.0140258010000000010.0014488354999999998
2120680.00.03891.03891.0190.03891.04.00.00.00.00.00621774687412350240.0091857243659319720.0309454686939716370.000161683739861473440.000250.00.000250.000250.00485255430.00414722969999999950.0145704210.0015842235999999998
2221729.00.04090.04090.0199.04090.09.04.050.00.00.00666868948937175250.0096332758591066370.030707109719514850.000162430762429721630.000250.00.000250.000250.0052176420.00449293850.019473690.001701550.0211589780.0117030890.0407021870.0043722745
2322804.00.04390.04390.0300.04390.09.05.060.00.00.0077451595755740750.0102649695952876150.04572815820574760.00017354256124235690.00025000000000000015.421010862427521e-200.000250.000250.0058580709999999990.0046085560.022067570.0016346257
2423862.00.04619.04619.0229.04619.06.03.030.00.00.00739571115072497950.0102384544199354280.0451989173889160160.000184226664714515180.000250.00.000250.000250.0057020480.00447796840.0199119360.0017970852
2524882.00.04699.04699.080.04699.06.00.00.00.00.0070196203269006220.0100974063295118980.0305565539747476580.000218154818867333260.00025000000000000015.421010862427521e-200.000250.000250.00580813130.00486528360.0194177630.0023302154
2625945.00.04951.04951.0252.04951.05.04.050.00.00.0056726152085632620.0076626910033045360.029895598068833350.000172232728800736340.000250.00.000250.000250.0049050780.00366956880.0136048150.00181302130.0310903580.0042599170.0376055170000000050.025178626000000003
2726995.00.05152.05152.0201.05152.08.02.015.00.00.0062991674215300010.0080468251750714130.0306434184312820430.000179758280864916740.000250.00.000250.000250.0051763350.0039150259999999990.0140204020.00172734520.0269281940.0096608420.0418870670.010047999
28271017.00.05242.05242.090.05242.01.02.035.00.00.0057312202440119690.0085596112870022920.030127475038170810.000195014639757573580.00025000000000000015.421010862427521e-200.000250.000250.00515021849999999950.0045667070.019456740.00212610190.021497970.0100333730.033704520.008069546
29281072.00.05462.05462.0220.05462.03.02.055.00.00.0088643557133360040.0118399998793926510.0461340397596359250.00016107557166833430.000250.00.000250.000250.00626777350.00500423969999999950.0201270730.00171920160.020619910.00758857560.0345496240.010614768
30291118.00.05646.05646.0184.05646.04.03.020.00.00.00723528898217611850.0087371812065685310.0304667782038450240.000181028139195404920.000250.00.000250.000250.0057925719999999990.0043183390.0199326090.0020496019
31301175.00.05874.05874.0228.05874.04.05.085.00.00.0083132044689721490.0110422853758254690.0457569323480129240.000196102220797911290.000250.00.000250.000250.0058836340.00469954640.0221683850.0018010390.00739511240.0109065390.025537572999999997-0.0070570237