1
0
mirror of https://github.com/gryf/coach.git synced 2026-01-29 03:25:47 +01:00
Files
coach/rl_coach/traces/Atari_A3C_space_invaders/trace.csv
Itai Caspi 72a1d9d426 Itaicaspi/episode reset refactoring (#105)
* reordering of the episode reset operation and allowing to store episodes only when they are terminated

* reordering of the episode reset operation and allowing to store episodes only when they are terminated

* revert tensorflow-gpu to 1.9.0 + bug fix in should_train()

* tests readme file and refactoring of policy optimization agent train function

* Update README.md

* Update README.md

* additional policy optimization train function simplifications

* Updated the traces after the reordering of the environment reset

* docker and jenkins files

* updated the traces to the ones from within the docker container

* updated traces and added control suite to the docker

* updated jenkins file with the intel proxy + updated doom basic a3c test params

* updated line breaks in jenkins file

* added a missing line break in jenkins file

* refining trace tests ignored presets + adding a configurable beta entropy value

* switch the order of trace and golden tests in jenkins + fix golden tests processes not killed issue

* updated benchmarks for dueling ddqn breakout and pong

* allowing dynamic updates to the loss weights + bug fix in episode.update_returns

* remove docker and jenkins file
2018-09-04 15:07:54 +03:00

11 KiB

1Episode #Training IterIn HeatupER #TransitionsER #EpisodesEpisode LengthTotal stepsEpsilonShaped Training RewardTraining RewardUpdate Target NetworkEvaluation RewardShaped Evaluation RewardSuccess RateLoss/MeanLoss/StdevLoss/MaxLoss/MinLearning Rate/MeanLearning Rate/StdevLearning Rate/MaxLearning Rate/MinGrads (unclipped)/MeanGrads (unclipped)/StdevGrads (unclipped)/MaxGrads (unclipped)/MinEntropy/MeanEntropy/StdevEntropy/MaxEntropy/MinAdvantages/MeanAdvantages/StdevAdvantages/MaxAdvantages/MinValues/MeanValues/StdevValues/MaxValues/MinValue Loss/MeanValue Loss/StdevValue Loss/MaxValue Loss/MinPolicy Loss/MeanPolicy Loss/StdevPolicy Loss/MaxPolicy Loss/Min
210.01.0172.01.0172.0172.00.00.0
320.01.079.01.079.0251.00.00.0
430.01.096.01.096.0347.00.00.0
540.01.0371.01.0371.0718.00.00.0
650.01.0344.01.0344.01062.00.00.0
7612.00.0254.01.0254.01316.00.06.080.00.00.195676490.22059690.68154220.000151080551.79162739999999994.0483294e-051.79169651.79152389999999980.22482600264581080.40424646262153071.0014296770095823-0.010522775352001190.0173540530.014620930.050496485-0.0036649210.106980970.123328574000000010.383138124.9710330000000006e-080.403057250.471602771.4861937-0.009990961999999999
8728.00.0310.01.0310.01626.00.01.030.00.00.0568212940.18504660.749115650.0022540541.79159760000000024.443067e-051.79168761.79140530.047285030235846840.2192762273764290.9940330386161804-0.026180915534496310.065921120.0057950960.079361804000000010.051137620.025158970.093949650.376686334.6014825e-060.084736150.370859861.4722064-0.02690432
9835.00.0130.01.0130.01756.00.00.00.00.00.0089764455000000010.00292426599999999960.0145600960000000020.00529815681.79153956.856453e-051.79165800000000021.791449-0.0089893257245421410.0060629081596849990.0016912072896957395-0.0295973718166351250.0902772840.00393222460.099056260.0835289965.8783422e-054.0506467e-050.000143733151.9011064e-05-0.0161161070.0053216093-0.00930417-0.026238699
10946.00.0216.01.0216.01972.00.03.020.00.00.104522510.238076050.813027560.00769344761.79151019999999986.7988265e-051.79162239999999981.79127939999999990.071670011356472980.30042989471392291.860807538032532-0.035832032561302180.129165160.00684579099999999950.146477420.114211470.0476973470.127825750.428865974.2765655999999996e-050.128370120.402045599999999951.3241034999999999-0.035978295
111054.00.0151.01.0151.02123.00.03.045.00.00.202792990.239255580.64074680000000010.0153524131.79143127.839572e-051.79154410000000031.7912040.147613558811800850.35822547999019920.9790486097335817-0.043574877083301540.150727540.00584481140.165609930.13910810.0750576260.0950390550.235378568.976024400000001e-050.264761840.373086870.9004108000000001-0.032910552
121159.00.086.01.086.02209.00.00.00.00.00.0220771470000000020.00239095440.0247082820.0182080421.79145484.2731140000000004e-051.79157421.7913142-0.0170752538368105880.010092539052040822-0.0002947151660919189-0.057339191436767580.171557580.006849120.191889170.162365480.000196711835.3809068e-050.000274072170.00012477461000000002-0.0306530370.003763158-0.02515122-0.03556459
131272.00.0243.01.0243.02452.00.04.050.00.00.300840650.406145131.22296860000000020.020540951.79107030000000010.00017510611.79139689999999961.7905860.15369699404885370.36593622802929680.9784829616546632-0.073250800371170040.27555940.041790780.364950750.227003380.078766040.1206518040.358532730.000159782490.275410030.495739599999999951.4355379-0.065343626
141378.00.0117.01.0117.02569.00.00.00.00.00.0858602150.00619602400000000050.092458260.075589731.79052620.000155527091.79081811.7902833999999999-0.040540992021560670.0224406094197929-0.002068936824798584-0.088990986347198490.39629280.0089731270.423562299999999950.383562060.00107357649999999990.000163445770.00126482600000000010.0008468464-0.072390840.0038452593-0.06634041-0.07710241
151483.00.092.01.092.02661.00.00.00.00.00.089871150.050773470.177109420.053452551.79069910.000145937831.79089189999999991.7901826-0.046403588727116580.0365554518574147-0.001846909523010254-0.140665590763092040.326588780.0376425120.419031770.29301160.00174479720.00186692540.00497492260.0005580194400000001-0.083089374000000010.04248289-0.052891272999999996-0.15625969
161596.00.0250.01.0250.02911.00.04.030.00.00.248040040.326250400000000050.84717510000000010.0279326851.79063080000000020.000161600421.79090751.78995880.092682621007164330.31626970349440090.9642215371131896-0.09290859103202820.357381940.0153824480.395243170.324628700000000050.054308300000000010.089284970.227842100000000020.00038700350.167504370.376381400000000030.8883538000000001-0.078577496
1716105.00.0173.01.0173.03084.00.03.060.00.00.367585240.44597551.41219010000000010.0616341871.79089360000000020.0001165766951.79115550000000011.79055520.135670168977230780.35726717344359030.9771864414215088-0.163548111915588380.359160200000000040.0362122170000000050.433663220.290715370.073023110.107149840.3153860.00062343750.24174440.465470251.3016641000000002-0.08348486
1817110.00.0100.01.0100.03184.00.01.030.00.00.51128699999999990.78422991.86885750.0217589341.79063130000000030.00018700351.79102621.79035080000000010.19104256816208360.39086522085942360.993660807609558-0.086726725101470950.363922450.026957190.412295070.323565480.094636460.162883670.376758559.845499e-050.342860040.676741361.5145043-0.07307689
1918121.00.0215.01.0215.03399.00.01.05.00.00.288109299999999960.391818231.4596980.088098371.7883620.000277213870000000051.78880260000000011.78745390.00510324254631996150.232628066220086860.9417458772659302-0.136146783828735350.622489040.0188399550.68769560.59776650.027070930.074094330.24933940.000836008050.0120204850.351141331.0632533000000002-0.14330262
2019126.00.092.01.092.03491.00.02.025.00.00.491239520.422012951.19371860000000020.136540651.78971910000000030.000252779781.79006761.78930280000000020.126649208739399920.36827183597798620.9198432564735411-0.122792124748229980.581032450.0273736470.62796760.54784660.075832080.08270450.20315320.00187844359999999980.22555870.379509840.8095318-0.12593104
2120132.00.0104.01.0104.03595.00.00.00.00.00.115575310.017573930.135438930.089594121.79011219.133325500000001e-051.7902321.7898425999999998-0.04825112551450730.0292689629436579250.009157657623291016-0.124708294868469250.49821480.0327075980.54538659999999990.45538130.00159242170000000010.000359401460000000030.0020207230.0011408231-0.0863129050.012666578999999999-0.06612517-0.099929444
2221157.00.0486.01.0486.04081.00.04.050.00.00.23264290.424133151.47176420.0113503471.79093270000000020.00043386141.79144131.78975420000000020.065497075642148670.31328199705736451.830270767211914-0.144970655441284180.342875360.079158510.483244499999999970.214503260.0512177350.13681520.534463054.279909000000001e-050.1174162850.50055941.6238992-0.19046536
2322162.00.097.01.097.04178.00.02.055.00.00.52300140000000010.459921331.0399750.0638838861.79075980000000020.000131742871.79098221.79038690.24611680954694750.42230103587403540.9653756022453308-0.062524408102035520.320469440.0081050955000000010.346068740.300653670.119455810.119482340.257335799999999950.00069400610.444500630.50937029999999991.0379127000000001-0.06295861
2423167.00.095.01.095.04273.00.00.00.00.00.077971430.0062535910.085007990.06881621.79055130.000110418419999999991.79084739999999991.7903417000000001-0.037008398398756980.01797619761499475-0.0031774044036865234-0.076129436492919920.379937260.0218769420000000030.411988140.34819090.00084638270.000151157450.0010465340.00063928636-0.066341720.00587621-0.058958582999999995-0.07316028
2524178.00.0217.01.0217.04490.00.03.040.00.00.301232499999999960.416374121.17560480000000010.034841121.79046110000000010.000427785940000000061.79113719999999991.78962640.073725494146347050.31368621168537231.7823173999786377-0.12466335296630860.476466980.0227718060000000020.536927460.42932590.0519172470.102538520.288725940.00033876310.135387940.41609741.017875-0.09954969
2625183.00.082.01.082.04572.00.01.05.00.00.287555130.312453720.828676340.1031777861.79106160000000018.027599e-051.79118371.79085710.0605868045240640640.299266838175988040.9531143307685852-0.089262604713439940.460909840.0056315119999999990.485083040.45090090.0466156970000000050.078267260.182178510.0013468370.109822440.338428900000000030.6959845-0.08936332
2726193.00.0186.01.0186.04758.00.00.00.00.00.0991813540.0601115970.251320.0423706470000000041.79137895.84025e-051.79150741.7912476000000002-0.0461016759276390160.034978435389355450.01036137342453003-0.144930988550186160.39265310.0587446580.479172650.314108040.00167442780.00199263730.00707082640.00042033980000000004-0.082371860.04802715-0.039025433-0.20672412
2827215.00.0440.01.0440.05198.00.06.0105.00.00.377640400000000040.64691342.54953359999999970.0205463381.79052510.000355964121.79126821.78989170.129251715186096380.40190520446235741.8412196636199951-0.190345436334610.405618880.0455671620.51832090.332863570.08911690.213267240.93067749999999990.000159358489999999980.23033550.639395362.3170965-0.110453404
2928227.00.0238.01.0238.05436.00.04.050.00.00.538553240.853709463.00820950.063364821.78940630000000020.00038726291.79021641.78850290.143356989459557940.44625302095187071.839568018913269-0.136426240205764740.512866560.039005730.59356309999999990.431892340.109846499999999990.24330330.8487660.00055283010000000010.256071420.66816342.149166-0.12197556
3029232.00.086.01.086.05522.00.02.055.00.00.68933059999999990.87640632.20558499999999970.1237638151.78848149999999980.000228449731.78906481.78813530.171384150907397280.41121204011540261.0042281150817869-0.163977444171905520.6000020.0179161130.652563870.57256390.0992339550.14621430.3512340.00241672970.3051960.643872141.4128844999999999-0.13909039
3130242.00.0184.01.0184.05706.00.02.030.00.00.35281140.306963861.19322870.149466771.78578980.000695867931.78703371.7839484-0.0262606352567672680.210695806717928340.8891516923904419-0.23818564414978030.89541489999999990.0556638431.01394640.80066230.0225411710000000020.0462243259999999960.152160360.0017163947-0.0472326650.226072730.582497-0.19729005