1
0
mirror of https://github.com/gryf/coach.git synced 2025-12-29 09:52:31 +01:00
Files
coach/rl_coach/traces/Fetch_DDPG_HER_baselines_push/trace.csv
Itai Caspi 72a1d9d426 Itaicaspi/episode reset refactoring (#105)
* reordering of the episode reset operation and allowing to store episodes only when they are terminated

* reordering of the episode reset operation and allowing to store episodes only when they are terminated

* revert tensorflow-gpu to 1.9.0 + bug fix in should_train()

* tests readme file and refactoring of policy optimization agent train function

* Update README.md

* Update README.md

* additional policy optimization train function simplifications

* Updated the traces after the reordering of the environment reset

* docker and jenkins files

* updated the traces to the ones from within the docker container

* updated traces and added control suite to the docker

* updated jenkins file with the intel proxy + updated doom basic a3c test params

* updated line breaks in jenkins file

* added a missing line break in jenkins file

* refining trace tests ignored presets + adding a configurable beta entropy value

* switch the order of trace and golden tests in jenkins + fix golden tests processes not killed issue

* updated benchmarks for dueling ddqn breakout and pong

* allowing dynamic updates to the loss weights + bug fix in episode.update_returns

* remove docker and jenkins file
2018-09-04 15:07:54 +03:00

25 KiB

1Episode #Training IterIn HeatupER #TransitionsER #EpisodesEpisode LengthTotal stepsEpsilonShaped Training RewardTraining RewardUpdate Target NetworkEvaluation RewardShaped Evaluation RewardSuccess RateLoss/MeanLoss/StdevLoss/MaxLoss/MinLearning Rate/MeanLearning Rate/StdevLearning Rate/MaxLearning Rate/MinGrads (unclipped)/MeanGrads (unclipped)/StdevGrads (unclipped)/MaxGrads (unclipped)/MinEntropy/MeanEntropy/StdevEntropy/MaxEntropy/MinAdvantages/MeanAdvantages/StdevAdvantages/MaxAdvantages/MinValues/MeanValues/StdevValues/MaxValues/MinValue Loss/MeanValue Loss/StdevValue Loss/MaxValue Loss/MinPolicy Loss/MeanPolicy Loss/StdevPolicy Loss/MaxPolicy Loss/MinQ/MeanQ/StdevQ/MaxQ/MinTD targets/MeanTD targets/StdevTD targets/MaxTD targets/Minactions/Meanactions/Stdevactions/Maxactions/Min
210.01.0246.01.050.050.00.10.0
320.01.0492.02.050.0100.00.10.0
430.01.0738.03.050.0150.00.10.0
540.01.0984.04.050.0200.00.10.0
650.01.01230.05.050.0250.00.10.0
760.01.01476.06.050.0300.00.10.0
870.01.01722.07.050.0350.00.10.0
980.01.01968.08.050.0400.00.10.0
1090.01.02214.09.050.0450.00.10.0
11100.01.02460.010.050.0500.00.10.0
12110.01.02706.011.050.0550.00.10.0
13120.01.02952.012.050.0600.00.10.0
14130.01.03198.013.050.0650.00.10.0
15140.01.03444.014.050.0700.00.10.0
16150.01.03690.015.050.0750.00.10.0
17160.01.03936.016.050.0800.00.10.0
18170.01.04182.017.050.0850.00.10.0
19180.01.04428.018.050.0900.00.10.0
20190.01.04674.019.050.0950.00.10.0
21200.01.04920.020.050.01000.00.10.0
22210.01.05166.021.050.01050.00.11.0
232240.00.05412.022.050.01100.00.1-50.0-50.01.02.0331693449988960.85039153717078953.17893824353814130.26419118046760560.0010.00.0010.0010.357799830.413481532.31909280.08294742599999999-0.0838667450.036039210.014063141999999999-0.18772064-0.173612418940683720.329879457992082570.0-0.94411998614668840.25925594721652280.39019207933879641.227823900542619-0.9372007403745176
242380.00.05658.023.050.01150.00.1-50.0-50.01.0-0.696006950.20132177-0.40700984-1.0752876-0.20177484562799770.440145232481425730.9568684479780788-1.1499666634627257
2524120.00.05904.024.050.01200.00.1-50.0-50.01.0-0.26251920.077570975-0.15028442-0.46637243-0.080472363057172660.44059589477185720.9849893537921456-0.9840788640688172
2625160.00.06150.025.050.01250.00.1-50.0-50.00.0-0.318834360.10105262-0.18117407-0.55365396-0.23811243504157920.76449462623181731.3687597388090078-1.3640676098882232
2726200.00.06396.026.050.01300.00.1-50.0-50.01.0-0.449665580.057271924-0.34354517-0.56195430.092250220959529270.46741301287710210.9948808575127016-0.98098311874754
2827240.00.06642.027.050.01350.00.1-50.0-50.01.0-0.38871630.12020039-0.10681721599999999-0.58714919999999990.028826149252134680.72453126113293251.414451093509742-1.3259532360525004
2928280.00.06888.028.050.01400.00.1-50.0-50.01.0-0.089469840.036168184-0.04535828-0.19107908-0.063796641089119270.61242313841435611.1417325493151642-1.434773825976917
3029320.00.07134.029.050.01450.00.1-50.0-50.01.0-0.247654860.15735027-0.06830712-0.662744160.34110599910549330.74703997472706031.3737038443238658-1.3058005285253458
3130360.00.07380.030.050.01500.00.1-45.0-45.00.0-0.478536960.179215490.04704136-0.6201680.0162443989597939250.85057966292811351.6498611085007715-1.3084653584859094
3231400.00.07626.031.050.01550.00.1-50.0-50.01.0-0.92239789999999990.09895678599999999-0.7069314-1.1365553-0.33071998702252280.73011580535099971.1335083099505938-1.3476448528721057
3332440.00.07872.032.050.01600.00.1-50.0-50.01.0-1.02534120.027820772-0.9528443999999999-1.1011435-0.026306302831705280.72735594321519191.7432267339596406-1.3678844942920394
3433480.00.08118.033.050.01650.00.1-50.0-50.01.0-0.79371669999999990.056945752-0.72814316-0.958698750.04270602237272640.64950705522250941.3751033320077548-1.2825470697801584
3534520.00.08364.034.050.01700.00.1-50.0-50.01.0-1.33049380.07710761-1.1769803-1.4846857-0.33679123927902570.86044088593598181.4170751078702792-1.5739572669313346
3635560.00.08610.035.050.01750.00.1-50.0-50.00.0-0.224559220.12441484-0.06715275-0.53386915-0.154317417136837550.81081481960662681.3198284880850988-1.4052889829358401
3736600.00.08856.036.050.01800.00.1-50.0-50.01.0-1.63646880.11232845-1.4239153999999998-1.80077680.125387687737061160.77738260344652581.4270365165578809-1.3059165236608286
3837640.00.09102.037.050.01850.00.1-50.0-50.01.0-1.16009190.09548019-0.9979508-1.3798517-0.21014371225068410.72990849395688371.3239703718715294-1.3405360693743478
3938680.00.09348.038.050.01900.00.1-50.0-50.01.0-1.33053489999999970.116743065-1.0830333-1.5264891-0.131125539418453620.72112289841731591.3996307691678809-1.3468167621102938
4039720.00.09594.039.050.01950.00.1-50.0-50.01.0-1.29722689999999980.058427718-1.2196242-1.52835130000000020.256714108751666560.80363342629878211.4302267688224153-1.3225048658422485
4140760.00.09840.040.050.02000.00.1-50.0-50.00.0-1.46971170.038868353-1.4092494-1.5404080.22768686170900170.81116930090162681.368896955665806-1.5112654330797746
4241800.00.010086.041.050.02050.00.1-50.0-50.01.0-1.72282999999999990.022774475-1.6553037-1.7921136999999998-0.199727055957789640.72671291291390681.2971051014653034-1.5348926940943108
4342840.00.010332.042.050.02100.00.1-50.0-50.01.0-0.74139019999999990.14767262-0.3261515-1.12714080.22191548027280160.76866963997540691.497686934130685-1.4622850936414942
4443880.00.010578.043.050.02150.00.1-50.0-50.01.0-0.75004250000000010.20472693-0.40012062-0.99210334-0.305229872925600170.68262760546132211.2977333109631322-1.4360051523566801
4544920.00.010824.044.050.02200.00.1-50.0-50.01.0-1.37617860.04053226-1.2902478000000002-1.48446019999999980.40241270204238760.65354214132048981.4943356736073987-1.3483212559880042
4645960.00.011070.045.050.02250.00.1-50.0-50.00.0-1.87833989999999980.03977133-1.7768986-1.96972910000000010.042693147750766260.72474936869738781.2452966579051927-1.60697939463197
47461000.00.011316.046.050.02300.00.1-50.0-50.01.0-1.09498250.18261002-0.8952986999999999-1.67893669999999970.25518583727579680.89802877866944821.48400020012413-1.4256590199749255
48471040.00.011562.047.050.02350.00.1-50.0-50.01.0-1.09972619999999990.13790886-0.88532746-1.3162924999999999-0.106517690541559860.74679909168189711.3751687413337867-1.2916148280605215
49481080.00.011808.048.050.02400.00.1-50.0-50.01.0-1.95684210000000030.031073198-1.8812268-2.03528519999999970.023925235895865090.87911997687429081.5540368983189357-1.4150253806918207
50491120.00.012054.049.050.02450.00.1-50.0-50.01.0-1.42409680.0695922-1.3224534-1.6216253999999999-0.166870280490443220.77479901279512011.2533364593219234-1.3332732117039794
51501160.00.012300.050.050.02500.00.1-50.0-50.00.0-1.45118550.061609317000000004-1.3876598-1.61533740.45681015334466690.65640391404407321.4522356818687945-1.3766052101729644
52511200.00.012546.051.050.02550.00.1-50.0-50.01.0-2.25969889999999960.21262656-1.7902919999999998-2.44971730.32299708720660220.86985736757876711.5756605624822606-1.5719706032018492
53521240.00.012792.052.050.02600.00.1-50.0-50.01.0-3.47916340.63583505-1.8924183-3.991961-0.214878024852256540.76547509852448281.2148287999055831-1.4314497739134042
54531280.00.013038.053.050.02650.00.1-50.0-50.01.0-1.8877330.02124146-1.8334663-1.92629520000000020.219533381262009050.75001215552345511.5712411644872346-1.2143796661660864
55541320.00.013284.054.050.02700.00.1-50.0-50.01.0-1.83989189999999980.15510646-1.6592069999999999-2.1804787999999995-0.30647000783430710.72949357311406581.5306664199048594-1.3552070396446425
56551360.00.013530.055.050.02750.00.1-50.0-50.00.0-1.41412460.03262754-1.2785368000000001-1.49132660.083776883912110340.74082871053253631.2601677357389125-1.2046826144243492
57561400.00.013776.056.050.02800.00.1-50.0-50.01.0-1.65040299999999980.07072988-1.5547342-1.8263569-0.049900084204637720.70796226034110891.322484301451692-1.5006165730557879
58571440.00.014022.057.050.02850.00.1-50.0-50.01.0-0.707841630.39850888-0.15071458-1.48156070.0137071520267470740.83704269759255581.6409586994157928-1.432337582474792
59581480.00.014268.058.050.02900.00.1-50.0-50.01.0-2.22549530.03305099-2.160102-2.3088424-0.27577343788292250.73499662277225131.179067873081431-1.4301928495386762
60591520.00.014514.059.050.02950.00.1-50.0-50.01.0-1.6258790.12703513-1.3964875-1.8966624-0.0160552971689503570.75740661580554321.3780487650322228-1.4446393671908773
61601560.00.014760.060.050.03000.00.1-50.0-50.00.0-2.81251860.26148632-1.9427994-3.08559129999999950.58662555162280620.64296636527215291.6102285400431031-1.339575257666099
62611600.00.015006.061.050.03050.00.1-50.0-50.01.0-2.37789490000000030.12269698-2.0949543-2.582192-0.29831241312461550.81173399495873171.4650833701662742-1.4001502673363824
63621640.00.015252.062.050.03100.00.1-50.0-50.01.0-1.92465269999999980.22166970000000003-1.629804-2.34006790.0232671790675639430.82420129864316981.4072669005631666-1.4290240879624103
64631680.00.015498.063.050.03150.00.1-50.0-50.01.0-1.10051330.05581179-1.0090681000000001-1.29135760.387081323088713960.61126928617896561.5019505807089404-1.1291811534989495
65641720.00.015744.064.050.03200.00.1-50.0-50.01.0-3.73725270.50254893-2.224364-4.1089597-0.53861144123938120.68561630391919121.3840677228595573-1.4878649054376292
66651760.00.015990.065.050.03250.00.1-50.0-50.00.0-0.449433200000000030.12588142-0.18943691-0.60587750.26104109881280020.81082639141504241.4367657586692713-1.4549956358951526
67661800.00.016236.066.050.03300.00.1-50.0-50.01.0-1.93648300000000020.11262418-1.7692693-2.16098279999999980.0067782356098717680.74916627309898711.4197496527888105-1.3352868002453735
68671840.00.016482.067.050.03350.00.1-50.0-50.01.0-1.37494430000000010.32924566-1.0613506000000001-1.9931729-0.162556972190493780.76096161444080351.283837952846872-1.5003897122572196
69681880.00.016728.068.050.03400.00.1-50.0-50.01.0-2.68077560.051952362-2.5637708-2.7463780.50870350094688930.65362265794093061.4450647015885665-1.4680872109611465
70691920.00.016974.069.050.03450.00.1-50.0-50.01.0-1.9641620.20525181-1.5283898999999999-2.1946072999999995-0.18422852495603120.83365281002528621.4655713363705043-1.3965235642365534
71701960.00.017220.070.050.03500.00.1-50.0-50.00.0-1.90102219999999990.23892573-1.6515872-2.585541-0.48079794133704450.71038961145462491.1969149402246186-1.4034559769984063
72712000.00.017466.071.050.03550.00.1-50.0-50.01.0-1.73137180.27999409999999997-1.3405614-2.28412079999999970.210349333720199840.85104877268348791.3737059403773917-1.6120096897674407
73722040.00.017712.072.050.03600.00.1-50.0-50.01.0-2.72055050000000030.13725172-2.5462599-3.00639250.27390595550783870.75366878625220231.4429531858860387-1.3420880015785193
74732080.00.017958.073.050.03650.00.1-50.0-50.01.0-3.36744260.09728077-3.2732415-3.6456112999999997-0.35438252075194730.71701713664658631.1274625363413708-1.5088593335279223
75742120.00.018204.074.050.03700.00.1-50.0-50.01.0-1.48659010000000010.2500631-1.0712678-1.90090040.311454643164316260.77991979693472771.455261865487077-1.3463735456440913
76752160.00.018450.075.050.03750.00.1-50.0-50.00.0-1.41733340.07337376-1.2718675-1.58327150.190128592267052530.79768003414245961.4385122345376309-1.4377212587612993
77762200.00.018696.076.050.03800.00.1-50.0-50.01.0-1.83712020000000020.26847792-1.5797471-2.42131880.3509060479539170.8738485152476111.3936589453765458-1.322788803615944
78772240.00.018942.077.050.03850.00.1-50.0-50.01.0-2.7354180.10068395-2.5927787-2.945773-0.415153013185131040.70490037007840121.4338445369816888-1.4077773226900596
79782280.00.019188.078.050.03900.00.1-50.0-50.01.0-0.55463170.21695954-0.17885058-0.9201853000000001-0.320070608797326630.84489285791139021.353307753915339-1.4494403592041702
80792320.00.019434.079.050.03950.00.10.00.01.0-0.147099210.13921742-0.023953345-0.5407897-0.0147695511407483020.77871879962971741.3218870738761437-1.4582294864115697
81802360.00.019680.080.050.04000.00.10.00.00.0-0.10236660.04695243-0.027832968-0.209381430.066083239070648250.88609166995050631.5120961475676309-1.4138220257875758
82812400.00.019926.081.050.04050.00.1-50.0-50.01.0-2.10143570000000060.20277213-1.6412936000000002-2.3556743-0.037979351393697770.74624969856793191.3328335264201845-1.4144462370541617
83822440.00.020172.082.050.04100.00.10.00.01.0-1.01061890.13590429999999998-0.6872054-1.3695973-0.219140194528893880.8097595322508921.3152777893070762-1.4194264280294104
84832480.00.020418.083.050.04150.00.1-50.0-50.01.0-2.31110999999999980.3365883-1.9075852999999998-2.9735315-0.45305672868280540.57826780537591271.022169542183727-1.3848625461049413
85842520.00.020664.084.050.04200.00.1-50.0-50.01.0-3.06843110.13012081-2.7071897999999996-3.23341079999999970.150702787345546650.83379259960169461.406189822130373-1.494632426377883
86852560.00.020910.085.050.04250.00.1-50.0-50.00.0-4.42231130.76218843-3.247564-6.755247-0.072927562972999240.93487575216210211.4692699356571117-1.48582620951939
87862600.00.021156.086.050.04300.00.1-50.0-50.01.0-2.16716460.22452615-1.7827939999999998-2.69684550000000020.29671813260309030.75668495632171211.3762094739688924-1.4500447872494764
88872640.00.021402.087.050.04350.00.10.00.01.0-0.802648540.08630416-0.64508265-1.0452366-0.014937141321807160.71834681913631251.4561862287948335-1.344690838645084
89882680.00.021648.088.050.04400.00.1-50.0-50.01.0-2.95035670000000040.42755370000000004-2.4801517000000004-4.0419350.20230713719870840.78466382557564541.243425110484488-1.3571410914237387
90892720.00.021894.089.050.04450.00.1-50.0-50.01.0-3.58053570000000040.27993327-3.1748416-4.21802430.34523275223795480.85205207362145421.4941343959471864-1.3648308606190092
91902760.00.022140.090.050.04500.00.10.00.00.0-1.01210930.17985001-0.7327166-1.41004940.140606478054570660.84583324507714061.449001352593843-1.389704211843393
92912800.00.022386.091.050.04550.00.1-50.0-50.01.0-4.17119740000000050.07215533-3.9150252-4.2978926-0.39873909816078280.72878150487910351.2803267238292884-1.4170843170713543
93922840.00.022632.092.050.04600.00.1-50.0-50.01.0-2.52026320.11855404-2.2732716-2.6704520.063382432759018280.89585519877218711.545837059254212-1.5648305810853482
94932880.00.022878.093.050.04650.00.1-50.0-50.01.0-2.7114510.17342134-2.4022179-3.0157795-0.20796178088249920.73351268359583121.3720575666959482-1.3778438211359514
95942920.00.023124.094.050.04700.00.1-50.0-50.01.0-3.29964110.038059222999999996-3.2428157000000004-3.4113495000000005-0.0133170633706896880.91804872315838291.486300487899365-1.4310042088978487
96952960.00.023370.095.050.04750.00.1-50.0-50.00.0-1.96503410.19960472-1.7070503000000001-2.34362630.345092158397782340.70606694433604861.4322506235176529-1.306946924303393
97963000.00.023616.096.050.04800.00.1-50.0-50.01.0-3.07832570.22825454-2.701143-3.52385950.105300863023831240.8718777244766741.4465424068411503-1.489341601833312
98973040.00.023862.097.050.04850.00.1-50.0-50.01.0-3.34039430.106763296-3.176488-3.67366120.131374097769106160.60546136256634751.2549000925446976-1.2520509921026246
99983080.00.024108.098.050.04900.00.1-50.0-50.01.0-2.90011139999999970.18058929999999998-2.6664875-3.22100690.31138305723943150.78799148738794211.5280138366902059-1.236833491780251
100993120.00.024354.099.050.04950.00.1-50.0-50.01.0-2.77277660.19101027-2.501236-3.16672590.22315770636292250.78498081403746231.386135651160025-1.348654604041181
1011003160.00.024600.0100.050.05000.00.1-50.0-50.00.0-4.2719580000000010.12707190000000002-3.9653187000000005-4.445610.089553063176791290.85365409993944011.4308109345746776-1.4848914135150824
1021013200.00.024846.0101.050.05050.00.1-50.0-50.01.0-3.9281970.18075417-3.5554383-4.148139-0.06370473982332410.79760806379362631.2611650852014078-1.455975304756958
1031023240.00.025092.0102.050.05100.00.1-50.0-50.01.0-4.00133470.037731073999999996-3.922646-4.118436-0.147443813502076940.63115291091387791.2293468502725515-1.422216521388387
1041033280.00.025338.0103.050.05150.00.1-50.0-50.01.0-1.17626390.10546544-0.97625005-1.4112893-0.0022774596991363710.90968398475686041.5091018704803143-1.7058408247595245
1051043320.00.025584.0104.050.05200.00.1-50.0-50.01.0-2.40673850.19492403-2.162632-2.74801040.07133195960647010.85852371682921311.47737389816903-1.3800267645005382
1061053360.00.025830.0105.050.05250.00.1-50.0-50.00.0-3.61688470.09879131-3.3805120000000004-3.7366817-0.032934650685201990.91170854884233881.5419910576002585-1.4786932589886754
1071063400.00.026076.0106.050.05300.00.1-49.0-49.01.0-0.66089680000000010.42095757-0.43678153-2.62685439999999960.0616561740299177640.87458170860075991.4217855815135148-1.3208729061698978
1081073440.00.026322.0107.050.05350.00.1-50.0-50.01.0-4.24990270.067996174-4.1468744-4.3801136-0.10462695469224340.88236817198196651.5729962033286573-1.4420075455330732
1091083480.00.026568.0108.050.05400.00.1-50.0-50.01.0-3.257070.03467515-3.179968-3.369099-0.189177229087592460.73636744477204731.2525810654838214-1.5368249808373515
1101093520.00.026814.0109.050.05450.00.1-50.0-50.01.0-3.46557550.14110744-3.3056424-3.85005429999999960.4255747574881550.79520090228350421.453834068547324-1.3040695989216449
1111103560.00.027060.0110.050.05500.00.1-50.0-50.00.0-4.4713180.078934245-4.308751999999999-4.65474460.2771839792047730.65167580905973321.4279513171448566-1.031981491060446
1121113600.00.027306.0111.050.05550.00.1-50.0-50.01.0-0.99996790.09350092-0.8654363000000002-1.3219192-0.0539928599802639640.90386534952644881.3643905707399808-1.6959016492633998
1131123640.00.027552.0112.050.05600.00.1-50.0-50.01.0-2.58998780.15193488-2.3156939999999997-2.9048746-0.043250610651228050.87072522833566131.4166294546081797-1.4980873623321107
1141133680.00.027798.0113.050.05650.00.1-50.0-50.01.0-4.4972780.06248033-4.366797-4.6102910.0178044178416782680.9285075864672011.6427103082120351-1.4552612594092345
1151143720.00.028044.0114.050.05700.00.1-50.0-50.01.0-4.36211630.13424858-4.1261806000000005-4.6272870.061388543595731820.82623596578319131.3930191022242877-1.4014646269685735
1161153760.00.028290.0115.050.05750.00.1-50.0-50.00.0-2.0187930.11110189-1.6619618000000003-2.2022383-0.088518980733621370.88259169026769491.4775910578863316-1.5406449648621856
1171163800.00.028536.0116.050.05800.00.1-50.0-50.01.0-3.32938360000000030.15261324-2.8993397-3.5512528-0.39704078471662970.57650319796060570.9428601020895836-1.4464266096290483
1181173840.00.028782.0117.050.05850.00.1-50.0-50.01.0-3.4837940.17306635-3.1404436000000002-3.8201406-0.0143362695929532880.77154842528420081.3425052190199105-1.4244233775406328
1191183880.00.029028.0118.050.05900.00.1-50.0-50.01.0-2.1937370.5184871-1.3577597-2.89143590.138708202680673580.76906187213243691.3768072144565098-1.538222642194293
1201193920.00.029274.0119.050.05950.00.1-50.0-50.01.0-4.23899650.1430504-3.9521208-4.507492500000001-0.35114362008345480.80736679314295591.445177182789321-1.4221231351138957
1211203960.00.029520.0120.050.06000.00.10.00.00.0-0.420775560.21623069-0.18328568-1.0574919-0.0137885538929145480.89718264637984441.4867908187589371-1.3642749753936756