1
0
mirror of https://github.com/gryf/coach.git synced 2026-02-16 22:25:47 +01:00
Files
coach/rl_coach/traces/Atari_A3C_LSTM_space_invaders/trace.csv
Itai Caspi 72a1d9d426 Itaicaspi/episode reset refactoring (#105)
* reordering of the episode reset operation and allowing to store episodes only when they are terminated

* reordering of the episode reset operation and allowing to store episodes only when they are terminated

* revert tensorflow-gpu to 1.9.0 + bug fix in should_train()

* tests readme file and refactoring of policy optimization agent train function

* Update README.md

* Update README.md

* additional policy optimization train function simplifications

* Updated the traces after the reordering of the environment reset

* docker and jenkins files

* updated the traces to the ones from within the docker container

* updated traces and added control suite to the docker

* updated jenkins file with the intel proxy + updated doom basic a3c test params

* updated line breaks in jenkins file

* added a missing line break in jenkins file

* refining trace tests ignored presets + adding a configurable beta entropy value

* switch the order of trace and golden tests in jenkins + fix golden tests processes not killed issue

* updated benchmarks for dueling ddqn breakout and pong

* allowing dynamic updates to the loss weights + bug fix in episode.update_returns

* remove docker and jenkins file
2018-09-04 15:07:54 +03:00

8.7 KiB

1Episode #Training IterIn HeatupER #TransitionsER #EpisodesEpisode LengthTotal stepsEpsilonShaped Training RewardTraining RewardUpdate Target NetworkEvaluation RewardShaped Evaluation RewardSuccess RateLoss/MeanLoss/StdevLoss/MaxLoss/MinLearning Rate/MeanLearning Rate/StdevLearning Rate/MaxLearning Rate/MinGrads (unclipped)/MeanGrads (unclipped)/StdevGrads (unclipped)/MaxGrads (unclipped)/MinEntropy/MeanEntropy/StdevEntropy/MaxEntropy/MinAdvantages/MeanAdvantages/StdevAdvantages/MaxAdvantages/MinValues/MeanValues/StdevValues/MaxValues/MinValue Loss/MeanValue Loss/StdevValue Loss/MaxValue Loss/MinPolicy Loss/MeanPolicy Loss/StdevPolicy Loss/MaxPolicy Loss/Min
210.01.0248.01.0248.0248.00.00.0
320.01.0123.01.0123.0371.00.00.0
430.01.088.01.088.0459.00.00.0
540.01.0187.01.0187.0646.00.00.0
650.01.086.01.086.0732.00.00.0
760.01.0331.01.0331.01063.00.00.0
8737.00.0753.01.0753.01816.00.018.0275.00.00.192763760.249041530.82577470.000134551421.79147270.000300573041.79175660000000031.79043350.21193696959341430.40298962566012491.8961015939712524-0.0381092429161071850.0594071559999999960.060258710.20562454-0.00599524939999999950.108793970.135437340.428847041.0928144400000001e-070.379468620.48194471.3135536-0.039071497000000004
9843.00.0107.01.0107.01923.00.00.00.00.00.0152395610.00249961480.0195162540.0118432389999999981.79064170000000010.00027793531.79170819999999981.790199-0.0276092262566089640.013555497071717059-0.002493098378181457-0.0548129230737686160.235264580.0131031520.258802860.21494020.00106585960000000024.0590476e-050.00110933220.0010099161-0.049343170.011472803-0.039747406-0.06998761
10947.00.073.01.073.01996.00.01.025.00.00.369247440.483453661.05295419999999980.0273824279999999971.79047780000000010.000294105561.79170299999999981.79023839999999980.21826987961928050.40748328543034970.9812830686569214-0.056619286537170410.290256230.014016440.30927640.271065560.123171720.17146580.365660970.00191093619999999990.392132850.62708561.2789414-0.0569774
111060.00.0251.01.0251.02247.00.04.030.00.00.438636400000000040.859590232.52172599999999970.0353127831.78564930000000020.00158947869999999991.79160449999999981.78119419999999980.110470045854647970.45112920589958691.8077605962753296-0.144873857498168950.610562260.077534380.799116250.49616610.14714020.313185840.88261190.00505343730.19477450.67804831.7730703-0.1461431
121166.00.0121.01.0121.02368.00.00.00.00.00.148717850.025477660.191082130.122117051.77806680000000020.00278632131.79130430000000021.7759546000000002-0.106071116328239440.05572927975960492-0.009624600410461426-0.197530508041381861.02206730.0340969941.06726320.954221960.0212553260.00123134970.0234167479999999980.019953651-0.189680490.016377756-0.16716708-0.21171494
131271.00.099.01.099.02467.00.00.00.00.00.155550120.046103890.235112280.122600771.77950430000000010.00233331971.79122939999999971.7773554-0.099512973427772530.055938959522401716-0.003077983856201172-0.19113719463348391.00867690.0366911660000000041.08689680.964161160.021083660.00230257070.023518390.018744798-0.176435960.02231863-0.14677188-0.20941082
141398.00.0534.01.0534.03001.00.013.0340.00.00.74376581.13635289999999993.41280560000000040.0259029141.77731860.0039277231.79113691.76426510000000030.12667316358823040.411580959024579361.7730363607406616-0.302608013153076170.95779290000000010.237657931.65048469999999980.71229540.152210740.206070930.7578790.0086163850.226231660.61718211.5555726-0.30132666
1514102.00.073.01.073.03074.00.00.00.00.00.211331590.0276937170000000030.245782430.177973361.77199790.0041243441.79041031.7675488999999998-0.187840455770492520.09607193868630864-0.01820123195648193-0.33488440513610841.81579140000000020.0087189971.83837930000000021.79245690.0564225430.00169113700000000010.057871560.054050256-0.333373160.013935573-0.3163426-0.35047740000000005
1615110.00.0156.01.0156.03230.00.02.035.00.00.624057650.95926499999999992.95721670.129695491.78288230.00248804299999999971.79082269999999991.7776691000000002-0.064901731695447650.30360052584326661.5683243274688718-0.32598519325256351.72114650.0297131331.78079399999999981.6786120.122914740.177454440.55741870.04589054-0.1122900250.436017070.9523574-0.32423943
1716122.00.0238.01.0238.03468.00.01.010.00.00.155547560.0542771970.297416870.088465521.78685870.00079671195000000011.79103420000000011.7857143-0.15733955556696110.126150477726606260.7546746730804443-0.36322581768035891.67295870.175953151.99447720000000021.5029530.052325380.0352341980.159778610.03124919-0.280812560.079151474-0.07227526599999999-0.35482806
1817132.00.0185.01.0185.03653.00.00.00.00.00.085961610.0241572930.130887420.043590921.78658100000000020.00087282789999999991.79141739999999981.7849907999999999-0.11454265018304190.06364297653654805-0.008004307746887207-0.261716723442077641.10012040.173923941.44502870.877435270.0160551830.0052728770.0273439870.009870462-0.204794030.033908524-0.15814927-0.26577490000000004
1918156.00.0469.01.0469.04122.00.07.0300.00.00.440794099999999970.68363509999999992.48706170000000040.0258626681.78060600000000010.00464180531.79153171.77328089999999980.0624537264523298760.34298798141558530.985063374042511-0.201651334762573240.920024450.101202755999999991.13284470.76514589999999990.0914980.131052199999999980.425098660.0075230170.111297190.471856999999999971.4228208-0.21573834
2019170.00.0276.01.0276.04398.00.03.020.00.00.401786630.67280980000000012.40947299999999980.0392252541.76454470.00426484549999999951.7906341.7581539000000002-0.008002275916246270.31898288899205421.685505986213684-0.22163343429565431.13044789999999980.0389056881.20243121.06206889999999990.07869090.16142530.580357130.012360237-0.0113295569999999990.45005821.2747773999999998-0.24353555
2120191.00.0420.01.0420.04818.00.07.0315.00.00.364995180.37053131.15756210.044315291.76367380.0045793041.79010491.75509419999999980.0218208190798759440.30712320930631150.947787582874298-0.21001416444778440.91542809999999990.090962021.12734260000000020.759127260.071791290.093262420.249041899999999980.00624195630.0403304850.340317070.7956849-0.22417025
2221197.00.0104.01.0104.04922.00.01.030.00.00.51443500000000010.7186481.95119810.124532044000000011.7614230.00560957751.79020069999999981.75718560.050702840089797970.31686006978969340.925861954689026-0.162954568862915040.887455340.0126909180.937815370.87516330.064959495999999990.113002020.29096340.0082930570.0784822850.450153740.97864294-0.15680452
2322203.00.0113.01.0113.05035.00.02.015.00.00.641188260.867939052.37679170.181727131.75370299999999980.0061729061.78967711.7492370.087477405071258550.44371137292342381.7083609104156494-0.184635400772094750.95693050.0248230211.00544950.923412740.126941350.233062399999999980.59306510.0095914910.155469850.650496071.4562124-0.19140014
2423212.00.0162.01.0162.05197.00.01.05.00.00.163685980.0410532840.225970340.102907821.76067099999999990.0040701511.78984550000000021.7559046999999999-0.064075947180390360.133530174921220130.855490505695343-0.177024900913238530.833082560.0352288630.91946450.78079960.0203365420.0323997330.106031050.007155789-0.113512520.087162120.11233470599999999-0.18262672
2524216.00.069.01.069.05266.00.00.00.00.00.180229960.0145851650.199337020.163947331.76443289999999990.0057736471.79024339999999981.7603636000000003-0.073773236076037090.04112314147691277-0.0068422555923461905-0.14222496747970580.71451540.0261940450.75141430.68387470.0064491793000000010.000460157549999999960.0067751170.0057984180000000005-0.129041270.008062066-0.118956625-0.13869014
2625238.00.0432.01.0432.05698.00.06.085.00.00.330064360.356179121.30673039999999970.068903321.77171019999999980.0041068221.79021951.76274160.079115852216879530.32495307712685330.9620689153671264-0.132538139820098880.52617190.069309140.70880740000000010.444181999999999970.061081070.094907780.292770530.00230365760.1407080.410828021.2361767-0.12985662