mirror of https://github.com/gryf/coach.git synced 2026-02-24 11:15:45 +01:00
coach/rl_coach/traces/Atari_Dueling_DDQN_space_invaders/trace.csv
Itai Caspi 72a1d9d426 Itaicaspi/episode reset refactoring (#105)
* reordering of the episode reset operation and allowing to store episodes only when they are terminated

* reordering of the episode reset operation and allowing to store episodes only when they are terminated

* revert tensorflow-gpu to 1.9.0 + bug fix in should_train()

* tests readme file and refactoring of policy optimization agent train function

* Update README.md

* Update README.md

* additional policy optimization train function simplifications

* Updated the traces after the reordering of the environment reset

* docker and jenkins files

* updated the traces to the ones from within the docker container

* updated traces and added control suite to the docker

* updated jenkins file with the intel proxy + updated doom basic a3c test params

* updated line breaks in jenkins file

* added a missing line break in jenkins file

* refining trace tests ignored presets + adding a configurable beta entropy value

* switch the order of trace and golden tests in jenkins + fix golden tests processes not killed issue

* updated benchmarks for dueling ddqn breakout and pong

* allowing dynamic updates to the loss weights + bug fix in episode.update_returns

* remove docker and jenkins file
2018-09-04 15:07:54 +03:00

6.3 KiB

Episode #,Training Iter,In Heatup,ER #Transitions,ER #Episodes,Episode Length,Total steps,Epsilon,Shaped Training Reward,Training Reward,Update Target Network,Evaluation Reward,Shaped Evaluation Reward,Success Rate,Loss/Mean,Loss/Stdev,Loss/Max,Loss/Min,Learning Rate/Mean,Learning Rate/Stdev,Learning Rate/Max,Learning Rate/Min,Grads (unclipped)/Mean,Grads (unclipped)/Stdev,Grads (unclipped)/Max,Grads (unclipped)/Min,Q/Mean,Q/Stdev,Q/Max,Q/Min
1,0.0,1.0,486.0,486.0,486.0,486.0,1.0,,,0.0,,,,,,,,,,,,,,,,,,,
2,0.0,1.0,573.0,573.0,87.0,573.0,1.0,,,0.0,,,,,,,,,,,,,,,,,,,
3,0.0,1.0,722.0,722.0,149.0,722.0,1.0,,,0.0,,,,,,,,,,,,,,,,,,,
4,0.0,1.0,1057.0,1057.0,335.0,1057.0,1.0,,,0.0,,,,,,,,,,,,,,,,,,,
5,51.0,0.0,1260.0,1260.0,203.0,1260.0,0.9997990300000044,5.0,55.0,0.0,,,,0.011159177685519406,0.014670889632016437,0.05586982890963554,0.00014776474563404918,0.00010000000000000002,1.3552527156068802e-20,0.0001,0.0001,0.06980593,0.04550845,0.23672238,0.012701003,,,,
6,70.0,0.0,1335.0,1335.0,75.0,1335.0,0.999724780000006,2.0,15.0,0.0,,,,0.011363721369937258,0.01113743869358625,0.02980226650834084,0.00037189509021118283,0.0001,0.0,0.0001,0.0001,0.061391924,0.02929353,0.110355645,0.014809295,,,,
7,91.0,0.0,1422.0,1422.0,87.0,1422.0,0.9996386500000078,1.0,15.0,0.0,,,,0.011624419426966813,0.013655546794886331,0.04332234337925911,0.0002809247234836221,0.0001,1.3552527156068802e-20,0.0001,0.0001,0.065822564,0.03866646,0.16192491,0.021505926,,,,
8,159.0,0.0,1693.0,1693.0,271.0,1693.0,0.9993703600000136,5.0,55.0,0.0,,,,0.008774163406046885,0.01056874345135505,0.042830634862184525,0.0001717623672448099,0.00010000000000000003,2.7105054312137605e-20,0.0001,0.0001,0.05377827,0.032895934,0.16196515,0.00941183,,,,
9,201.0,0.0,1861.0,1861.0,168.0,1861.0,0.9992040400000172,3.0,50.0,0.0,,,,0.005785983745660079,0.00916921106544553,0.04197276383638382,0.00021147351071704182,0.00010000000000000002,1.3552527156068802e-20,0.0001,0.0001,0.04083436,0.028236978,0.14960715,0.012452139499999999,,,,
10,279.0,0.0,2172.0,2172.0,311.0,2172.0,0.998896150000024,4.0,65.0,0.0,,,,0.009643888652461987,0.012217253908083524,0.056894369423389435,0.00016667474119458348,0.00010000000000000003,4.0657581468206416e-20,0.0001,0.0001,0.0560394,0.039682023,0.1913731,0.008496345,,,,
11,440.0,0.0,2815.0,2815.0,643.0,2815.0,0.9982595800000378,10.0,335.0,0.0,,,,0.00861433394022231,0.010348496102126023,0.04432229697704315,0.0001522430102340877,0.00010000000000000002,1.3552527156068802e-20,0.0001,0.0001,0.05220816,0.03156275,0.16941099999999998,0.009055335999999999,0.023975812000000003,0.012025122,0.040999293,0.0039341394
12,458.0,0.0,2888.0,2888.0,73.0,2888.0,0.9981873100000394,2.0,45.0,0.0,,,,0.009530787179957971,0.010701133845478595,0.03999783843755722,0.00031216014758683736,0.0001,0.0,0.0001,0.0001,0.062311728,0.026771976,0.14921491,0.02184285,,,,
13,478.0,0.0,2969.0,2969.0,81.0,2969.0,0.9981071200000412,0.0,0.0,0.0,,,,0.006778308925277089,0.00818221383451154,0.02816874161362648,0.0002485926379449665,0.0001,0.0,0.0001,0.0001,0.044908725,0.026523566000000002,0.117372274,0.016029207,-0.012888752,0.011372567,-0.0004688017,-0.031455092000000004
14,532.0,0.0,3183.0,3183.0,214.0,3183.0,0.9978952600000456,4.0,50.0,0.0,,,,0.0069169835669863795,0.009669208516406907,0.030461043119430545,0.0002085747983073816,0.00010000000000000003,2.7105054312137605e-20,0.0001,0.0001,0.047045134,0.03206875,0.123922676,0.010909475,,,,
15,551.0,0.0,3262.0,3262.0,79.0,3262.0,0.9978170500000474,2.0,15.0,0.0,,,,0.008465479291963243,0.01222064269286522,0.04136095941066742,0.00024427170865237713,0.0001,0.0,0.0001,0.0001,0.051463115999999996,0.036649507000000005,0.14754368,0.013367546000000001,,,,
16,626.0,0.0,3560.0,3560.0,298.0,3560.0,0.9975220300000538,6.0,145.0,0.0,,,,0.00915405042571758,0.00990793648084188,0.038579311221838,0.0002719838812481612,0.00010000000000000003,2.7105054312137605e-20,0.0001,0.0001,0.055197,0.029597567,0.14903925,0.010181868,-0.011365135,0.013168923,0.013799908,-0.025668386
17,916.0,0.0,4719.0,4719.0,1159.0,4719.0,0.9963746200000788,22.0,340.0,0.0,,,,0.00762058505652962,0.009529140287309196,0.04469184204936028,0.00013019611651543528,0.00010000000000000002,1.3552527156068802e-20,0.0001,0.0001,0.049977854,0.031034742999999997,0.15861663,0.008030025,0.016384887,0.018671772,0.056820348,-0.0064531965
18,943.0,0.0,4830.0,4830.0,111.0,4830.0,0.9962647300000812,3.0,45.0,0.0,,,,0.009809370868390907,0.011517441918305206,0.04567621275782585,0.0002561989240348339,0.0001,1.3552527156068802e-20,0.0001,0.0001,0.058472212,0.027904749,0.13957499,0.021250565,,,,
19,1006.0,0.0,5081.0,5081.0,251.0,5081.0,0.9960162400000864,0.0,0.0,0.0,,,,0.011435095637646171,0.011009362492248348,0.04369494318962097,0.0002374518953729421,0.00010000000000000003,2.7105054312137605e-20,0.0001,0.0001,0.06413252,0.03206017,0.17680877,0.015071165,,,,
20,1062.0,0.0,5304.0,5304.0,223.0,5304.0,0.9957954700000912,6.0,105.0,0.0,,,,0.008425145343997948,0.010476737255273365,0.04278689250349999,0.0002318604965694249,0.00010000000000000002,1.3552527156068802e-20,0.0001,0.0001,0.05528172,0.033389688,0.16662869,0.013080816,0.02990371,0.01963076,0.059646500000000005,0.005463021
21,1081.0,0.0,5379.0,5379.0,75.0,5379.0,0.9957212200000928,2.0,15.0,0.0,,,,0.007715215652225245,0.010669677149893777,0.03837666660547257,0.00035348522942513233,0.0001,0.0,0.0001,0.0001,0.055993587000000004,0.030775022000000003,0.13761143,0.023046900000000002,,,,
22,1125.0,0.0,5556.0,5556.0,177.0,5556.0,0.9955459900000968,2.0,35.0,0.0,,,,0.007088124197418272,0.007553918383053855,0.02735380455851555,0.00016635317297186702,0.00010000000000000003,2.7105054312137605e-20,0.0001,0.0001,0.046250745999999995,0.027780753,0.12917177,0.009924813000000001,,,,
23,1169.0,0.0,5733.0,5733.0,177.0,5733.0,0.9953707600001004,3.0,30.0,0.0,,,,0.007776185804686975,0.009135069977912141,0.03013703227043152,0.0002250532270409167,0.00010000000000000003,2.7105054312137605e-20,0.0001,0.0001,0.051641665,0.030256712999999998,0.13535246,0.012026708,,,,
24,1190.0,0.0,5815.0,5815.0,82.0,5815.0,0.9952895800001024,1.0,5.0,0.0,,,,0.011108649816984931,0.010548835692936798,0.042301006615161896,0.00035882263910025364,0.0001,1.3552527156068802e-20,0.0001,0.0001,0.06612081,0.03200974,0.15909933,0.027230294,,,,
25,1212.0,0.0,5904.0,5904.0,89.0,5904.0,0.9952014700001042,1.0,10.0,0.0,,,,0.004195408909502227,0.007376560022200305,0.025709044188261032,0.00019116624025627968,0.0001,1.3552527156068802e-20,0.0001,0.0001,0.03334951,0.028961857999999997,0.10913753,0.008333857,,,,