1
0
mirror of https://github.com/gryf/coach.git synced 2026-01-29 19:55:56 +01:00
Files
coach/rl_coach/traces/Atari_DDQN_space_invaders/trace.csv
Itai Caspi 72a1d9d426 Itaicaspi/episode reset refactoring (#105)
* reordering of the episode reset operation and allowing to store episodes only when they are terminated

* reordering of the episode reset operation and allowing to store episodes only when they are terminated

* revert tensorflow-gpu to 1.9.0 + bug fix in should_train()

* tests readme file and refactoring of policy optimization agent train function

* Update README.md

* Update README.md

* additional policy optimization train function simplifications

* Updated the traces after the reordering of the environment reset

* docker and jenkins files

* updated the traces to the ones from within the docker container

* updated traces and added control suite to the docker

* updated jenkins file with the intel proxy + updated doom basic a3c test params

* updated line breaks in jenkins file

* added a missing line break in jenkins file

* refining trace tests ignored presets + adding a configurable beta entropy value

* switch the order of trace and golden tests in jenkins + fix golden tests processes not killed issue

* updated benchmarks for dueling ddqn breakout and pong

* allowing dynamic updates to the loss weights + bug fix in episode.update_returns

* remove docker and jenkins file
2018-09-04 15:07:54 +03:00

6.3 KiB

1Episode #Training IterIn HeatupER #TransitionsER #EpisodesEpisode LengthTotal stepsEpsilonShaped Training RewardTraining RewardUpdate Target NetworkEvaluation RewardShaped Evaluation RewardSuccess RateLoss/MeanLoss/StdevLoss/MaxLoss/MinLearning Rate/MeanLearning Rate/StdevLearning Rate/MaxLearning Rate/MinGrads (unclipped)/MeanGrads (unclipped)/StdevGrads (unclipped)/MaxGrads (unclipped)/MinQ/MeanQ/StdevQ/MaxQ/Min
210.01.0486.0486.0486.0486.01.00.0
320.01.0573.0573.087.0573.01.00.0
430.01.0722.0722.0149.0722.01.00.0
540.01.01057.01057.0335.01057.01.00.0
6551.00.01260.01260.0203.01260.00.99979903000000445.055.00.00.0110078261407347870.0146034810571534780.057287234812974920.000176793808350339530.000250.00.000250.000250.050014480.036737250.188648750.012782556
7670.00.01335.01335.075.01335.00.9997247800000062.015.00.00.0114993907474471540.0109610000636458720.03154670447111130.00054419366642832770.00025000000000000015.421010862427521e-200.000250.000250.057593234000000010.0249926670000000030.110289340.022131458
8791.00.01422.01422.087.01422.00.99963865000000781.015.00.00.011328843546964760.0134293862032649640.043645411729812620.0003407037293072790.00025000000000000015.421010862427521e-200.000250.000250.0554663580.034080650.139732570.016833907
98159.00.01693.01693.0271.01693.00.99937036000001365.055.00.00.0087059729966272450.010465941527383980.042592473328113560.000264054571744054560.000250.00.000250.000250.045169680.0278069340.126733440.0123512
109201.00.01861.01861.0168.01861.00.99920404000001723.050.00.00.0058782991864496750.0094378526320402310.044341240078210830.000137268216349184490.000250.00.000250.000250.0313201029999999950.026000940.1184478850.008186511
1110279.00.02172.02172.0311.02172.00.9988961500000244.065.00.00.0097271425306994040.0122689365336537870.054825037717819220.000132241199025884270.00025000000000000015.421010862427521e-200.000250.000250.0458183770.033228480.134699870.006579738000000001
1211440.00.02815.02815.0643.02815.00.998259580000037810.0335.00.00.0086512617236842050.010352187435444920.04605223238468170.000125826350995339430.00025000000000000015.421010862427521e-200.000250.000250.0425654650.0276835039999999980.134928200000000030.00556415270.0104718780.0167721450.03140713-0.011288109
1312458.00.02888.02888.073.02888.00.99818731000003942.045.00.00.0095075779002703130.0106861655513159430.039721839129924770.00042335918988101180.000250.00.000250.000250.047219370.0255199450.111070150.016729604
1413478.00.02969.02969.081.02969.00.99810712000004120.00.00.00.0069165431326837280.0082563736710425050.027939710766077040.000342709419783204850.00025000000000000015.421010862427521e-200.000250.000250.037852280.0230636930.0909552650.014428139-0.00069354410000000010.0084597950.012772851000000002-0.010573764
1514532.00.03183.03183.0214.03183.00.99789526000004564.050.00.00.00698413917335629750.0097279305252402850.0309584215283393830.000199823276489041720.000250.00.000250.000250.035173780.0281210710.097452535999999990.007200333000000001
1615551.00.03262.03262.079.03262.00.99781705000004742.015.00.00.0085714607105872260.0124799030649225050.0430563688278198240.00017904212290886790.00025000000000000015.421010862427521e-200.000250.000250.038619440.033493470.123679850.008007733000000001
1716626.00.03560.03560.0298.03560.00.99752203000005386.0145.00.00.009160615674281380.009852626077304020.038631342351436620.000224598668864928190.00025000000000000015.421010862427521e-200.000250.000250.045016830.0263036270000000030.109413810.010795511-0.0075079850.0153763739999999980.009678967-0.030438615
1817885.00.04598.04598.01038.04598.00.99649441000007612.0210.00.00.0076904138308566810.0097222732145871090.044841989874839780.000145406840601935980.00025000000000000015.421010862427521e-200.000250.000250.0401539950.0281076080.133341010.0071760030.017526270.0186406060.05206203-0.006777686
1918903.00.04668.04668.070.04668.00.99642511000007762.015.00.00.0083740751519251390.006850584875498060.016957255080342290.000136437331093475220.000250.00.000250.000250.0417697840.0253888010000000030.06932660.006554181
2019924.00.04754.04754.086.04754.00.99633997000007962.035.00.00.0069862552066167280.0091380395121877230.0294351223856210670.000239591972786001860.00025000000000000015.421010862427521e-200.000250.000250.037657290.0277414229999999980.094834570.009779892
2120988.00.05007.05007.0253.05007.00.99608950000008481.05.00.00.0089581099395189320.0103521100180178490.041209306567907330.000260690896539017560.000250.00.000250.000250.0442801230.0273521820000000030.105593460.010518369
22211009.00.05092.05092.085.05092.00.99600535000008680.00.00.00.0097290273766875960.0074179153931231580.0264108180999755860.00051712902495637540.00025000000000000015.421010862427521e-200.000250.000250.053050240.0229274480.111491090.019090652
23221041.00.05221.05221.0129.05221.00.99587764000008961.010.00.00.00651091213785548550.0097051445640429240.042300540953874590.00024643848882988090.000250.00.000250.000250.038575640.0296520129999999980.131660250.0083391290.0226678550.0298225250.073469676-0.01822071
24231101.00.05461.05461.0240.05461.00.99564004000009486.085.00.00.0074647580368036880.0088715549426364140.041591089218854899.95906739262864e-050.000250.00.000250.000250.0406319829999999970.0277804250.12730760.0052087842
25241167.00.05724.05724.0263.05724.00.99537967000010041.025.00.00.0070499510219440590.0081455978563091480.0295563805848360060.000247470685280859470.000250.00.000250.000250.037561830.0250775980.105309270.010631736000000001
26251188.00.05808.05808.084.05808.00.9952965100001020.00.00.00.0088498358485854370.0088129605203267490.0281904675066471130.000186631281394511460.00025000000000000015.421010862427521e-200.000250.000250.044691305999999990.0286273160.105850280.008268119