mirror of
https://github.com/gryf/coach.git
synced 2026-02-20 08:45:55 +01:00
* reordering of the episode reset operation and allowing to store episodes only when they are terminated * reordering of the episode reset operation and allowing to store episodes only when they are terminated * revert tensorflow-gpu to 1.9.0 + bug fix in should_train() * tests readme file and refactoring of policy optimization agent train function * Update README.md * Update README.md * additional policy optimization train function simplifications * Updated the traces after the reordering of the environment reset * docker and jenkins files * updated the traces to the ones from within the docker container * updated traces and added control suite to the docker * updated jenkins file with the intel proxy + updated doom basic a3c test params * updated line breaks in jenkins file * added a missing line break in jenkins file * refining trace tests ignored presets + adding a configurable beta entropy value * switch the order of trace and golden tests in jenkins + fix golden tests processes not killed issue * updated benchmarks for dueling ddqn breakout and pong * allowing dynamic updates to the loss weights + bug fix in episode.update_returns * remove docker and jenkins file
6.2 KiB
6.2 KiB
| 1 | Episode # | Training Iter | In Heatup | ER #Transitions | ER #Episodes | Episode Length | Total steps | Epsilon | Shaped Training Reward | Training Reward | Update Target Network | Evaluation Reward | Shaped Evaluation Reward | Success Rate | Loss/Mean | Loss/Stdev | Loss/Max | Loss/Min | Learning Rate/Mean | Learning Rate/Stdev | Learning Rate/Max | Learning Rate/Min | Grads (unclipped)/Mean | Grads (unclipped)/Stdev | Grads (unclipped)/Max | Grads (unclipped)/Min | Q/Mean | Q/Stdev | Q/Max | Q/Min |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2 | 1 | 0.0 | 1.0 | 486.0 | 486.0 | 486.0 | 486.0 | 1.0 | 0.0 | |||||||||||||||||||||
| 3 | 2 | 0.0 | 1.0 | 573.0 | 573.0 | 87.0 | 573.0 | 1.0 | 0.0 | |||||||||||||||||||||
| 4 | 3 | 0.0 | 1.0 | 722.0 | 722.0 | 149.0 | 722.0 | 1.0 | 0.0 | |||||||||||||||||||||
| 5 | 4 | 0.0 | 1.0 | 1057.0 | 1057.0 | 335.0 | 1057.0 | 1.0 | 0.0 | |||||||||||||||||||||
| 6 | 5 | 51.0 | 0.0 | 1260.0 | 1260.0 | 203.0 | 1260.0 | 0.9997990300000044 | 5.0 | 55.0 | 0.0 | 36.40264579741394 | 49.44809691530482 | 198.9517517089844 | 0.1989624798297882 | 5.000000000000001e-05 | 6.776263578034403e-21 | 5e-05 | 5e-05 | 7.8067063999999995 | 7.391117599999999 | 36.182125 | 1.4651048 | |||||||
| 7 | 6 | 70.0 | 0.0 | 1335.0 | 1335.0 | 75.0 | 1335.0 | 0.999724780000006 | 2.0 | 15.0 | 0.0 | 39.23580433663569 | 41.016771031064785 | 99.57048797607422 | 0.261367529630661 | 5e-05 | 0.0 | 5e-05 | 5e-05 | 10.85398 | 7.770962700000001 | 26.800815999999998 | 2.6992220000000002 | |||||||
| 8 | 7 | 91.0 | 0.0 | 1422.0 | 1422.0 | 87.0 | 1422.0 | 0.9996386500000078 | 1.0 | 15.0 | 0.0 | 38.0207036946501 | 47.816324507097775 | 148.4340362548828 | 0.3246855139732361 | 5e-05 | 6.776263578034403e-21 | 5e-05 | 5e-05 | 11.630027 | 8.391272 | 40.402522999999995 | 4.429657499999999 | |||||||
| 9 | 8 | 159.0 | 0.0 | 1693.0 | 1693.0 | 271.0 | 1693.0 | 0.9993703600000136 | 5.0 | 55.0 | 0.0 | 28.948033251869145 | 36.08735969022815 | 148.06871032714844 | 0.3664446473121643 | 5.0000000000000016e-05 | 1.3552527156068802e-20 | 5e-05 | 5e-05 | 11.167119999999999 | 6.748284 | 31.783441999999997 | 4.7481833 | |||||||
| 10 | 9 | 201.0 | 0.0 | 1861.0 | 1861.0 | 168.0 | 1861.0 | 0.9992040400000172 | 3.0 | 50.0 | 0.0 | 18.986573637950986 | 31.753488181878904 | 146.00331115722656 | 0.3643231987953186 | 5.000000000000001e-05 | 6.776263578034403e-21 | 5e-05 | 5e-05 | 9.701611999999999 | 7.381705 | 43.52698 | 4.8068805 | |||||||
| 11 | 10 | 279.0 | 0.0 | 2172.0 | 2172.0 | 311.0 | 2172.0 | 0.998896150000024 | 4.0 | 65.0 | 0.0 | 32.03847728096522 | 41.925721917185676 | 194.55128479003903 | 0.2954877316951752 | 5.0000000000000016e-05 | 2.0328790734103208e-20 | 5e-05 | 5e-05 | 18.140017999999998 | 13.331147 | 86.43568 | 3.4344087 | |||||||
| 12 | 11 | 440.0 | 0.0 | 2815.0 | 2815.0 | 643.0 | 2815.0 | 0.9982595800000378 | 10.0 | 335.0 | 0.0 | 28.069273558832844 | 34.57541787899187 | 143.40928649902344 | 0.4890246391296386 | 5.000000000000001e-05 | 6.776263578034403e-21 | 5e-05 | 5e-05 | 21.02929 | 14.073232 | 105.471725 | 7.160336 | 0.016182939943779884 | 0.00317882025612132 | 0.020630363033269532 | 0.012337473193183542 | |||
| 13 | 12 | 458.0 | 0.0 | 2888.0 | 2888.0 | 73.0 | 2888.0 | 0.9981873100000394 | 2.0 | 45.0 | 0.0 | 30.90841283400853 | 36.51044361360722 | 138.05393981933594 | 1.1750518083572388 | 5e-05 | 0.0 | 5e-05 | 5e-05 | 34.070717 | 18.229351 | 87.29681 | 17.943182 | |||||||
| 14 | 13 | 478.0 | 0.0 | 2969.0 | 2969.0 | 81.0 | 2969.0 | 0.9981071200000412 | 0.0 | 0.0 | 0.0 | 21.819078975915904 | 26.667606415139126 | 94.14669799804688 | 1.2481423616409302 | 5e-05 | 0.0 | 5e-05 | 5e-05 | 29.088186 | 7.6060133 | 48.72128 | 18.927726999999997 | 0.017232515091842898 | 0.004842224507737858 | 0.023045246455294547 | 0.011178828286356295 | |||
| 15 | 14 | 532.0 | 0.0 | 3183.0 | 3183.0 | 214.0 | 3183.0 | 0.9978952600000456 | 4.0 | 50.0 | 0.0 | 22.003884507550136 | 31.01379918463093 | 93.17678833007812 | 0.7777568697929382 | 5.0000000000000016e-05 | 1.3552527156068802e-20 | 5e-05 | 5e-05 | 25.917735999999998 | 17.056901999999997 | 82.94768499999999 | 11.922298 | |||||||
| 16 | 15 | 551.0 | 0.0 | 3262.0 | 3262.0 | 79.0 | 3262.0 | 0.9978170500000474 | 2.0 | 15.0 | 0.0 | 26.501989019544503 | 38.53608557260633 | 132.7838134765625 | 0.7758174538612366 | 5e-05 | 0.0 | 5e-05 | 5e-05 | 31.654518 | 29.090153000000004 | 115.05221999999999 | 12.105766000000001 | |||||||
| 17 | 16 | 632.0 | 0.0 | 3584.0 | 3584.0 | 322.0 | 3584.0 | 0.9974982700000544 | 6.0 | 145.0 | 0.0 | 26.855747359991074 | 31.176486537250174 | 123.03897094726562 | 0.973316729068756 | 5.000000000000001e-05 | 6.776263578034403e-21 | 5e-05 | 5e-05 | 36.815296000000004 | 22.30738 | 154.81078 | 14.858197 | 0.015187424104939663 | 0.005942733756463785 | 0.02293543783511268 | 0.008310022529549316 | |||
| 18 | 17 | 671.0 | 0.0 | 3742.0 | 3742.0 | 158.0 | 3742.0 | 0.9973418500000576 | 2.0 | 15.0 | 0.0 | 28.492036423622032 | 32.63528454234575 | 93.96733856201172 | 0.9652070403099059 | 4.999999999999999e-05 | 1.3552527156068802e-20 | 5e-05 | 5e-05 | 34.653465000000004 | 19.933933 | 92.079704 | 14.586936999999999 | 0.031495462678612966 | 0.008552723810986405 | 0.04882960178918439 | 0.021028505007270725 | |||
| 19 | 18 | 692.0 | 0.0 | 3823.0 | 3823.0 | 81.0 | 3823.0 | 0.9972616600000594 | 2.0 | 15.0 | 0.0 | 25.052144294977193 | 30.43845731410265 | 125.72972106933594 | 1.7888946533203125 | 5e-05 | 0.0 | 5e-05 | 5e-05 | 42.09162 | 22.177326 | 135.06197 | 26.311890000000002 | |||||||
| 20 | 19 | 724.0 | 0.0 | 3954.0 | 3954.0 | 131.0 | 3954.0 | 0.9971319700000624 | 3.0 | 60.0 | 0.0 | 22.733492869883776 | 25.48783858900171 | 85.9217758178711 | 1.1611932516098022 | 5e-05 | 0.0 | 5e-05 | 5e-05 | 36.707924 | 17.869907 | 97.16350600000001 | 17.332857 | |||||||
| 21 | 20 | 892.0 | 0.0 | 4624.0 | 4624.0 | 670.0 | 4624.0 | 0.9964686700000768 | 10.0 | 120.0 | 0.0 | 26.82198785878941 | 32.463568352909114 | 173.3283233642578 | 0.6274673938751221 | 5e-05 | 6.776263578034403e-21 | 5e-05 | 5e-05 | 40.43936 | 25.755894 | 179.22295 | 9.756477 | 0.03179729131830149 | 0.015935985961329873 | 0.06086315548294807 | 0.005561185698170447 | |||
| 22 | 21 | 1039.0 | 0.0 | 5212.0 | 5212.0 | 588.0 | 5212.0 | 0.9958865500000892 | 7.0 | 305.0 | 0.0 | 25.642627885957967 | 30.19485814602498 | 128.482666015625 | 0.874378502368927 | 5.0000000000000016e-05 | 1.3552527156068802e-20 | 5e-05 | 5e-05 | 38.308285 | 20.612198 | 146.47842 | 12.9673605 | 0.01867095795375159 | 0.00486977181278684 | 0.025539715652921586 | 0.011338672893034526 | |||
| 23 | 22 | 1062.0 | 0.0 | 5306.0 | 5306.0 | 94.0 | 5306.0 | 0.9957934900000912 | 1.0 | 30.0 | 0.0 | 27.773966304633927 | 35.56498822181043 | 128.78677368164062 | 0.7781664133071899 | 4.999999999999999e-05 | 1.3552527156068802e-20 | 5e-05 | 5e-05 | 37.376076 | 32.84612 | 134.23588999999998 | 10.897138 | |||||||
| 24 | 23 | 1121.0 | 0.0 | 5540.0 | 5540.0 | 234.0 | 5540.0 | 0.9955618300000963 | 5.0 | 65.0 | 0.0 | 27.48228641214041 | 31.674095369737838 | 117.5129852294922 | 1.5340145826339722 | 5.000000000000001e-05 | 6.776263578034403e-21 | 5e-05 | 5e-05 | 47.458725 | 30.501372999999997 | 172.14487 | 23.663988 | |||||||
| 25 | 24 | 1159.0 | 0.0 | 5692.0 | 5692.0 | 152.0 | 5692.0 | 0.9954113500000996 | 0.0 | 0.0 | 0.0 | 22.548715029892172 | 29.715242718167897 | 135.16885375976562 | 1.177747130393982 | 4.999999999999999e-05 | 1.3552527156068802e-20 | 5e-05 | 5e-05 | 37.85986 | 22.92256 | 141.62851 | 18.90979 | |||||||
| 26 | 25 | 1211.0 | 0.0 | 5901.0 | 5901.0 | 209.0 | 5901.0 | 0.995204440000104 | 3.0 | 55.0 | 0.0 | 20.993866240748996 | 25.480314720786012 | 95.07637786865234 | 0.8926278352737427 | 5.0000000000000016e-05 | 1.3552527156068802e-20 | 5e-05 | 5e-05 | 28.66507 | 14.327967000000001 | 80.878 | 13.025402 |