mirror of https://github.com/gryf/coach.git synced 2026-01-27 18:45:45 +01:00
coach/rl_coach/traces/Atari_NStepQ_space_invaders/trace.csv
Itai Caspi 72a1d9d426 Itaicaspi/episode reset refactoring (#105)
* reordering of the episode reset operation and allowing to store episodes only when they are terminated

* revert tensorflow-gpu to 1.9.0 + bug fix in should_train()

* tests readme file and refactoring of policy optimization agent train function

* Update README.md

* additional policy optimization train function simplifications

* Updated the traces after the reordering of the environment reset

* docker and jenkins files

* updated the traces to the ones from within the docker container

* updated traces and added control suite to the docker

* updated jenkins file with the intel proxy + updated doom basic a3c test params

* updated line breaks in jenkins file

* added a missing line break in jenkins file

* refining trace tests ignored presets + adding a configurable beta entropy value

* switch the order of trace and golden tests in jenkins + fix golden tests processes not killed issue

* updated benchmarks for dueling ddqn breakout and pong

* allowing dynamic updates to the loss weights + bug fix in episode.update_returns

* remove docker and jenkins file
2018-09-04 15:07:54 +03:00

4.0 KiB

Episode #,Training Iter,In Heatup,ER #Transitions,ER #Episodes,Episode Length,Total steps,Epsilon,Shaped Training Reward,Training Reward,Update Target Network,Evaluation Reward,Shaped Evaluation Reward,Success Rate,Loss/Mean,Loss/Stdev,Loss/Max,Loss/Min,Learning Rate/Mean,Learning Rate/Stdev,Learning Rate/Max,Learning Rate/Min,Grads (unclipped)/Mean,Grads (unclipped)/Stdev,Grads (unclipped)/Max,Grads (unclipped)/Min,Entropy/Mean,Entropy/Stdev,Entropy/Max,Entropy/Min,Q/Mean,Q/Stdev,Q/Max,Q/Min,Q Values/Mean,Q Values/Stdev,Q Values/Max,Q Values/Min,Value Loss/Mean,Value Loss/Stdev,Value Loss/Max,Value Loss/Min
1,0.0,1.0,486.0,1.0,486.0,486.0,0.5,0.0
2,0.0,1.0,87.0,1.0,87.0,573.0,0.5,0.0
3,0.0,1.0,149.0,1.0,149.0,722.0,0.5,0.0
4,0.0,1.0,335.0,1.0,335.0,1057.0,0.5,0.0
5,30.0,0.0,152.0,1.0,152.0,1209.0,0.4985103999999994,2.0,15.0,0.0,0.01806015,0.031154681,0.11256089999999999,-0.03153646,0.040528998,0.15495184,0.7766681,7.3362275999999995e-06
6,84.0,0.0,270.0,1.0,270.0,1479.0,0.4958643999999982,8.0,120.0,0.0,0.04832644,0.029433122000000003,0.116728835,-0.013950496000000001,0.054705366,0.14291170000000003,0.70854425,0.00038360796
7,149.0,0.0,324.0,1.0,324.0,1803.0,0.4926891999999968,9.0,120.0,0.0,0.06519938,0.036996773999999996,0.20431875,-0.0006479138,0.09192595,0.23194770000000003,0.8836251,9.27005e-05
8,197.0,0.0,237.0,1.0,237.0,2040.0,0.4903665999999958,6.0,70.0,0.0,0.081993505,0.031750474,0.17067611,0.01893028,0.06609978,0.17276828,0.84518427,0.00049587624
9,231.0,0.0,171.0,1.0,171.0,2211.0,0.4886907999999951,0.0,0.0,0.0,0.06561219,0.027578448999999998,0.16583520000000002,0.0023947426000000003,0.0045568603,0.0040728809999999996,0.019272441,0.0012047348
10,352.0,0.0,604.0,1.0,604.0,2815.0,0.4827715999999925,16.0,240.0,0.0,0.054065555,0.029770117000000002,0.14167584,-0.025165185,0.06984026,0.19431692,0.8947495999999999,1.5888494e-05
11,399.0,0.0,232.0,1.0,232.0,3047.0,0.4804979999999915,4.0,25.0,0.0,0.09317397,0.037268302999999996,0.1879414,0.017247636,0.04507253,0.13425689999999998,0.7788928,0.0016535529999999999
12,430.0,0.0,154.0,1.0,154.0,3201.0,0.4789887999999909,2.0,15.0,0.0,0.060374584,0.026983725,0.1327358,0.00017325346999999997,0.03700112,0.15603235,0.8584324000000001,9.365492e-05
13,464.0,0.0,169.0,1.0,169.0,3370.0,0.4773325999999902,3.0,60.0,0.0,0.07076912,0.024960317000000003,0.171489,0.022848563,0.07708849,0.23528506,0.8999446,0.0009268887
14,502.0,0.0,189.0,1.0,189.0,3559.0,0.4754803999999894,4.0,50.0,0.0,0.08175371599999999,0.059707563,0.23806223,-0.0022388997,0.080376275,0.21826938,0.9014337,0.000156738
15,530.0,0.0,138.0,1.0,138.0,3697.0,0.4741279999999888,1.0,25.0,0.0,0.06588258599999999,0.031772457000000004,0.16913122,-0.0033009246000000004,0.037671477,0.17314273,0.92035943,0.0008084267599999999
16,549.0,0.0,95.0,1.0,95.0,3792.0,0.4731969999999884,1.0,30.0,0.0,0.08419551,0.021721134,0.13456126,0.042178745999999996,0.022773635,0.08062003,0.35514408,0.00152435
17,630.0,0.0,404.0,1.0,404.0,4196.0,0.4692377999999866,9.0,75.0,0.0,0.06739886,0.03467486,0.14949478,-0.03615028,0.063086614,0.17489205,0.7053564,0.00030611295
18,714.0,0.0,420.0,1.0,420.0,4616.0,0.4651217999999849,10.0,160.0,0.0,0.06760059,0.022386358999999998,0.1480442,0.0114746755,0.065966256,0.18684588,0.90956885,0.00010415676
19,809.0,0.0,473.0,1.0,473.0,5089.0,0.4604863999999829,7.0,135.0,0.0,0.057970256,0.020290807,0.13571687,0.012275728,0.04331644,0.16250839999999997,0.8834267,0.00020417363000000002
20,850.0,0.0,204.0,1.0,204.0,5293.0,0.45848719999998205,3.0,20.0,0.0,0.07100958,0.030987637000000002,0.14825977,0.011810873000000001,0.047851723,0.16509958,0.8592968000000001,0.00032094717999999996