mirror of https://github.com/gryf/coach.git synced 2026-02-01 21:35:45 +01:00
Commit Graph

19 Commits

Author SHA1 Message Date
Shadi Endrawis
7086492127 parallel trace tests fix 2018-09-03 20:47:10 +03:00
itaicaspi-intel
2c62a40466 bug fix in dueling network + revert to TF 1.6 for CPU due to requirements compatibility issues 2018-09-02 13:38:16 +03:00
Gal Leibovich
d862a3be83 rainbow dqn hyper-parameter updates 2018-08-30 20:41:38 +03:00
Shadi Endrawis
07db625987 Running trace tests in parallel + other small fixes 2018-08-30 19:35:10 +03:00
Gal Leibovich
ebe574e463 add missing hidden layer in rainbow_q_head 2018-08-30 19:34:27 +03:00
Gal Leibovich
ea294de7fd adding dueling support for rainbow dqn (now only missing n-step) 2018-08-30 18:15:59 +03:00
Gal Leibovich
d2623c0eee bug-fix in dueling dqn 2018-08-30 18:14:53 +03:00
Gal Leibovich
bbe7ac3338 Rainbow DQN agent (WIP - still missing dueling and n-step) + adding support for Prioritized ER for C51 2018-08-30 18:14:53 +03:00
itaicaspi-intel
fd2f4b0852 bug fix in HRL HER memory + some small improvements 2018-08-29 14:36:18 +03:00
Gal Leibovich
1aa2ab0590 parameter noise exploration - using Noisy Nets 2018-08-27 18:19:01 +03:00
itaicaspi-intel
658b437079 removing datasets + imports optimization 2018-08-27 10:54:11 +03:00
Gal Leibovich
d826382b11 removing test from Doom_Health_Supreme_DFP + relaxing time limit on reward tests 2018-08-26 18:42:41 +03:00
Gal Leibovich
2021490caa small adjustment to golden tests + fixes for Doom_Health_DFP and Doom_Health_Supreme_DFP 2018-08-26 18:42:41 +03:00
Gal Leibovich
9bb7bd2e9c bug-fix in local_batch_run_coach and rename to run_multiple_seeds 2018-08-23 14:39:11 +03:00
Shadi Endrawis
3abb6cd415 Trace tests update 2018-08-20 13:01:30 +03:00
Gal Leibovich
c1f428666e bug-fix for checkpointing for single-worker algorithms 2018-08-19 20:17:15 +03:00
Itai Caspi
1de04d6fee updated gifs in README + fix for multiworker crashes + improved Atari DQN and Dueling DDQN presets 2018-08-16 18:23:32 +03:00
Gal Leibovich
ab5a81c7ee fix for dumping movies, without rendering, for pendulum_with_goals 2018-08-14 18:13:44 +03:00
Gal Novik
19ca5c24b1 pre-release 0.10.0 2018-08-13 17:11:34 +03:00