1
0
mirror of https://github.com/gryf/coach.git synced 2025-12-18 11:40:18 +01:00
Files
coach/rl_coach/agents
Gal Leibovich 4741b0b916 BCQ variant on top of DDQN (#276)
* kNN based model for predicting which actions to drop
* fix for seeds with batch rl
2019-04-16 17:06:23 +03:00
..
2018-08-13 17:11:34 +03:00
2019-03-17 15:33:28 +02:00
2019-04-16 17:06:23 +03:00
2019-03-19 18:07:09 +02:00
2019-04-16 17:06:23 +03:00
2018-09-12 15:26:16 +03:00
2019-03-19 18:07:09 +02:00