Gal Leibovich
6e08c55ad5
Enabling-more-agents-for-Batch-RL-and-cleanup ( #258 )
...
allowing for the last training batch drawn to be smaller than batch_size + adding support for more agents in BatchRL by adding softmax with temperature to the corresponding heads + adding a CartPole_QR_DQN preset with a golden test + cleanups
2019-03-21 16:10:29 +02:00
..
2019-03-21 16:10:29 +02:00
2019-03-21 16:10:29 +02:00
2019-03-19 18:07:09 +02:00
2018-11-27 22:43:40 +02:00
2019-01-14 15:02:53 -05:00
2018-12-17 10:08:54 +02:00
2019-01-14 15:02:53 -05:00
2019-03-21 16:10:29 +02:00
2019-03-21 16:10:29 +02:00
2019-03-21 16:10:29 +02:00
2019-02-26 13:53:12 -08:00
2019-03-21 16:10:29 +02:00
2019-03-18 11:21:43 +02:00
2018-10-02 17:55:16 +03:00
2018-12-17 21:36:27 +02:00
2018-08-13 17:11:34 +03:00
2019-03-19 18:07:09 +02:00
2018-11-23 18:05:44 -08:00
2019-03-17 16:28:09 +02:00
2019-03-19 18:07:09 +02:00
2019-03-10 13:15:14 +02:00
2018-08-27 10:54:11 +03:00
2019-03-19 18:07:09 +02:00
2019-03-19 18:07:09 +02:00
2018-08-13 17:11:34 +03:00
2018-11-18 18:02:55 +02:00
2019-03-17 16:28:09 +02:00
2018-11-25 08:33:09 +02:00
2018-11-27 22:43:40 +02:00
2018-08-13 17:11:34 +03:00
2019-03-19 18:07:09 +02:00
2019-02-26 13:53:12 -08:00
2019-01-16 17:38:11 -08:00