mirror of
https://github.com/gryf/coach.git
synced 2026-04-20 23:41:24 +02:00
11faf19649
* bug fix - QR-DQN using error instead of abs-error in the quantile Huber loss
* improvement - QR-DQN sorting the quantile only once instead of batch_size times
* new feature - adding the Breakout QRDQN preset (verified to achieve good results)
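
For context on the bug fix, here is a minimal sketch of the quantile Huber loss in plain Python. It is not the code from this commit. The function names `huber` and `quantile_huber_loss`, the scalar interface, and the default `kappa=1.0` are all assumptions made for illustration. The sketch follows the usual definition, where `u` is the signed error (target minus predicted quantile) and `tau` is the quantile fraction. The Huber term depends on `abs(u)`, while the sign of `u` only selects the asymmetric weight.

```python
def huber(u, kappa=1.0):
    """Huber loss of a signed error u with threshold kappa (hypothetical helper)."""
    a = abs(u)  # the branch test and the linear branch both need the absolute error
    if a <= kappa:
        return 0.5 * u * u
    return kappa * (a - 0.5 * kappa)


def quantile_huber_loss(u, tau, kappa=1.0):
    """Quantile Huber loss for one signed error u at quantile fraction tau."""
    indicator = 1.0 if u < 0 else 0.0  # 1 when the error is negative
    return abs(tau - indicator) * huber(u, kappa)


if __name__ == "__main__":
    # A large negative error takes the linear branch because abs(u) > kappa.
    print(quantile_huber_loss(-2.0, 0.25))
    # A small positive error takes the quadratic branch.
    print(quantile_huber_loss(0.5, 0.25))
```

One way a signed error can go wrong here, given as an illustration rather than a description of the original bug: testing `u <= kappa` instead of `abs(u) <= kappa` sends every large negative error to the quadratic branch, so `u = -2.0` would yield a Huber term of 2.0 instead of 1.5.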