mirror of
https://github.com/gryf/coach.git
synced 2026-05-03 14:50:56 +02:00
11faf19649
* bug fix - QR-DQN using error instead of abs-error in the quantile huber loss * improvement - QR-DQN sorting the quantile only once instead of batch_size times * new feature - adding the Breakout QRDQN preset (verified to achieve good results)