mirror of
https://github.com/gryf/coach.git
synced 2026-02-01 13:25:45 +01:00
* bug fix - QR-DQN using error instead of abs-error in the quantile huber loss
* improvement - QR-DQN sorting the quantile only once instead of batch_size times
* new feature - adding the Breakout QRDQN preset (verified to achieve good results)
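
To illustrate why the first fix matters, here is a minimal NumPy sketch of the quantile Huber loss as it is commonly defined for QR-DQN. It is not the code from this repository: the function name `quantile_huber_loss`, its arguments, and the default `kappa=1.0` are assumptions made for the example. The key point is that the Huber threshold is compared against the absolute error. If the signed error is compared instead, every large negative error falls into the quadratic branch and the loss is no longer symmetric.

```python
import numpy as np


def quantile_huber_loss(predicted, target, kappa=1.0):
    """Hypothetical helper: quantile Huber loss for one state-action pair.

    predicted: (N,) current quantile estimates
    target:    (M,) target quantile samples
    """
    predicted = np.asarray(predicted, dtype=float)
    target = np.asarray(target, dtype=float)
    n = predicted.shape[0]

    # Quantile midpoints tau_i = (2i + 1) / (2N)
    tau = (np.arange(n) + 0.5) / n

    # Pairwise errors u[i, j] = target[j] - predicted[i]
    u = target[None, :] - predicted[:, None]

    # The threshold test must use |u|, not u
    abs_u = np.abs(u)
    huber = np.where(abs_u <= kappa,
                     0.5 * u ** 2,
                     kappa * (abs_u - 0.5 * kappa))

    # Asymmetric quantile weight |tau_i - 1{u < 0}|
    weight = np.abs(tau[:, None] - (u < 0).astype(float))

    # Sum over predicted quantiles, average over target samples
    return float(np.mean(np.sum(weight * huber / kappa, axis=0)))
```

With a single quantile (tau = 0.5), errors of +2 and -2 give the same loss in this sketch, which would not hold if the signed error were used in the threshold test.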