mirror of
https://github.com/gryf/coach.git
synced 2025-12-17 19:20:19 +01:00
* bug fix - QR-DQN using error instead of abs-error in the quantile huber loss * improvement - QR-DQN sorting the quantile only once instead of batch_size times * new feature - adding the Breakout QRDQN preset (verified to achieve good results)
27 KiB
27 KiB