mirror of
https://github.com/gryf/coach.git
synced 2025-12-18 11:40:18 +01:00
* bug fix - QR-DQN using error instead of abs-error in the quantile huber loss * improvement - QR-DQN sorting the quantile only once instead of batch_size times * new feature - adding the Breakout QRDQN preset (verified to achieve good results)
44 KiB
44 KiB