Update README.md

2026-07-09 02:46:33 +02:00 · 2018-08-19 11:02:45 +03:00
parent 0be4a42701
commit 23d2945bf8
1 changed file with 1 addition and 1 deletion
@@ -1,7 +1,7 @@
 # DQN

 Each experiment uses 3 seeds.
-The parameters used for DQN are the same parameters as described in the [original paper](https://arxiv.org/abs/1607.05077.pdf).
+The parameters used for DQN are the same parameters as described in the [original paper](https://www.cs.toronto.edu/~vmnih/docs/dqn.pdf), except for the optimizer (changed to ADAM) and learning rate (1e-4) used.

 ### Breakout DQN - single worker