mirror of
https://github.com/gryf/coach.git
synced 2025-12-18 11:40:18 +01:00
Update README.md
This commit is contained in:
@@ -1,7 +1,7 @@
|
||||
# DQN
|
||||
|
||||
Each experiment uses 3 seeds.
|
||||
The parameters used for DQN are the same parameters as described in the [original paper](https://arxiv.org/abs/1607.05077.pdf).
|
||||
The parameters used for DQN are the same parameters as described in the [original paper](https://www.cs.toronto.edu/~vmnih/docs/dqn.pdf), except for the optimizer (changed to ADAM) and learning rate (1e-4) used.
|
||||
|
||||
### Breakout DQN - single worker
|
||||
|
||||
|
||||
Reference in New Issue
Block a user