1
0
mirror of https://github.com/gryf/coach.git synced 2025-12-18 11:40:18 +01:00

Update README.md

This commit is contained in:
Gal Leibovich
2018-08-19 11:02:45 +03:00
committed by GitHub
parent 0be4a42701
commit 23d2945bf8

View File

@@ -1,7 +1,7 @@
# DQN
Each experiment uses 3 seeds.
The parameters used for DQN are the same parameters as described in the [original paper](https://arxiv.org/abs/1607.05077.pdf).
The parameters used for DQN are the same parameters as described in the [original paper](https://www.cs.toronto.edu/~vmnih/docs/dqn.pdf), except for the optimizer (changed to ADAM) and learning rate (1e-4) used.
### Breakout DQN - single worker