mirror of https://github.com/gryf/coach.git synced 2025-12-17 11:10:20 +01:00

Dueling DDQN with Prioritized Experience Replay

Each experiment uses 3 seeds and is trained for 10k environment steps. The parameters used for Dueling DDQN with PER are the same as those described in the following paper.

Breakout Dueling DDQN with PER - single worker

coach -p Atari_Dueling_DDQN_with_PER_OpenAI -lvl breakout

[Figure: Breakout Dueling DDQN with PER]

Pong Dueling DDQN with PER - single worker

coach -p Atari_Dueling_DDQN_with_PER_OpenAI -lvl pong

[Figure: Pong Dueling DDQN with PER]

Space Invaders Dueling DDQN with PER - single worker

coach -p Atari_Dueling_DDQN_with_PER_OpenAI -lvl space_invaders

[Figure: Space Invaders Dueling DDQN with PER]
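
Since each experiment uses 3 seeds, the three single-worker commands above can be expanded into one run per level and seed. The following is a minimal sketch, not the script used to produce these results: the helper name `print_runs`, the seed values 0, 1 and 2, and the experiment names are hypothetical, and the `--seed` and `-e` options are assumed to be available in your installed version of Coach (check `coach --help`). The script only prints the commands; it does not launch any training.

```shell
#!/bin/sh
# Preset name taken from the commands in this README.
PRESET=Atari_Dueling_DDQN_with_PER_OpenAI

# Hypothetical helper: print (do not run) one coach command per level and seed.
# Seed values and experiment names are illustrative assumptions; --seed and -e
# are assumed CLI options, verify them with `coach --help`.
print_runs() {
    for level in breakout pong space_invaders; do
        for seed in 0 1 2; do
            echo "coach -p $PRESET -lvl $level --seed $seed -e ${level}_seed_${seed}"
        done
    done
}

print_runs
```

Once the printed commands have been checked against your installation, they can be run sequentially by piping the output to a shell, for example `print_runs | sh`.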