gryf/coach

mirror of https://github.com/gryf/coach.git synced 2026-07-07 18:06:31 +02:00

Files

T

History

Gal Leibovich 08a557bfd1 updated the benchmarks for space invaders with dueling ddqn variants

2018-09-06 12:13:49 +03:00

..

breakout_dueling_ddqn.png

Itaicaspi/episode reset refactoring (#105 )

2018-09-04 15:07:54 +03:00

pong_dueling_ddqn.png

Itaicaspi/episode reset refactoring (#105 )

2018-09-04 15:07:54 +03:00

README.md

benchmarks and pip package updates

2018-08-19 14:23:20 +03:00

space_invaders_dueling_ddqn.png

updated the benchmarks for space invaders with dueling ddqn variants

2018-09-06 12:13:49 +03:00

README.md

Dueling DDQN

Each experiment uses 3 seeds and is trained for 10k environment steps. The parameters used for Dueling DDQN are the same parameters as described in the original paper.

Pong Dueling DDQN - single worker

coach -p Atari_Dueling_DDQN -lvl pong

Pong Dueling DDQN

Breakout Dueling DDQN - single worker

coach -p Atari_Dueling_DDQN -lvl breakout

Breakout Dueling DDQN

Space Invaders Dueling DDQN - single worker

coach -p Atari_Dueling_DDQN -lvl space_invaders

Space Invaders Dueling DDQN