RL in Large Discrete Action Spaces - Wolpertinger Agent (#394) · 138ced23ba - coach

gryf/coach

mirror of https://github.com/gryf/coach.git synced 2026-07-07 18:06:31 +02:00

RL in Large Discrete Action Spaces - Wolpertinger Agent (#394)

* Currently this is specific to the case of discretizing a continuous action space. Can easily be adapted to other case by feeding the kNN otherwise, and removing the usage of a discretizing output action filter

This commit is contained in:

Gal Leibovich

2019-09-08 12:53:49 +03:00

committed by

GitHub

parent fc50398544

commit 138ced23ba

46 changed files with 1193 additions and 51 deletions

docs/_images/algorithms.png

BIN

View File

Binary file not shown.

Before

Width: | Height: | Size: 60 KiB

After

Width: | Height: | Size: 63 KiB

docs/_images/wolpertinger.png

BIN

View File

Binary file not shown.

After

Width: | Height: | Size: 49 KiB