coach

gryf/coach

mirror of https://github.com/gryf/coach.git synced 2026-07-07 18:06:31 +02:00

Gal Leibovich 138ced23ba RL in Large Discrete Action Spaces - Wolpertinger Agent (#394)

* Currently this is specific to the case of discretizing a continuous action space. It can easily be adapted to other cases by feeding the kNN differently and removing the discretizing output action filter.

2019-09-08 12:53:49 +03:00
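The commit note above describes Wolpertinger-style action selection over a discretized continuous action space. The following is a minimal sketch of that idea, not Coach's actual implementation: the names `discretize`, `wolpertinger_select`, and `q_fn` are hypothetical, the kNN is a brute-force NumPy search, and the critic is a toy stand-in function.

```python
import numpy as np


def discretize(low, high, n):
    """Return n evenly spaced 1-D actions as an (n, 1) array (hypothetical helper)."""
    return np.linspace(low, high, n).reshape(-1, 1)


def wolpertinger_select(proto_action, actions, q_fn, k):
    """Pick the index of the best discrete action near a continuous proto-action.

    1. Find the k discrete actions closest to proto_action (brute-force kNN).
    2. Score each candidate with the critic q_fn.
    3. Return the index of the candidate with the highest score.
    """
    dists = np.linalg.norm(actions - proto_action, axis=1)
    nearest = np.argsort(dists)[:k]
    q_values = np.array([q_fn(actions[i]) for i in nearest])
    return int(nearest[np.argmax(q_values)])


# 11 discrete actions covering [-1, 1] in steps of 0.2.
actions = discretize(-1.0, 1.0, 11)


# Toy critic (an assumption for illustration): prefers actions close to 0.55.
def q_fn(action):
    return -abs(float(action[0]) - 0.55)


# The actor's proto-action is 0.35; its nearest discrete action is 0.4,
# but the critic prefers 0.6 among the 3 nearest candidates.
idx = wolpertinger_select(np.array([0.35]), actions, q_fn, k=3)
```

Refining the proto-action with the critic is what lets the agent avoid a poor nearest neighbour; with `k=1` the sketch reduces to plain nearest-action discretization.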

agents

RL in Large Discrete Action Spaces - Wolpertinger Agent (#394)

2019-09-08 12:53:49 +03:00

architectures

Batch RL Tutorial (#372)

2019-07-14 18:43:48 +03:00

data_stores

TD3 (#338)

2019-06-16 11:11:21 +03:00

environments

TD3 (#338)

2019-06-16 11:11:21 +03:00

exploration_policies

TD3 (#338)

2019-06-16 11:11:21 +03:00

filters

TD3 (#338)

2019-06-16 11:11:21 +03:00

memories

TD3 (#338)

2019-06-16 11:11:21 +03:00

memory_backends

TD3 (#338)

2019-06-16 11:11:21 +03:00

orchestrators

TD3 (#338)

2019-06-16 11:11:21 +03:00

additional_parameters.html

TD3 (#338)

2019-06-16 11:11:21 +03:00

core_types.html

TD3 (#338)

2019-06-16 11:11:21 +03:00

spaces.html

RL in Large Discrete Action Spaces - Wolpertinger Agent (#394)

2019-09-08 12:53:49 +03:00