coach

gryf/coach

mirror of https://github.com/gryf/coach.git synced 2026-07-08 02:16:32 +02:00

Files

T

History

Gal Leibovich 138ced23ba RL in Large Discrete Action Spaces - Wolpertinger Agent (#394 )

* Currently this is specific to the case of discretizing a continuous action space. Can easily be adapted to other case by feeding the kNN otherwise, and removing the usage of a discretizing output action filter

2019-09-08 12:53:49 +03:00

acer_agent.html

fixed release version (#333 )

2019-05-28 11:11:15 +03:00

actor_critic_agent.html

fixed release version (#333 )

2019-05-28 11:11:15 +03:00

agent_interface.html

fixed release version (#333 )

2019-05-28 11:11:15 +03:00

agent.html

RL in Large Discrete Action Spaces - Wolpertinger Agent (#394 )

2019-09-08 12:53:49 +03:00

bc_agent.html

fixed release version (#333 )

2019-05-28 11:11:15 +03:00

categorical_dqn_agent.html

TD3 (#338 )

2019-06-16 11:11:21 +03:00

cil_agent.html

fixed release version (#333 )

2019-05-28 11:11:15 +03:00

clipped_ppo_agent.html

RL in Large Discrete Action Spaces - Wolpertinger Agent (#394 )

2019-09-08 12:53:49 +03:00

ddpg_agent.html

Batch RL Tutorial (#372 )

2019-07-14 18:43:48 +03:00

dfp_agent.html

TD3 (#338 )

2019-06-16 11:11:21 +03:00

dqn_agent.html

TD3 (#338 )

2019-06-16 11:11:21 +03:00

mmc_agent.html

fixed release version (#333 )

2019-05-28 11:11:15 +03:00

n_step_q_agent.html

fixed release version (#333 )

2019-05-28 11:11:15 +03:00

naf_agent.html

fixed release version (#333 )

2019-05-28 11:11:15 +03:00

nec_agent.html

TD3 (#338 )

2019-06-16 11:11:21 +03:00

pal_agent.html

fixed release version (#333 )

2019-05-28 11:11:15 +03:00

policy_gradients_agent.html

fixed release version (#333 )

2019-05-28 11:11:15 +03:00

ppo_agent.html

fixed release version (#333 )

2019-05-28 11:11:15 +03:00

qr_dqn_agent.html

TD3 (#338 )

2019-06-16 11:11:21 +03:00

rainbow_dqn_agent.html

fixed release version (#333 )

2019-05-28 11:11:15 +03:00

soft_actor_critic_agent.html

fixed release version (#333 )

2019-05-28 11:11:15 +03:00

td3_agent.html

TD3 (#338 )

2019-06-16 11:11:21 +03:00

value_optimization_agent.html

TD3 (#338 )

2019-06-16 11:11:21 +03:00

wolpertinger_agent.html

RL in Large Discrete Action Spaces - Wolpertinger Agent (#394 )

2019-09-08 12:53:49 +03:00