coach

gryf/coach

Fork 0

mirror of https://github.com/gryf/coach.git synced 2025-12-17 19:20:19 +01:00

Commit Graph

Author	SHA1	Message	Date
Gal Leibovich	138ced23ba	RL in Large Discrete Action Spaces - Wolpertinger Agent (#394 ) * Currently this is specific to the case of discretizing a continuous action space. Can easily be adapted to other case by feeding the kNN otherwise, and removing the usage of a discretizing output action filter	2019-09-08 12:53:49 +03:00
Gal Leibovich	7eb884c5b2	TD3 (#338 )	2019-06-16 11:11:21 +03:00

Author

SHA1

Message

Date

Gal Leibovich

138ced23ba

RL in Large Discrete Action Spaces - Wolpertinger Agent (#394 )

* Currently this is specific to the case of discretizing a continuous action space. Can easily be adapted to other case by feeding the kNN otherwise, and removing the usage of a discretizing output action filter

2019-09-08 12:53:49 +03:00

Gal Leibovich

7eb884c5b2

TD3 (#338 )

2019-06-16 11:11:21 +03:00

2 Commits