coach

gryf/coach

mirror of https://github.com/gryf/coach.git synced 2026-07-08 02:16:32 +02:00

Author	SHA1	Message	Date
Zach Dwiel	d0248e03c6	add meaningful error message in the event that the action space is not one that can be used (#151 )	2018-12-11 09:09:24 +02:00
Itai Caspi	3fd433ffab	fix ddpg head (#78 )	2018-11-09 08:17:04 -08:00
Itai Caspi	83e0b09a6a	adding the missing export_onnx_graph parameter to task parameters (#73 )	2018-11-08 12:52:42 +02:00
Itai Caspi	811152126c	Export graph to ONNX (#61 ) Implements the ONNX graph exporting feature. Currently does not work for NAF, C51 and A3C_LSTM due to unsupported TF layers in the tf2onnx library.	2018-11-06 10:55:21 +02:00
Sina Afrooze	a888226641	Move embedder, middleware, and head parameters to framework agnostic modules. (#45 ) Part of #28	2018-10-29 14:46:40 -07:00
Shadi Endrawis	51726a5b80	network_imporvements branch merge	2018-10-02 13:43:36 +03:00
Gal Leibovich	72ea933384	bug-fix for clipped_ppo not logging several signals + small cleanup	2018-10-02 14:22:37 +03:00
itaicaspi-intel	d3f97cd93b	initial CIL implementation (WIP)	2018-09-13 15:29:29 +03:00
itaicaspi-intel	a9bd1047c4	load and save function for non-episodic replay buffers + carla improvements + network bug fixes	2018-09-12 15:26:16 +03:00
Itai Caspi	72a1d9d426	Itaicaspi/episode reset refactoring (#105 ) * reordering of the episode reset operation and allowing to store episodes only when they are terminated * reordering of the episode reset operation and allowing to store episodes only when they are terminated * revert tensorflow-gpu to 1.9.0 + bug fix in should_train() * tests readme file and refactoring of policy optimization agent train function * Update README.md * Update README.md * additional policy optimization train function simplifications * Updated the traces after the reordering of the environment reset * docker and jenkins files * updated the traces to the ones from within the docker container * updated traces and added control suite to the docker * updated jenkins file with the intel proxy + updated doom basic a3c test params * updated line breaks in jenkins file * added a missing line break in jenkins file * refining trace tests ignored presets + adding a configurable beta entropy value * switch the order of trace and golden tests in jenkins + fix golden tests processes not killed issue * updated benchmarks for dueling ddqn breakout and pong * allowing dynamic updates to the loss weights + bug fix in episode.update_returns * remove docker and jenkins file	2018-09-04 15:07:54 +03:00
itaicaspi-intel	2c62a40466	bug fix in dueling network + revert to TF 1.6 for CPU due to requirements compatibility issues	2018-09-02 13:38:16 +03:00
Gal Leibovich	ebe574e463	add missing hidden layer in rainbow_q_head	2018-08-30 19:34:27 +03:00
Gal Leibovich	ea294de7fd	adding dueling support for rainbow dqn (now only missing n-step)	2018-08-30 18:15:59 +03:00
Gal Leibovich	d2623c0eee	bug-fix in dueling dqn	2018-08-30 18:14:53 +03:00
Gal Leibovich	bbe7ac3338	Rainbow DQN agent (WIP - still missing dueling and n-step) + adding support for Prioritized ER for C51	2018-08-30 18:14:53 +03:00
Gal Leibovich	1aa2ab0590	parameter noise exploration - using Noisy Nets	2018-08-27 18:19:01 +03:00
itaicaspi-intel	658b437079	removing datasets + imports optimization	2018-08-27 10:54:11 +03:00
Gal Novik	19ca5c24b1	pre-release 0.10.0	2018-08-13 17:11:34 +03:00

18 Commits