coach

gryf/coach

mirror of https://github.com/gryf/coach.git synced 2026-02-12 11:45:45 +01:00

Author	SHA1	Message	Date
Sina Afrooze	67eb9e4c28	Adding checkpointing framework (#74 ) * Adding checkpointing framework as well as mxnet checkpointing implementation. - MXNet checkpoint for each network is saved in a separate file. * Adding checkpoint restore for mxnet to graph-manager * Add unit-test for get_checkpoint_state() * Added match.group() to fix unit-test failing on CI * Added ONNX export support for MXNet	2018-11-19 19:45:49 +02:00
Thom Lane	7ba1a4393f	Channel order transpose, for image embedder. Updated unit test. (#87 )	2018-11-19 15:39:03 +02:00
Thom Lane	81bac050d7	Added Custom Initialisation for MXNet Heads (#86 ) * Added NormalizedRSSInitializer, using same method as TensorFlow backend, but changed name since ‘columns’ have different meaning in dense layer weight matrix in MXNet. * Added unit test for NormalizedRSSInitializer.	2018-11-16 08:15:43 -08:00
Scott Leishman	524f8436a2	create per environment Dockerfiles. (#70 ) * create per environment Dockerfiles. Adjust CI setup to better parallelize runs. Fix a couple of issues in golden and trace tests. Update a few of the docs. * bugfix in mmc agent. Also install kubectl for CI, update badge branch. * remove integration test parallelism.	2018-11-14 07:40:22 -08:00
Gal Leibovich	49dea39d34	N-step returns for rainbow (#67 ) * n_step returns for rainbow * Rename CartPole_PPO -> CartPole_ClippedPPO	2018-11-07 18:33:08 +02:00
Sina Afrooze	5fadb9c18e	Adding mxnet components to rl_coach/architectures (#60 ) Adding mxnet components to rl_coach architectures. - Supports PPO and DQN - Tested with CartPole_PPO and CarPole_DQN - Normalizing filters don't work right now (see #49) and are disabled in CartPole_PPO preset - Checkpointing is disabled for MXNet	2018-11-07 17:07:15 +02:00
Sina Afrooze	95b4fc6888	Added ability to switch between tensorflow and mxnet using -f commandline argument. (#48 ) NOTE: tensorflow framework works fine if mxnet is not installed in env, but mxnet will not work if tensorflow is not installed because of the code in network_wrapper.	2018-10-30 15:29:34 -07:00
Ajay Deshpande	16b3e99f37	Setup basic CI flow (#38 ) Adds automated running of unit, integration tests (and optionally longer running tests)	2018-10-24 18:27:58 -07:00
zach dwiel	430ca198e5	convert golden tests into pytest format	2018-10-23 19:58:17 -04:00
zach dwiel	787ab42578	remove extra call to super().store_episode	2018-10-23 19:58:17 -04:00
Zach Dwiel	201a2237a1	restructure looping mechanism inGraphManager	2018-10-23 17:10:58 -04:00
Zach Dwiel	fbaf19543e	capture stdout during preset tests	2018-10-23 16:57:43 -04:00
Zach Dwiel	517aac163a	introduce graph_manager.phase_context; make sure that calls to graph_manager.train automatically set training phase	2018-10-23 16:57:43 -04:00
Zach Dwiel	97f608ee5e	reorder failing presets	2018-10-23 16:57:05 -04:00
Zach Dwiel	bfc320cf83	disable failing tests for now	2018-10-23 16:57:05 -04:00
Zach Dwiel	b5305bd075	update dockerfile	2018-10-23 16:52:16 -04:00
Zach Dwiel	950f261201	extract method all_presets	2018-10-23 16:52:16 -04:00
Zach Dwiel	a54ef2757f	ignore deprecation warnings in test logging	2018-10-23 16:51:48 -04:00
Zach Dwiel	acc7f70de3	enumerate each preset as its own test	2018-10-23 16:51:48 -04:00
Shadi Endrawis	f7990d4003	trace tests update	2018-10-02 17:55:16 +03:00
itaicaspi-intel	a16d724963	removing some of the presets from the trace tests + more robust replay buffer loading	2018-09-12 15:26:16 +03:00
Itai Caspi	72a1d9d426	Itaicaspi/episode reset refactoring (#105 ) * reordering of the episode reset operation and allowing to store episodes only when they are terminated * reordering of the episode reset operation and allowing to store episodes only when they are terminated * revert tensorflow-gpu to 1.9.0 + bug fix in should_train() * tests readme file and refactoring of policy optimization agent train function * Update README.md * Update README.md * additional policy optimization train function simplifications * Updated the traces after the reordering of the environment reset * docker and jenkins files * updated the traces to the ones from within the docker container * updated traces and added control suite to the docker * updated jenkins file with the intel proxy + updated doom basic a3c test params * updated line breaks in jenkins file * added a missing line break in jenkins file * refining trace tests ignored presets + adding a configurable beta entropy value * switch the order of trace and golden tests in jenkins + fix golden tests processes not killed issue * updated benchmarks for dueling ddqn breakout and pong * allowing dynamic updates to the loss weights + bug fix in episode.update_returns * remove docker and jenkins file	2018-09-04 15:07:54 +03:00
Shadi Endrawis	7086492127	parallel trace tests fix	2018-09-03 20:47:10 +03:00
Shadi Endrawis	07db625987	Running trace tests in parallel + other small fixes	2018-08-30 19:35:10 +03:00
Gal Leibovich	d826382b11	removing test from Doom_Health_Supreme_DFP + relaxing time limit on reward tests	2018-08-26 18:42:41 +03:00
Gal Leibovich	2021490caa	small adjustment to golden tests + fixes for Doom_Health_DFP and Doom_Health_Supreme_DFP	2018-08-26 18:42:41 +03:00
Shadi Endrawis	3abb6cd415	Trace tests update	2018-08-20 13:01:30 +03:00
Gal Novik	19ca5c24b1	pre-release 0.10.0	2018-08-13 17:11:34 +03:00

28 Commits