coach

gryf/coach

mirror of https://github.com/gryf/coach.git synced 2026-07-07 01:46:31 +02:00

Author	SHA1	Message	Date
Zach Dwiel	e34b9ae9cf	allow specifying preset as a commandline parameter to rollout worker	2018-10-23 16:40:33 -04:00
Zach Dwiel	3714d8ec80	extract functions display_all_presets_and_exit, expand_preset	2018-10-23 16:40:33 -04:00
Ajay Deshpande	21f8ca3978	Removing comments and pytests	2018-10-23 16:40:33 -04:00
Ajay Deshpande	5a54f67a63	Adding distributed experience replay	2018-10-23 16:40:33 -04:00
Zach Dwiel	747000647f	add dockerfile	2018-10-23 16:40:33 -04:00
Zach Dwiel	bc664c4169	add the first pass of rollout_worker.py	2018-10-23 16:40:33 -04:00
Zach Dwiel	61ed6b8ce4	add better defaults to TaskParameters	2018-10-23 16:40:33 -04:00
Zach Dwiel	5758c2f23e	typo; increased detail in comment	2018-10-23 16:35:06 -04:00
Zach Dwiel	a1295d16b3	first pass that transition collection interface	2018-10-23 16:35:06 -04:00
Zach Dwiel	dc77c54ad9	add to gitignore	2018-10-23 16:35:06 -04:00
Zach Dwiel	9f1f9e5ab4	replace ExperienceReplay._num_transitions with len(ExperienceReplay.transitions)	2018-10-23 16:34:38 -04:00
Zach Dwiel	cccfe88f9b	remove unused method: update_last_transition_info	2018-10-23 16:34:38 -04:00
Zach Dwiel	fb21251157	add horizontal scaling document	2018-10-23 16:34:38 -04:00
Gal Leibovich	5a8da90d32	bug-fix for dumping movies (+ small refactoring and rename 'VideoDumpMethod -> 'VideoDumpFilter')	2018-10-21 17:29:10 +03:00
Shadi Endrawis	364168490f	checkpointing fix	2018-10-07 20:06:08 +03:00
Gal Novik	5c4f9d58dd	renamed quick start guide tutorial	2018-10-03 18:15:29 +03:00
Shadi Endrawis	f7990d4003	trace tests update	2018-10-02 17:55:16 +03:00
Shadi Endrawis	51726a5b80	network_imporvements branch merge	2018-10-02 13:43:36 +03:00
Gal Leibovich	72ea933384	bug-fix for clipped_ppo not logging several signals + small cleanup	2018-10-02 14:22:37 +03:00
itaicaspi-intel	73cc6e39d0	bug fix for clipped ppo for discrete controls	2018-09-18 10:40:53 +03:00
Gal Novik	abaa58b559	human agent will exit when human control not supported by environment; jupyter notebooks fixes	2018-09-17 16:00:00 +03:00
itaicaspi-intel	bb76c5c726	CARLA cleanups + calculating the distance to goal	2018-09-16 16:37:04 +03:00
itaicaspi-intel	6797824892	bug fixes in the CARLA dataset downloader and extractor	2018-09-16 14:27:22 +03:00
itaicaspi-intel	23a9f00e28	fix for human control	2018-09-16 12:43:15 +03:00
itaicaspi-intel	cf892463e2	updated CARLA to allow using actions of size 3 + automatic downloading of the CARLA imitation dataset	2018-09-16 12:07:11 +03:00
itaicaspi-intel	d3c8a5d7c1	remove some accidentaly committed files	2018-09-14 18:22:04 +03:00
itaicaspi-intel	f8d3574b8c	updated CARLA to allow the usage of predefined experiment suites	2018-09-14 18:07:24 +03:00
itaicaspi-intel	e8a2b679d1	using the CoRL2017 experiment suite for CARLA_CIL	2018-09-13 16:59:22 +03:00
itaicaspi-intel	06c969951e	adding docker and jenkins files	2018-09-13 16:07:47 +03:00
itaicaspi-intel	d3f97cd93b	initial CIL implementation (WIP)	2018-09-13 15:29:29 +03:00
itaicaspi-intel	99649c1626	progress bar update	2018-09-13 15:03:24 +03:00
itaicaspi-intel	607ef17431	added a simple progress bar implementation	2018-09-13 14:21:38 +03:00
itaicaspi-intel	fa79d8d365	Carla updates	2018-09-13 11:47:36 +03:00
itaicaspi-intel	fa4895f840	new traces	2018-09-13 11:47:36 +03:00
Zach Dwiel	673911ff7f	very minor cleanup	2018-09-12 10:51:56 -04:00
itaicaspi-intel	a16d724963	removing some of the presets from the trace tests + more robust replay buffer loading	2018-09-12 15:26:16 +03:00
itaicaspi-intel	171fe97a3a	imitation related bug fixes	2018-09-12 15:26:16 +03:00
itaicaspi-intel	a9bd1047c4	load and save function for non-episodic replay buffers + carla improvements + network bug fixes	2018-09-12 15:26:16 +03:00
Itai Caspi	d59a700248	updated benchmarks for pong and breakout for dueling ddqn with PER	2018-09-06 14:05:46 +03:00
Gal Leibovich	08a557bfd1	updated the benchmarks for space invaders with dueling ddqn variants	2018-09-06 12:13:49 +03:00
Itai Caspi	72a1d9d426	Itaicaspi/episode reset refactoring (#105 ) * reordering of the episode reset operation and allowing to store episodes only when they are terminated * reordering of the episode reset operation and allowing to store episodes only when they are terminated * revert tensorflow-gpu to 1.9.0 + bug fix in should_train() * tests readme file and refactoring of policy optimization agent train function * Update README.md * Update README.md * additional policy optimization train function simplifications * Updated the traces after the reordering of the environment reset * docker and jenkins files * updated the traces to the ones from within the docker container * updated traces and added control suite to the docker * updated jenkins file with the intel proxy + updated doom basic a3c test params * updated line breaks in jenkins file * added a missing line break in jenkins file * refining trace tests ignored presets + adding a configurable beta entropy value * switch the order of trace and golden tests in jenkins + fix golden tests processes not killed issue * updated benchmarks for dueling ddqn breakout and pong * allowing dynamic updates to the loss weights + bug fix in episode.update_returns * remove docker and jenkins file	2018-09-04 15:07:54 +03:00
Shadi Endrawis	7086492127	parallel trace tests fix	2018-09-03 20:47:10 +03:00
itaicaspi-intel	2c62a40466	bug fix in dueling network + revert to TF 1.6 for CPU due to requirements compatibility issues	2018-09-02 13:38:16 +03:00
Itai Caspi	3a399d1361	Tensorflow 1.10 and python 3.6 (#104 ) * updating setup.py to install tensorflow 1.10 both on cpu and on gpu * allow python 3.6	2018-09-02 10:12:00 +03:00
Gal Leibovich	5aca3a5ed1	Update README.md	2018-08-30 23:33:44 +03:00
Itai Caspi	55c3034f4d	Update README.md	2018-08-30 23:25:10 +03:00
Itai Caspi	e5526b98f8	Update README.md	2018-08-30 22:58:37 +03:00
Gal Leibovich	d862a3be83	rainbow dqn hyper-parameter updates	2018-08-30 20:41:38 +03:00
Shadi Endrawis	07db625987	Running trace tests in parallel + other small fixes	2018-08-30 19:35:10 +03:00
Gal Leibovich	ebe574e463	add missing hidden layer in rainbow_q_head	2018-08-30 19:34:27 +03:00

1 2 3 4

191 Commits