coach

gryf/coach

mirror of https://github.com/gryf/coach.git synced 2025-12-17 19:20:19 +01:00

Author	SHA1	Message	Date
Ajay Deshpande	16b3e99f37	Setup basic CI flow (#38 ) Adds automated running of unit, integration tests (and optionally longer running tests)	2018-10-24 18:27:58 -07:00
zach dwiel	3ba0df7d07	update GraphManager.act specified return type	2018-10-23 19:58:17 -04:00
Zach Dwiel	700a175902	rename save_checkpoint_secs -> checkpoint_save_secs	2018-10-23 17:10:58 -04:00
Zach Dwiel	9804b033a2	rename save_checkpoint_dir -> checkpoint_save_dir	2018-10-23 17:10:58 -04:00
Zach Dwiel	201a2237a1	restructure looping mechanism inGraphManager	2018-10-23 17:10:58 -04:00
Zach Dwiel	52560a2aae	introduce property GraphManager.current_step_counter	2018-10-23 17:10:04 -04:00
Zach Dwiel	776c94d551	reorder methods in GraphManager	2018-10-23 17:10:04 -04:00
Zach Dwiel	496a516de1	rename GraphManager.sync_graph -> sync	2018-10-23 17:08:29 -04:00
Zach Dwiel	5fee48dcfd	remove argument keep_networks_in_sync from GraphManager.act, and move this feature into the only place that activated it: GraphManager.train_and_act	2018-10-23 17:08:29 -04:00
Zach Dwiel	b2d864a5bd	remove out of date documentation	2018-10-23 17:08:29 -04:00
Zach Dwiel	d32d909238	move only invocation of GraphManager.handle_episode_ended inline	2018-10-23 17:08:29 -04:00
Zach Dwiel	18d84c5037	remove unnecessary timers from GraphManager	2018-10-23 16:58:17 -04:00
Zach Dwiel	cd30efe52e	remove unnecessary test result is None in GraphManager.act	2018-10-23 16:57:43 -04:00
Zach Dwiel	35d67cbd9b	use phase context in GraphManager.evaluate	2018-10-23 16:57:43 -04:00
Zach Dwiel	d3c341147e	simplify GraphManager.act by removing arguments: continue_until_game_over and return_on_game_over	2018-10-23 16:57:43 -04:00
Zach Dwiel	8be980912c	fixed typo from earlier commit	2018-10-23 16:57:43 -04:00
Zach Dwiel	517aac163a	introduce graph_manager.phase_context; make sure that calls to graph_manager.train automatically set training phase	2018-10-23 16:57:43 -04:00
Zach Dwiel	7382a142bb	remove unused steps parameter from GraphManager.train	2018-10-23 16:57:06 -04:00
Zach Dwiel	ad68fa263d	remove property GraphManager.training_start_time	2018-10-23 16:57:05 -04:00
Zach Dwiel	01f3a0594b	remove return values from GraphManager.act	2018-10-23 16:57:05 -04:00
Zach Dwiel	b02f269464	graph_manager:heatup uses total_steps_counters looping mechanism like other loops. graph_manager:act no longer needs to return any values	2018-10-23 16:57:05 -04:00
Ajay Deshpande	0e121c5762	Ignoring redis sub if testing	2018-10-23 16:55:37 -04:00
Ajay Deshpande	a7f5442015	Adding should_train helper and should_train in graph_manager	2018-10-23 16:54:43 -04:00
Balaji Subramaniam	844a5af831	Make distributed coach work end-to-end. - With data store, memory backend and orchestrator interfaces.	2018-10-23 16:54:43 -04:00
Zach Dwiel	9f92064e67	cleanup graph_manager:act	2018-10-23 16:53:32 -04:00
Zach Dwiel	ed3a3b39be	add comments	2018-10-23 16:52:16 -04:00
Zach Dwiel	13d81f65b9	add redis options to training worker	2018-10-23 16:47:46 -04:00
Zach Dwiel	6541bc76b9	working checkpoints	2018-10-23 16:41:57 -04:00
Zach Dwiel	433bc3e27b	standardizing variable access	2018-10-23 16:40:33 -04:00
Gal Leibovich	5a8da90d32	bug-fix for dumping movies (+ small refactoring and rename 'VideoDumpMethod -> 'VideoDumpFilter')	2018-10-21 17:29:10 +03:00
Shadi Endrawis	364168490f	checkpointing fix	2018-10-07 20:06:08 +03:00
Shadi Endrawis	51726a5b80	network_imporvements branch merge	2018-10-02 13:43:36 +03:00
Zach Dwiel	673911ff7f	very minor cleanup	2018-09-12 10:51:56 -04:00
itaicaspi-intel	171fe97a3a	imitation related bug fixes	2018-09-12 15:26:16 +03:00
Itai Caspi	72a1d9d426	Itaicaspi/episode reset refactoring (#105 ) * reordering of the episode reset operation and allowing to store episodes only when they are terminated * reordering of the episode reset operation and allowing to store episodes only when they are terminated * revert tensorflow-gpu to 1.9.0 + bug fix in should_train() * tests readme file and refactoring of policy optimization agent train function * Update README.md * Update README.md * additional policy optimization train function simplifications * Updated the traces after the reordering of the environment reset * docker and jenkins files * updated the traces to the ones from within the docker container * updated traces and added control suite to the docker * updated jenkins file with the intel proxy + updated doom basic a3c test params * updated line breaks in jenkins file * added a missing line break in jenkins file * refining trace tests ignored presets + adding a configurable beta entropy value * switch the order of trace and golden tests in jenkins + fix golden tests processes not killed issue * updated benchmarks for dueling ddqn breakout and pong * allowing dynamic updates to the loss weights + bug fix in episode.update_returns * remove docker and jenkins file	2018-09-04 15:07:54 +03:00
itaicaspi-intel	658b437079	removing datasets + imports optimization	2018-08-27 10:54:11 +03:00
Gal Leibovich	c1f428666e	bug-fix for checkpointing for single-worker algorithms	2018-08-19 20:17:15 +03:00
Itai Caspi	1de04d6fee	updated gifs in README + fix for multiworker crashes + improved Atari DQN and Dueling DDQN presets	2018-08-16 18:23:32 +03:00
Gal Novik	19ca5c24b1	pre-release 0.10.0	2018-08-13 17:11:34 +03:00

39 Commits