1
0
mirror of https://github.com/gryf/coach.git synced 2025-12-18 11:40:18 +01:00
Commit Graph

302 Commits

Author SHA1 Message Date
Zach Dwiel
0812a94fbd first pass at kubernetes 2018-10-23 16:47:46 -04:00
Zach Dwiel
3328b25549 reenable redis; better error message 2018-10-23 16:47:46 -04:00
Zach Dwiel
009cf670f3 fix simple typos; temporarily disable redis in rollout worker 2018-10-23 16:47:46 -04:00
Zach Dwiel
f5b7122d56 weight for checkpoint before trying to start rollout worker 2018-10-23 16:47:46 -04:00
Zach Dwiel
4352d6735d add training worker 2018-10-23 16:47:46 -04:00
Ajay Deshpande
28926bf2a4 Changing parameters 2018-10-23 16:47:46 -04:00
Ajay Deshpande
c2991819b4 Adding right arguments to the agent 2018-10-23 16:46:04 -04:00
Ajay Deshpande
ad7f031031 Adding dockerfile 2018-10-23 16:46:04 -04:00
Ajay Deshpande
ce9838a7d6 Adding kubernetes orchestrator for rollouts, adding requirements for incremental docker builds 2018-10-23 16:46:04 -04:00
Zach Dwiel
6541bc76b9 working checkpoints 2018-10-23 16:41:57 -04:00
Zach Dwiel
433bc3e27b standardizing variable access 2018-10-23 16:40:33 -04:00
Zach Dwiel
e34b9ae9cf allow specifying preset as a commandline parameter to rollout worker 2018-10-23 16:40:33 -04:00
Zach Dwiel
3714d8ec80 extract functions display_all_presets_and_exit, expand_preset 2018-10-23 16:40:33 -04:00
Ajay Deshpande
21f8ca3978 Removing comments and pytests 2018-10-23 16:40:33 -04:00
Ajay Deshpande
5a54f67a63 Adding distributed experience replay 2018-10-23 16:40:33 -04:00
Zach Dwiel
747000647f add dockerfile 2018-10-23 16:40:33 -04:00
Zach Dwiel
bc664c4169 add the first pass of rollout_worker.py 2018-10-23 16:40:33 -04:00
Zach Dwiel
61ed6b8ce4 add better defaults to TaskParameters 2018-10-23 16:40:33 -04:00
Zach Dwiel
5758c2f23e typo; increased detail in comment 2018-10-23 16:35:06 -04:00
Zach Dwiel
a1295d16b3 first pass that transition collection interface 2018-10-23 16:35:06 -04:00
Zach Dwiel
dc77c54ad9 add to gitignore 2018-10-23 16:35:06 -04:00
Zach Dwiel
9f1f9e5ab4 replace ExperienceReplay._num_transitions with len(ExperienceReplay.transitions) 2018-10-23 16:34:38 -04:00
Zach Dwiel
cccfe88f9b remove unused method: update_last_transition_info 2018-10-23 16:34:38 -04:00
Zach Dwiel
fb21251157 add horizontal scaling document 2018-10-23 16:34:38 -04:00
Gal Leibovich
5a8da90d32 bug-fix for dumping movies (+ small refactoring and rename 'VideoDumpMethod -> 'VideoDumpFilter') 2018-10-21 17:29:10 +03:00
Shadi Endrawis
364168490f checkpointing fix 2018-10-07 20:06:08 +03:00
Gal Novik
5c4f9d58dd renamed quick start guide tutorial 2018-10-03 18:15:29 +03:00
Shadi Endrawis
f7990d4003 trace tests update 2018-10-02 17:55:16 +03:00
Shadi Endrawis
51726a5b80 network_imporvements branch merge 2018-10-02 13:43:36 +03:00
Gal Leibovich
72ea933384 bug-fix for clipped_ppo not logging several signals + small cleanup 2018-10-02 14:22:37 +03:00
itaicaspi-intel
73cc6e39d0 bug fix for clipped ppo for discrete controls 2018-09-18 10:40:53 +03:00
Gal Novik
abaa58b559 human agent will exit when human control not supported by environment; jupyter notebooks fixes 2018-09-17 16:00:00 +03:00
itaicaspi-intel
bb76c5c726 CARLA cleanups + calculating the distance to goal 2018-09-16 16:37:04 +03:00
itaicaspi-intel
6797824892 bug fixes in the CARLA dataset downloader and extractor 2018-09-16 14:27:22 +03:00
itaicaspi-intel
23a9f00e28 fix for human control 2018-09-16 12:43:15 +03:00
itaicaspi-intel
cf892463e2 updated CARLA to allow using actions of size 3 + automatic downloading of the CARLA imitation dataset 2018-09-16 12:07:11 +03:00
itaicaspi-intel
d3c8a5d7c1 remove some accidentaly committed files 2018-09-14 18:22:04 +03:00
itaicaspi-intel
f8d3574b8c updated CARLA to allow the usage of predefined experiment suites 2018-09-14 18:07:24 +03:00
itaicaspi-intel
e8a2b679d1 using the CoRL2017 experiment suite for CARLA_CIL 2018-09-13 16:59:22 +03:00
itaicaspi-intel
06c969951e adding docker and jenkins files 2018-09-13 16:07:47 +03:00
itaicaspi-intel
d3f97cd93b initial CIL implementation (WIP) 2018-09-13 15:29:29 +03:00
itaicaspi-intel
99649c1626 progress bar update 2018-09-13 15:03:24 +03:00
itaicaspi-intel
607ef17431 added a simple progress bar implementation 2018-09-13 14:21:38 +03:00
itaicaspi-intel
fa79d8d365 Carla updates 2018-09-13 11:47:36 +03:00
itaicaspi-intel
fa4895f840 new traces 2018-09-13 11:47:36 +03:00
Zach Dwiel
673911ff7f very minor cleanup 2018-09-12 10:51:56 -04:00
itaicaspi-intel
a16d724963 removing some of the presets from the trace tests + more robust replay buffer loading 2018-09-12 15:26:16 +03:00
itaicaspi-intel
171fe97a3a imitation related bug fixes 2018-09-12 15:26:16 +03:00
itaicaspi-intel
a9bd1047c4 load and save function for non-episodic replay buffers + carla improvements + network bug fixes 2018-09-12 15:26:16 +03:00
Itai Caspi
d59a700248 updated benchmarks for pong and breakout for dueling ddqn with PER 2018-09-06 14:05:46 +03:00