1
0
mirror of https://github.com/gryf/coach.git synced 2025-12-17 19:20:19 +01:00
Commit Graph

172 Commits

Author SHA1 Message Date
itaicaspi-intel
73cc6e39d0 bug fix for clipped ppo for discrete controls 2018-09-18 10:40:53 +03:00
Gal Novik
abaa58b559 human agent will exit when human control not supported by environment; jupyter notebooks fixes 2018-09-17 16:00:00 +03:00
itaicaspi-intel
bb76c5c726 CARLA cleanups + calculating the distance to goal 2018-09-16 16:37:04 +03:00
itaicaspi-intel
6797824892 bug fixes in the CARLA dataset downloader and extractor 2018-09-16 14:27:22 +03:00
itaicaspi-intel
23a9f00e28 fix for human control 2018-09-16 12:43:15 +03:00
itaicaspi-intel
cf892463e2 updated CARLA to allow using actions of size 3 + automatic downloading of the CARLA imitation dataset 2018-09-16 12:07:11 +03:00
itaicaspi-intel
d3c8a5d7c1 remove some accidentaly committed files 2018-09-14 18:22:04 +03:00
itaicaspi-intel
f8d3574b8c updated CARLA to allow the usage of predefined experiment suites 2018-09-14 18:07:24 +03:00
itaicaspi-intel
e8a2b679d1 using the CoRL2017 experiment suite for CARLA_CIL 2018-09-13 16:59:22 +03:00
itaicaspi-intel
06c969951e adding docker and jenkins files 2018-09-13 16:07:47 +03:00
itaicaspi-intel
d3f97cd93b initial CIL implementation (WIP) 2018-09-13 15:29:29 +03:00
itaicaspi-intel
99649c1626 progress bar update 2018-09-13 15:03:24 +03:00
itaicaspi-intel
607ef17431 added a simple progress bar implementation 2018-09-13 14:21:38 +03:00
itaicaspi-intel
fa79d8d365 Carla updates 2018-09-13 11:47:36 +03:00
itaicaspi-intel
fa4895f840 new traces 2018-09-13 11:47:36 +03:00
Zach Dwiel
673911ff7f very minor cleanup 2018-09-12 10:51:56 -04:00
itaicaspi-intel
a16d724963 removing some of the presets from the trace tests + more robust replay buffer loading 2018-09-12 15:26:16 +03:00
itaicaspi-intel
171fe97a3a imitation related bug fixes 2018-09-12 15:26:16 +03:00
itaicaspi-intel
a9bd1047c4 load and save function for non-episodic replay buffers + carla improvements + network bug fixes 2018-09-12 15:26:16 +03:00
Itai Caspi
d59a700248 updated benchmarks for pong and breakout for dueling ddqn with PER 2018-09-06 14:05:46 +03:00
Gal Leibovich
08a557bfd1 updated the benchmarks for space invaders with dueling ddqn variants 2018-09-06 12:13:49 +03:00
Itai Caspi
72a1d9d426 Itaicaspi/episode reset refactoring (#105)
* reordering of the episode reset operation and allowing to store episodes only when they are terminated

* reordering of the episode reset operation and allowing to store episodes only when they are terminated

* revert tensorflow-gpu to 1.9.0 + bug fix in should_train()

* tests readme file and refactoring of policy optimization agent train function

* Update README.md

* Update README.md

* additional policy optimization train function simplifications

* Updated the traces after the reordering of the environment reset

* docker and jenkins files

* updated the traces to the ones from within the docker container

* updated traces and added control suite to the docker

* updated jenkins file with the intel proxy + updated doom basic a3c test params

* updated line breaks in jenkins file

* added a missing line break in jenkins file

* refining trace tests ignored presets + adding a configurable beta entropy value

* switch the order of trace and golden tests in jenkins + fix golden tests processes not killed issue

* updated benchmarks for dueling ddqn breakout and pong

* allowing dynamic updates to the loss weights + bug fix in episode.update_returns

* remove docker and jenkins file
2018-09-04 15:07:54 +03:00
Shadi Endrawis
7086492127 parallel trace tests fix 2018-09-03 20:47:10 +03:00
itaicaspi-intel
2c62a40466 bug fix in dueling network + revert to TF 1.6 for CPU due to requirements compatibility issues 2018-09-02 13:38:16 +03:00
Itai Caspi
3a399d1361 Tensorflow 1.10 and python 3.6 (#104)
* updating setup.py to install tensorflow 1.10 both on cpu and on gpu

* allow python 3.6
2018-09-02 10:12:00 +03:00
Gal Leibovich
5aca3a5ed1 Update README.md 2018-08-30 23:33:44 +03:00
Itai Caspi
55c3034f4d Update README.md 2018-08-30 23:25:10 +03:00
Itai Caspi
e5526b98f8 Update README.md 2018-08-30 22:58:37 +03:00
Gal Leibovich
d862a3be83 rainbow dqn hyper-parameter updates 2018-08-30 20:41:38 +03:00
Shadi Endrawis
07db625987 Running trace tests in parallel + other small fixes 2018-08-30 19:35:10 +03:00
Gal Leibovich
ebe574e463 add missing hidden layer in rainbow_q_head 2018-08-30 19:34:27 +03:00
Gal Leibovich
ea294de7fd adding dueling support for rainbow dqn (now only missing n-step) 2018-08-30 18:15:59 +03:00
Gal Leibovich
d2623c0eee bug-fix in dueling dqn 2018-08-30 18:14:53 +03:00
Gal Leibovich
bbe7ac3338 Rainbow DQN agent (WIP - still missing dueling and n-step) + adding support for Prioritized ER for C51 2018-08-30 18:14:53 +03:00
itaicaspi-intel
fd2f4b0852 bug fix in HRL HER memory + some small improvements 2018-08-29 14:36:18 +03:00
Gal Leibovich
1aa2ab0590 parameter noise exploration - using Noisy Nets 2018-08-27 18:19:01 +03:00
itaicaspi-intel
658b437079 removing datasets + imports optimization 2018-08-27 10:54:11 +03:00
Gal Leibovich
d826382b11 removing test from Doom_Health_Supreme_DFP + relaxing time limit on reward tests 2018-08-26 18:42:41 +03:00
Gal Leibovich
2021490caa small adjustment to golden tests + fixes for Doom_Health_DFP and Doom_Health_Supreme_DFP 2018-08-26 18:42:41 +03:00
Itai Caspi
3fd0bf4f0f Update README.md 2018-08-26 12:09:46 +03:00
Gal Leibovich
9bb7bd2e9c bug-fix in local_batch_run_coach and rename to run_multiple_seeds 2018-08-23 14:39:11 +03:00
Gal Leibovich
a4471389a4 brightened starcraft.gif 2018-08-20 13:50:09 +03:00
Gal Leibovich
904570000a Update README.md 2018-08-20 12:04:29 +03:00
Gal Leibovich
5e275e9795 update starcraft gif 2018-08-20 11:49:19 +03:00
Shadi Endrawis
3abb6cd415 Trace tests update 2018-08-20 13:01:30 +03:00
Gal Leibovich
c1f428666e bug-fix for checkpointing for single-worker algorithms 2018-08-19 20:17:15 +03:00
Itai Caspi
9f599f38cf Update README.md 2018-08-19 13:09:06 +03:00
Itai Caspi
c5165cd7d6 benchmarks and pip package updates 2018-08-19 14:23:20 +03:00
Gal Leibovich
23d2945bf8 Update README.md 2018-08-19 11:02:45 +03:00
Itai Caspi
0be4a42701 updates needed for the pip package 2018-08-19 10:39:03 +03:00