itaicaspi-intel
73cc6e39d0
bug fix for clipped ppo for discrete controls
2018-09-18 10:40:53 +03:00
Gal Novik
abaa58b559
human agent will exit when human control not supported by environment; jupyter notebooks fixes
2018-09-17 16:00:00 +03:00
itaicaspi-intel
bb76c5c726
CARLA cleanups + calculating the distance to goal
2018-09-16 16:37:04 +03:00
itaicaspi-intel
6797824892
bug fixes in the CARLA dataset downloader and extractor
2018-09-16 14:27:22 +03:00
itaicaspi-intel
23a9f00e28
fix for human control
2018-09-16 12:43:15 +03:00
itaicaspi-intel
cf892463e2
updated CARLA to allow using actions of size 3 + automatic downloading of the CARLA imitation dataset
2018-09-16 12:07:11 +03:00
itaicaspi-intel
d3c8a5d7c1
remove some accidentaly committed files
2018-09-14 18:22:04 +03:00
itaicaspi-intel
f8d3574b8c
updated CARLA to allow the usage of predefined experiment suites
2018-09-14 18:07:24 +03:00
itaicaspi-intel
e8a2b679d1
using the CoRL2017 experiment suite for CARLA_CIL
2018-09-13 16:59:22 +03:00
itaicaspi-intel
06c969951e
adding docker and jenkins files
2018-09-13 16:07:47 +03:00
itaicaspi-intel
d3f97cd93b
initial CIL implementation (WIP)
2018-09-13 15:29:29 +03:00
itaicaspi-intel
99649c1626
progress bar update
2018-09-13 15:03:24 +03:00
itaicaspi-intel
607ef17431
added a simple progress bar implementation
2018-09-13 14:21:38 +03:00
itaicaspi-intel
fa79d8d365
Carla updates
2018-09-13 11:47:36 +03:00
itaicaspi-intel
fa4895f840
new traces
2018-09-13 11:47:36 +03:00
Zach Dwiel
673911ff7f
very minor cleanup
2018-09-12 10:51:56 -04:00
itaicaspi-intel
a16d724963
removing some of the presets from the trace tests + more robust replay buffer loading
2018-09-12 15:26:16 +03:00
itaicaspi-intel
171fe97a3a
imitation related bug fixes
2018-09-12 15:26:16 +03:00
itaicaspi-intel
a9bd1047c4
load and save function for non-episodic replay buffers + carla improvements + network bug fixes
2018-09-12 15:26:16 +03:00
Itai Caspi
d59a700248
updated benchmarks for pong and breakout for dueling ddqn with PER
2018-09-06 14:05:46 +03:00
Gal Leibovich
08a557bfd1
updated the benchmarks for space invaders with dueling ddqn variants
2018-09-06 12:13:49 +03:00
Itai Caspi
72a1d9d426
Itaicaspi/episode reset refactoring ( #105 )
...
* reordering of the episode reset operation and allowing to store episodes only when they are terminated
* reordering of the episode reset operation and allowing to store episodes only when they are terminated
* revert tensorflow-gpu to 1.9.0 + bug fix in should_train()
* tests readme file and refactoring of policy optimization agent train function
* Update README.md
* Update README.md
* additional policy optimization train function simplifications
* Updated the traces after the reordering of the environment reset
* docker and jenkins files
* updated the traces to the ones from within the docker container
* updated traces and added control suite to the docker
* updated jenkins file with the intel proxy + updated doom basic a3c test params
* updated line breaks in jenkins file
* added a missing line break in jenkins file
* refining trace tests ignored presets + adding a configurable beta entropy value
* switch the order of trace and golden tests in jenkins + fix golden tests processes not killed issue
* updated benchmarks for dueling ddqn breakout and pong
* allowing dynamic updates to the loss weights + bug fix in episode.update_returns
* remove docker and jenkins file
2018-09-04 15:07:54 +03:00
Shadi Endrawis
7086492127
parallel trace tests fix
2018-09-03 20:47:10 +03:00
itaicaspi-intel
2c62a40466
bug fix in dueling network + revert to TF 1.6 for CPU due to requirements compatibility issues
2018-09-02 13:38:16 +03:00
Itai Caspi
3a399d1361
Tensorflow 1.10 and python 3.6 ( #104 )
...
* updating setup.py to install tensorflow 1.10 both on cpu and on gpu
* allow python 3.6
2018-09-02 10:12:00 +03:00
Gal Leibovich
5aca3a5ed1
Update README.md
2018-08-30 23:33:44 +03:00
Itai Caspi
55c3034f4d
Update README.md
2018-08-30 23:25:10 +03:00
Itai Caspi
e5526b98f8
Update README.md
2018-08-30 22:58:37 +03:00
Gal Leibovich
d862a3be83
rainbow dqn hyper-parameter updates
2018-08-30 20:41:38 +03:00
Shadi Endrawis
07db625987
Running trace tests in parallel + other small fixes
2018-08-30 19:35:10 +03:00
Gal Leibovich
ebe574e463
add missing hidden layer in rainbow_q_head
2018-08-30 19:34:27 +03:00
Gal Leibovich
ea294de7fd
adding dueling support for rainbow dqn (now only missing n-step)
2018-08-30 18:15:59 +03:00
Gal Leibovich
d2623c0eee
bug-fix in dueling dqn
2018-08-30 18:14:53 +03:00
Gal Leibovich
bbe7ac3338
Rainbow DQN agent (WIP - still missing dueling and n-step) + adding support for Prioritized ER for C51
2018-08-30 18:14:53 +03:00
itaicaspi-intel
fd2f4b0852
bug fix in HRL HER memory + some small improvements
2018-08-29 14:36:18 +03:00
Gal Leibovich
1aa2ab0590
parameter noise exploration - using Noisy Nets
2018-08-27 18:19:01 +03:00
itaicaspi-intel
658b437079
removing datasets + imports optimization
2018-08-27 10:54:11 +03:00
Gal Leibovich
d826382b11
removing test from Doom_Health_Supreme_DFP + relaxing time limit on reward tests
2018-08-26 18:42:41 +03:00
Gal Leibovich
2021490caa
small adjustment to golden tests + fixes for Doom_Health_DFP and Doom_Health_Supreme_DFP
2018-08-26 18:42:41 +03:00
Itai Caspi
3fd0bf4f0f
Update README.md
2018-08-26 12:09:46 +03:00
Gal Leibovich
9bb7bd2e9c
bug-fix in local_batch_run_coach and rename to run_multiple_seeds
2018-08-23 14:39:11 +03:00
Gal Leibovich
a4471389a4
brightened starcraft.gif
2018-08-20 13:50:09 +03:00
Gal Leibovich
904570000a
Update README.md
2018-08-20 12:04:29 +03:00
Gal Leibovich
5e275e9795
update starcraft gif
2018-08-20 11:49:19 +03:00
Shadi Endrawis
3abb6cd415
Trace tests update
2018-08-20 13:01:30 +03:00
Gal Leibovich
c1f428666e
bug-fix for checkpointing for single-worker algorithms
2018-08-19 20:17:15 +03:00
Itai Caspi
9f599f38cf
Update README.md
2018-08-19 13:09:06 +03:00
Itai Caspi
c5165cd7d6
benchmarks and pip package updates
2018-08-19 14:23:20 +03:00
Gal Leibovich
23d2945bf8
Update README.md
2018-08-19 11:02:45 +03:00
Itai Caspi
0be4a42701
updates needed for the pip package
2018-08-19 10:39:03 +03:00