Zach Dwiel
e34b9ae9cf
allow specifying preset as a commandline parameter to rollout worker
2018-10-23 16:40:33 -04:00
Zach Dwiel
3714d8ec80
extract functions display_all_presets_and_exit, expand_preset
2018-10-23 16:40:33 -04:00
Ajay Deshpande
21f8ca3978
Removing comments and pytests
2018-10-23 16:40:33 -04:00
Ajay Deshpande
5a54f67a63
Adding distributed experience replay
2018-10-23 16:40:33 -04:00
Zach Dwiel
747000647f
add dockerfile
2018-10-23 16:40:33 -04:00
Zach Dwiel
bc664c4169
add the first pass of rollout_worker.py
2018-10-23 16:40:33 -04:00
Zach Dwiel
61ed6b8ce4
add better defaults to TaskParameters
2018-10-23 16:40:33 -04:00
Zach Dwiel
5758c2f23e
typo; increased detail in comment
2018-10-23 16:35:06 -04:00
Zach Dwiel
a1295d16b3
first pass that transition collection interface
2018-10-23 16:35:06 -04:00
Zach Dwiel
dc77c54ad9
add to gitignore
2018-10-23 16:35:06 -04:00
Zach Dwiel
9f1f9e5ab4
replace ExperienceReplay._num_transitions with len(ExperienceReplay.transitions)
2018-10-23 16:34:38 -04:00
Zach Dwiel
cccfe88f9b
remove unused method: update_last_transition_info
2018-10-23 16:34:38 -04:00
Zach Dwiel
fb21251157
add horizontal scaling document
2018-10-23 16:34:38 -04:00
Gal Leibovich
5a8da90d32
bug-fix for dumping movies (+ small refactoring and rename 'VideoDumpMethod -> 'VideoDumpFilter')
2018-10-21 17:29:10 +03:00
Shadi Endrawis
364168490f
checkpointing fix
2018-10-07 20:06:08 +03:00
Gal Novik
5c4f9d58dd
renamed quick start guide tutorial
2018-10-03 18:15:29 +03:00
Shadi Endrawis
f7990d4003
trace tests update
2018-10-02 17:55:16 +03:00
Shadi Endrawis
51726a5b80
network_imporvements branch merge
2018-10-02 13:43:36 +03:00
Gal Leibovich
72ea933384
bug-fix for clipped_ppo not logging several signals + small cleanup
2018-10-02 14:22:37 +03:00
itaicaspi-intel
73cc6e39d0
bug fix for clipped ppo for discrete controls
2018-09-18 10:40:53 +03:00
Gal Novik
abaa58b559
human agent will exit when human control not supported by environment; jupyter notebooks fixes
2018-09-17 16:00:00 +03:00
itaicaspi-intel
bb76c5c726
CARLA cleanups + calculating the distance to goal
2018-09-16 16:37:04 +03:00
itaicaspi-intel
6797824892
bug fixes in the CARLA dataset downloader and extractor
2018-09-16 14:27:22 +03:00
itaicaspi-intel
23a9f00e28
fix for human control
2018-09-16 12:43:15 +03:00
itaicaspi-intel
cf892463e2
updated CARLA to allow using actions of size 3 + automatic downloading of the CARLA imitation dataset
2018-09-16 12:07:11 +03:00
itaicaspi-intel
d3c8a5d7c1
remove some accidentaly committed files
2018-09-14 18:22:04 +03:00
itaicaspi-intel
f8d3574b8c
updated CARLA to allow the usage of predefined experiment suites
2018-09-14 18:07:24 +03:00
itaicaspi-intel
e8a2b679d1
using the CoRL2017 experiment suite for CARLA_CIL
2018-09-13 16:59:22 +03:00
itaicaspi-intel
06c969951e
adding docker and jenkins files
2018-09-13 16:07:47 +03:00
itaicaspi-intel
d3f97cd93b
initial CIL implementation (WIP)
2018-09-13 15:29:29 +03:00
itaicaspi-intel
99649c1626
progress bar update
2018-09-13 15:03:24 +03:00
itaicaspi-intel
607ef17431
added a simple progress bar implementation
2018-09-13 14:21:38 +03:00
itaicaspi-intel
fa79d8d365
Carla updates
2018-09-13 11:47:36 +03:00
itaicaspi-intel
fa4895f840
new traces
2018-09-13 11:47:36 +03:00
Zach Dwiel
673911ff7f
very minor cleanup
2018-09-12 10:51:56 -04:00
itaicaspi-intel
a16d724963
removing some of the presets from the trace tests + more robust replay buffer loading
2018-09-12 15:26:16 +03:00
itaicaspi-intel
171fe97a3a
imitation related bug fixes
2018-09-12 15:26:16 +03:00
itaicaspi-intel
a9bd1047c4
load and save function for non-episodic replay buffers + carla improvements + network bug fixes
2018-09-12 15:26:16 +03:00
Itai Caspi
d59a700248
updated benchmarks for pong and breakout for dueling ddqn with PER
2018-09-06 14:05:46 +03:00
Gal Leibovich
08a557bfd1
updated the benchmarks for space invaders with dueling ddqn variants
2018-09-06 12:13:49 +03:00
Itai Caspi
72a1d9d426
Itaicaspi/episode reset refactoring ( #105 )
...
* reordering of the episode reset operation and allowing to store episodes only when they are terminated
* reordering of the episode reset operation and allowing to store episodes only when they are terminated
* revert tensorflow-gpu to 1.9.0 + bug fix in should_train()
* tests readme file and refactoring of policy optimization agent train function
* Update README.md
* Update README.md
* additional policy optimization train function simplifications
* Updated the traces after the reordering of the environment reset
* docker and jenkins files
* updated the traces to the ones from within the docker container
* updated traces and added control suite to the docker
* updated jenkins file with the intel proxy + updated doom basic a3c test params
* updated line breaks in jenkins file
* added a missing line break in jenkins file
* refining trace tests ignored presets + adding a configurable beta entropy value
* switch the order of trace and golden tests in jenkins + fix golden tests processes not killed issue
* updated benchmarks for dueling ddqn breakout and pong
* allowing dynamic updates to the loss weights + bug fix in episode.update_returns
* remove docker and jenkins file
2018-09-04 15:07:54 +03:00
Shadi Endrawis
7086492127
parallel trace tests fix
2018-09-03 20:47:10 +03:00
itaicaspi-intel
2c62a40466
bug fix in dueling network + revert to TF 1.6 for CPU due to requirements compatibility issues
2018-09-02 13:38:16 +03:00
Itai Caspi
3a399d1361
Tensorflow 1.10 and python 3.6 ( #104 )
...
* updating setup.py to install tensorflow 1.10 both on cpu and on gpu
* allow python 3.6
2018-09-02 10:12:00 +03:00
Gal Leibovich
5aca3a5ed1
Update README.md
2018-08-30 23:33:44 +03:00
Itai Caspi
55c3034f4d
Update README.md
2018-08-30 23:25:10 +03:00
Itai Caspi
e5526b98f8
Update README.md
2018-08-30 22:58:37 +03:00
Gal Leibovich
d862a3be83
rainbow dqn hyper-parameter updates
2018-08-30 20:41:38 +03:00
Shadi Endrawis
07db625987
Running trace tests in parallel + other small fixes
2018-08-30 19:35:10 +03:00
Gal Leibovich
ebe574e463
add missing hidden layer in rainbow_q_head
2018-08-30 19:34:27 +03:00