Guy Jacob
a1a2e67fbd
logging screen output to file ( #479 )
...
Co-authored-by: Gal Leibovich <gal.leibovich@intel.com>
2021-05-06 18:02:27 +03:00
Zach Dwiel
7b0fccb041
Add RedisDataStore ( #295 )
...
* GraphManager.set_session also sets self.sess
* make sure that GraphManager.fetch_from_worker uses training phase
* remove unnecessary phase setting in training worker
* reorganize rollout worker
* provide default name to GlobalVariableSaver.__init__ since it isn't really used anyway
* allow dividing TrainingSteps and EnvironmentSteps
* add timestamps to the log
* added redis data store
* conflict merge fix
2019-08-28 21:15:58 +03:00
shadiendrawis
8e812ef82f
Coach as a library ( #348 )
...
* CoachInterface + tutorial
* Some improvements and typo fixes
* merge tutorial 0 and 4
* typo fix + additional tutorial changes
* tutorial changes
* added reading signals and experiment path argument
2019-06-19 18:05:03 +03:00
Timo Kaufmann
8df3c46756
Do not hardcode path to bash ( #332 )
2019-06-10 20:10:28 +03:00
Ajay Deshpande
33dc29ee99
Uploading checkpoint if crd provided ( #191 )
...
* Uploading checkpoint if crd provided
* Changing the calculation of total steps because of a recent change in core_types
Fixes #195
2019-04-26 12:27:33 -07:00
Gal Leibovich
4741b0b916
BCQ variant on top of DDQN ( #276 )
...
* kNN based model for predicting which actions to drop
* fix for seeds with batch rl
2019-04-16 17:06:23 +03:00
shadiendrawis
0b808f0794
remove -ept flag ( #283 )
2019-04-03 16:32:24 +03:00
Gal Leibovich
d6158a5cfc
restoring from a checkpoint file ( #247 )
2019-03-17 16:28:09 +02:00
Gal Novik
10220be9be
Adding support for evaluation only mode with predefined number of steps ( #225 )
2019-03-03 10:03:45 +02:00
Ajay Deshpande
2c1a9dbf20
Adding framework for multinode tests ( #149 )
...
* Currently runs CartPole_ClippedPPO and Mujoco_ClippedPPO with inverted_pendulum level.
2019-02-26 13:53:12 -08:00
anabwan
7253f511ed
tests: added new setup configuration + test args ( #211 )
...
- added utils for future tests and conftest
- added test args
2019-02-13 07:43:59 -05:00
Gal Leibovich
f12857a8c7
Docs changes - fixing blogpost links, removing importing all exploration policies ( #139 )
...
* updated docs
* removing imports for all exploration policies in __init__ + setting the right blog-post link
* small cleanups
2018-12-05 16:16:16 -05:00
Balaji Subramaniam
8df425b6e1
Update how save checkpoint secs arg is handled in distributed Coach. ( #151 )
2018-11-25 00:05:24 -08:00
Balaji Subramaniam
13d2679af4
Sync experiment dir, videos, gifs to S3. ( #147 )
2018-11-23 20:52:12 -08:00
Ajay Deshpande
4a6c404070
Adding worker logs and plumbed task_parameters to distributed coach ( #130 )
2018-11-23 15:35:11 -08:00
Gal Leibovich
2b4c9c6774
Removing graph_manager param ( #141 )
2018-11-23 11:42:54 -08:00
Thom Lane
949d91321a
Added explicit environment closing ( #129 )
2018-11-22 14:25:03 +02:00
Gal Leibovich
a112ee69f6
Save filters' internal state ( #127 )
...
* save filters internal state
* moving the restore to be made from within NumpyRunningStats
2018-11-20 17:21:48 +02:00
Thom Lane
9210909050
Added MXNet to arg docs. ( #121 )
2018-11-19 11:31:28 +02:00
Gal Leibovich
d4d06aaea6
remove kubernetes dependency ( #117 )
2018-11-18 18:10:22 +02:00
Gal Leibovich
6caf721d1c
Numpy shared running stats ( #97 )
2018-11-18 14:46:40 +02:00
Gal Leibovich
9fd4d55623
Making stop condition optional by using a flag ( #113 )
...
* apply stop condition flag (default: ignore the stop condition)
2018-11-18 13:37:39 +02:00
Balaji Subramaniam
dea1826658
Re-enable NFS data store. ( #101 )
2018-11-16 13:55:33 -08:00
Ajay Deshpande
fde73ced13
Simulating the act on the trainer. ( #65 )
...
* Remove the use of daemon threads for Redis subscribe.
* Emulate act and observe on trainer side to update internal vars.
2018-11-15 08:38:58 -08:00
Itai Caspi
6d40ad1650
update of api docstrings across coach and tutorials [WIP] ( #91 )
...
* updating the documentation website
* adding the built docs
* update of api docstrings across coach and tutorials 0-2
* added some missing api documentation
* New Sphinx based documentation
2018-11-15 15:00:13 +02:00
Itai Caspi
0fe583186e
fixing the coach entrypoint after adding the CoachLauncher abstraction ( #92 )
2018-11-12 10:26:49 -08:00
Leo Dirac
2804a7c24f
Refactor launcher to be object-oriented ( #63 )
...
* Import of annoy library uses failed_import mechanism.
2018-11-10 22:10:19 +02:00
Itai Caspi
83e0b09a6a
adding the missing export_onnx_graph parameter to task parameters ( #73 )
2018-11-08 12:52:42 +02:00
Itai Caspi
e7a91b4dc3
Fix cmd line arguments handling ( #68 )
...
* refactoring the merging of the task parameters and the command line parameters
* removing some unused command line arguments
* fix for saving checkpoints when not passing through coach.py
2018-11-07 15:47:02 +02:00
Itai Caspi
811152126c
Export graph to ONNX ( #61 )
...
Implements the ONNX graph exporting feature.
Currently does not work for NAF, C51 and A3C_LSTM due to unsupported TF layers in the tf2onnx library.
2018-11-06 10:55:21 +02:00
Leo Dirac
d75df17d97
Modifying ScreenLogger to optionally not output color codes ( #56 )
...
* Modifying ScreenLogger to not output color when configured by new CLI parameter
2018-11-05 15:25:49 -08:00
Balaji Subramaniam
7e7006305a
Integrate coach.py params with distributed Coach. ( #42 )
...
* Integrate coach.py params with distributed Coach.
* Minor improvements
- Use enums instead of constants.
- Reduce code duplication.
- Ask experiment name with timeout.
2018-11-05 09:33:30 -08:00
Sina Afrooze
95b4fc6888
Added ability to switch between tensorflow and mxnet using -f commandline argument. ( #48 )
...
NOTE: tensorflow framework works fine if mxnet is not installed in env, but mxnet will not work if tensorflow is not installed because of the code in network_wrapper.
2018-10-30 15:29:34 -07:00
Thom Lane
324c67d614
Bug fix: Removed reference to args which is out of scope. Conditioning now performed one level above. ( #54 )
2018-10-29 22:29:22 -07:00
zach dwiel
f835ac902c
fix renaming: save_checkpoint_sec -> checkpoint_save_secs
2018-10-24 10:52:18 -04:00
Zach Dwiel
700a175902
rename save_checkpoint_secs -> checkpoint_save_secs
2018-10-23 17:10:58 -04:00
Zach Dwiel
9804b033a2
rename save_checkpoint_dir -> checkpoint_save_dir
2018-10-23 17:10:58 -04:00
Zach Dwiel
6541bc76b9
working checkpoints
2018-10-23 16:41:57 -04:00
Zach Dwiel
3714d8ec80
extract functions display_all_presets_and_exit, expand_preset
2018-10-23 16:40:33 -04:00
Zach Dwiel
5758c2f23e
typo; increased detail in comment
2018-10-23 16:35:06 -04:00
Shadi Endrawis
51726a5b80
network_imporvements branch merge
2018-10-02 13:43:36 +03:00
itaicaspi-intel
fd2f4b0852
bug fix in HRL HER memory + some small improvements
2018-08-29 14:36:18 +03:00
Gal Novik
19ca5c24b1
pre-release 0.10.0
2018-08-13 17:11:34 +03:00