mirror of
https://github.com/gryf/coach.git
synced 2025-12-17 19:20:19 +01:00
Itaicaspi/episode reset refactoring (#105)
* reordering of the episode reset operation and allowing to store episodes only when they are terminated * reordering of the episode reset operation and allowing to store episodes only when they are terminated * revert tensorflow-gpu to 1.9.0 + bug fix in should_train() * tests readme file and refactoring of policy optimization agent train function * Update README.md * Update README.md * additional policy optimization train function simplifications * Updated the traces after the reordering of the environment reset * docker and jenkins files * updated the traces to the ones from within the docker container * updated traces and added control suite to the docker * updated jenkins file with the intel proxy + updated doom basic a3c test params * updated line breaks in jenkins file * added a missing line break in jenkins file * refining trace tests ignored presets + adding a configurable beta entropy value * switch the order of trace and golden tests in jenkins + fix golden tests processes not killed issue * updated benchmarks for dueling ddqn breakout and pong * allowing dynamic updates to the loss weights + bug fix in episode.update_returns * remove docker and jenkins file
This commit is contained in:
@@ -141,6 +141,7 @@ class AlgorithmParameters(Parameters):
|
||||
self.rate_for_copying_weights_to_target = 1.0
|
||||
self.load_memory_from_file_path = None
|
||||
self.collect_new_data = True
|
||||
self.store_transitions_only_when_episodes_are_terminated = False
|
||||
|
||||
# HRL / HER related params
|
||||
self.in_action_space = None
|
||||
|
||||
Reference in New Issue
Block a user