Itaicaspi/episode reset refactoring (#105)

gryf/coach

mirror of https://github.com/gryf/coach.git synced 2026-02-15 05:25:55 +01:00

* reordering of the episode reset operation and allowing to store episodes only when they are terminated

* reordering of the episode reset operation and allowing to store episodes only when they are terminated

* revert tensorflow-gpu to 1.9.0 + bug fix in should_train()

* tests readme file and refactoring of policy optimization agent train function

* Update README.md

* Update README.md

* additional policy optimization train function simplifications

* Updated the traces after the reordering of the environment reset

* docker and jenkins files

* updated the traces to the ones from within the docker container

* updated traces and added control suite to the docker

* updated jenkins file with the intel proxy + updated doom basic a3c test params

* updated line breaks in jenkins file

* added a missing line break in jenkins file

* refining trace tests ignored presets + adding a configurable beta entropy value

* switch the order of trace and golden tests in jenkins + fix golden tests processes not killed issue

* updated benchmarks for dueling ddqn breakout and pong

* allowing dynamic updates to the loss weights + bug fix in episode.update_returns

* remove docker and jenkins file

Itai Caspi

2018-09-04 15:07:54 +03:00

committed by

GitHub

parent 7086492127

commit 72a1d9d426

92 changed files with 9803 additions and 9740 deletions

									
rl_coach/base_parameters.py

@@ -141,6 +141,7 @@ class AlgorithmParameters(Parameters):
         self.rate_for_copying_weights_to_target = 1.0
         self.load_memory_from_file_path = None
         self.collect_new_data = True
+        self.store_transitions_only_when_episodes_are_terminated = False
 
         # HRL / HER related params
         self.in_action_space = None
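
The hunk above only adds the flag's default value; the code that reads it lives in files not shown here. As a rough illustration of what "store episodes only when they are terminated" could mean in practice, here is a minimal sketch. Only the flag name `store_transitions_only_when_episodes_are_terminated` comes from the diff. `Transition`, `SketchMemory`, `SketchAgent`, and `observe` are hypothetical names invented for this example and are not rl_coach's actual classes or methods.

```python
from dataclasses import dataclass, field
from typing import Any, List


@dataclass
class Transition:
    """Hypothetical single environment step (not rl_coach's Transition)."""
    state: Any
    action: Any
    reward: float
    game_over: bool


@dataclass
class SketchMemory:
    """Hypothetical stand-in for a replay memory."""
    transitions: List[Transition] = field(default_factory=list)

    def store(self, transition: Transition) -> None:
        self.transitions.append(transition)

    def store_episode(self, episode: List[Transition]) -> None:
        self.transitions.extend(episode)


class SketchAgent:
    """Assumed behaviour: the flag decides when transitions reach memory."""

    def __init__(self, store_transitions_only_when_episodes_are_terminated: bool = False):
        # Flag name taken from the diff; everything else is illustrative.
        self.store_only_terminated = store_transitions_only_when_episodes_are_terminated
        self.memory = SketchMemory()
        self.current_episode: List[Transition] = []

    def observe(self, transition: Transition) -> None:
        self.current_episode.append(transition)
        if not self.store_only_terminated:
            # Default (False): every transition is stored as soon as it is seen.
            self.memory.store(transition)
        elif transition.game_over:
            # Flag set: the buffered episode is stored only once it terminates.
            self.memory.store_episode(self.current_episode)
        if transition.game_over:
            # Start a fresh episode buffer after termination.
            self.current_episode = []


agent = SketchAgent(store_transitions_only_when_episodes_are_terminated=True)
agent.observe(Transition(state=0, action=1, reward=0.0, game_over=False))
agent.observe(Transition(state=1, action=0, reward=1.0, game_over=True))
```

With the flag set, an episode that is cut short without a terminal transition never reaches the memory in this sketch. That is one plausible motivation for such a switch, though the commit message does not state the reason.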
