coach

gryf/coach

mirror of https://github.com/gryf/coach.git synced 2026-07-07 18:06:31 +02:00

Files

T

Gal Leibovich 310d31c227 integration test changes to reach the train part (#254 )

* integration test changes to override heatup to 1000 steps +  run each preset for 30 sec (to make sure we reach the train part)

* fixes to failing presets uncovered with this change + changes in the golden testing to properly test BatchRL

* fix for rainbow dqn

* fix to gym_environment (due to a change in Gym 0.12.1) + fix for rainbow DQN + some bug-fix in utils.squeeze_list

* fix for NEC agent

2019-03-27 21:14:19 +02:00

doom

pre-release 0.10.0

2018-08-13 17:11:34 +03:00

mujoco

removing datasets + imports optimization

2018-08-27 10:54:11 +03:00

toy_problems

removing datasets + imports optimization

2018-08-27 10:54:11 +03:00

__init__.py

pre-release 0.10.0

2018-08-13 17:11:34 +03:00

carla_environment.py

Adding target reward and target sucess (#58 )

2018-11-12 15:03:43 -08:00

CarlaSettings.ini

pre-release 0.10.0

2018-08-13 17:11:34 +03:00

control_suite_environment.py

update of api docstrings across coach and tutorials [WIP] (#91 )

2018-11-15 15:00:13 +02:00

doom_environment.py

update of api docstrings across coach and tutorials [WIP] (#91 )

2018-11-15 15:00:13 +02:00

environment_interface.py

removing datasets + imports optimization

2018-08-27 10:54:11 +03:00

environment.py

more clear names for methods of Space (#181 )

2019-01-14 15:02:53 -05:00

gym_environment.py

integration test changes to reach the train part (#254 )

2019-03-27 21:14:19 +02:00

README.md

pre-release 0.10.0

2018-08-13 17:11:34 +03:00

starcraft2_environment.py

Adding target reward and target sucess (#58 )

2018-11-12 15:03:43 -08:00

README.md

A custom environment implementation should look like this:

from coach.filters.input_filter import InputFilter

class CustomFilter(InputFilter):
  def __init__(self):
    ...
  def _filter(self, env_response: EnvResponse) -> EnvResponse:
    ...
  def _get_filtered_observation_space(self, input_observation_space: ObservationSpace) -> ObservationSpace:
    ...
  def _get_filtered_reward_space(self, input_reward_space: RewardSpace) -> RewardSpace:
    ...
  def _validate_input_observation_space(self, input_observation_space: ObservationSpace):
    ...
  def _reset(self):
    ...