1
0
mirror of https://github.com/gryf/coach.git synced 2025-12-18 03:30:19 +01:00
Commit Graph

38 Commits

Author SHA1 Message Date
shadiendrawis
5ad5a58350 fix atari stack overflow (#412) 2019-10-06 18:14:21 +03:00
Gal Leibovich
c7949d7011 Fix Atari Schedule Heatup 2019-09-08 16:57:38 +03:00
Gal Leibovich
587b74e04a Remove double call to reset_internal_state() on gym environments (#364) 2019-07-02 13:43:23 +03:00
shadiendrawis
8e812ef82f Coach as a library (#348)
* CoachInterface + tutorial

* Some improvements and typo fixes

* merge tutorial 0 and 4

* typo fix + additional tutorial changes

* tutorial changes

* added reading signals and experiment path argument
2019-06-19 18:05:03 +03:00
anabwan
342b7184bc Enabling Coach Documentation to be run even when environments are not installed (#326) 2019-05-27 10:46:07 +03:00
anabwan
b567091d2e removed timestep_limit due to gym version upgrade (#325)
* removed timestep_limit due to gym version update

* removed _past_limit wrapper
2019-05-26 13:58:16 +03:00
Gal Leibovich
9f625c197b fix for fetch rendering (#297)
* fix for fetch rendering - removing code which was once required with older gym versions. images are now rendered correctly by default with the latest gym.

* fixing mujoco camera id failure
2019-04-21 17:37:14 +03:00
Gal Leibovich
310d31c227 integration test changes to reach the train part (#254)
* integration test changes to override heatup to 1000 steps +  run each preset for 30 sec (to make sure we reach the train part)

* fixes to failing presets uncovered with this change + changes in the golden testing to properly test BatchRL

* fix for rainbow dqn

* fix to gym_environment (due to a change in Gym 0.12.1) + fix for rainbow DQN + some bug-fix in utils.squeeze_list

* fix for NEC agent
2019-03-27 21:14:19 +02:00
Zach Dwiel
cd812b0d25 more clear names for methods of Space (#181)
* rename Space.val_matches_space_definition -> contains; Space.is_point_in_space_shape -> valid_index
* rename valid_index -> is_valid_index
2019-01-14 15:02:53 -05:00
Zach Dwiel
0ccc333d77 raise value error if there is an invalid action space (#179) 2019-01-13 11:06:48 +02:00
zach dwiel
e08accdc22 allow case insensitive selected level name matching 2018-12-11 12:35:30 -05:00
Sina Afrooze
7d25477942 Add observation_space_type to GymEnvironmentParameters so that it is possible to explicitly state that in presets. (#145) 2018-11-25 07:11:48 +02:00
Thom Lane
949d91321a Added explicit environment closing (#129) 2018-11-22 14:25:03 +02:00
Sina Afrooze
67a90ee87e Add tensor input type for arbitrary dimensional observation (#125)
* Allow arbitrary dimensional observation (non vector or image)
* Added creating PlanarMapsObservationSpace to GymEnvironment when number of channels is not 1 or 3
2018-11-19 16:41:12 +02:00
Gal Novik
e1fa6e9681 roboschool: updating envs to v1, fixing rendering (#112) 2018-11-18 13:38:10 +02:00
Itai Caspi
6d40ad1650 update of api docstrings across coach and tutorials [WIP] (#91)
* updating the documentation website
* adding the built docs
* update of api docstrings across coach and tutorials 0-2
* added some missing api documentation
* New Sphinx based documentation
2018-11-15 15:00:13 +02:00
Scott Leishman
524f8436a2 create per environment Dockerfiles. (#70)
* create per environment Dockerfiles.

Adjust CI setup to better parallelize runs.
Fix a couple of issues in golden and trace tests.
Update a few of the docs.

* bugfix in mmc agent.

Also install kubectl for CI, update badge branch.

* remove integration test parallelism.
2018-11-14 07:40:22 -08:00
Ajay Deshpande
875d6ef017 Adding target reward and target sucess (#58)
* Adding target reward

* Adding target successs

* Addressing comments

* Using custom_reward_threshold and target_success_rate

* Adding exit message

* Moving success rate to environment

* Making target_success_rate optional
2018-11-12 15:03:43 -08:00
Leo Dirac
8f0415b4cc Tweak additional_simulator_parameters for easier configuration and better error logging. (#69) 2018-11-07 11:01:12 -08:00
Itai Caspi
35c477c922 allowing grayscale observations in gym (#66)
* allowing grayscale observations in gym
2018-11-07 17:08:10 +02:00
Gal Leibovich
5a8da90d32 bug-fix for dumping movies (+ small refactoring and rename 'VideoDumpMethod -> 'VideoDumpFilter') 2018-10-21 17:29:10 +03:00
Shadi Endrawis
51726a5b80 network_imporvements branch merge 2018-10-02 13:43:36 +03:00
Gal Novik
abaa58b559 human agent will exit when human control not supported by environment; jupyter notebooks fixes 2018-09-17 16:00:00 +03:00
itaicaspi-intel
bb76c5c726 CARLA cleanups + calculating the distance to goal 2018-09-16 16:37:04 +03:00
itaicaspi-intel
23a9f00e28 fix for human control 2018-09-16 12:43:15 +03:00
itaicaspi-intel
cf892463e2 updated CARLA to allow using actions of size 3 + automatic downloading of the CARLA imitation dataset 2018-09-16 12:07:11 +03:00
itaicaspi-intel
f8d3574b8c updated CARLA to allow the usage of predefined experiment suites 2018-09-14 18:07:24 +03:00
itaicaspi-intel
e8a2b679d1 using the CoRL2017 experiment suite for CARLA_CIL 2018-09-13 16:59:22 +03:00
itaicaspi-intel
607ef17431 added a simple progress bar implementation 2018-09-13 14:21:38 +03:00
itaicaspi-intel
fa79d8d365 Carla updates 2018-09-13 11:47:36 +03:00
itaicaspi-intel
171fe97a3a imitation related bug fixes 2018-09-12 15:26:16 +03:00
itaicaspi-intel
a9bd1047c4 load and save function for non-episodic replay buffers + carla improvements + network bug fixes 2018-09-12 15:26:16 +03:00
itaicaspi-intel
fd2f4b0852 bug fix in HRL HER memory + some small improvements 2018-08-29 14:36:18 +03:00
Gal Leibovich
1aa2ab0590 parameter noise exploration - using Noisy Nets 2018-08-27 18:19:01 +03:00
itaicaspi-intel
658b437079 removing datasets + imports optimization 2018-08-27 10:54:11 +03:00
Shadi Endrawis
3abb6cd415 Trace tests update 2018-08-20 13:01:30 +03:00
Gal Leibovich
ab5a81c7ee fix for dumping movies, without rendering, for pendulum_with_goals 2018-08-14 18:13:44 +03:00
Gal Novik
19ca5c24b1 pre-release 0.10.0 2018-08-13 17:11:34 +03:00