1
0
mirror of https://github.com/gryf/coach.git synced 2025-12-17 11:10:20 +01:00

Commit Graph

  • 06de3b0f07 update LevelManager type signature zach dwiel 2019-04-04 14:54:54 -04:00
  • f16cd3cb1e remove unused ActionInfo.action_intrinsic_reward zach dwiel 2019-04-04 11:55:42 -04:00
  • 7d79433c05 remove unused parameter scale_external_reward_by_intrinsic_reward_value zach dwiel 2019-04-04 11:54:18 -04:00
  • 881f78f45a tests: new checkpoint mxnet test + fix utils (#273) anabwan 2019-04-07 07:36:44 +03:00
  • e1e335a4ef disabled Starcraft from nightly (#286) anabwan 2019-04-04 22:26:25 +03:00
  • 2291cee2c6 allow serializing from/to arrays/str from GlobalVariableSaver (#285) Zach Dwiel 2019-04-04 11:09:19 -04:00
  • cdb8d9e518 tests: fix multi environment variables in configci (#284) anabwan 2019-04-04 16:11:41 +03:00
  • f173e69187 introduce dockerfiles. (#169) Scott Leishman 2019-04-03 09:33:17 -07:00
  • 0b808f0794 remove -ept flag (#283) shadiendrawis 2019-04-03 16:32:24 +03:00
  • a543f10c1a fix Intel tensorflow installation issue (#281) shadiendrawis 2019-04-03 13:03:30 +03:00
  • 5d4b9c7399 added functional environments to CircleCI (#268) anabwan 2019-03-29 00:45:19 +02:00
  • 869bd421a3 tests: added new checkpoint and functional tests (#265) anabwan 2019-03-28 22:57:31 +02:00
  • 310d31c227 integration test changes to reach the train part (#254) Gal Leibovich 2019-03-27 21:14:19 +02:00
  • 6e08c55ad5 Enabling-more-agents-for-Batch-RL-and-cleanup (#258) Gal Leibovich 2019-03-21 16:10:29 +02:00
  • abec59f367 fixes to rainbow dqn + a cartpole based golden test (#253) Gal Leibovich 2019-03-21 12:57:56 +02:00
  • 83741fa92a tests: added function tests to nightly CircleCI (#252) anabwan 2019-03-21 00:39:22 +02:00
  • e3c7e526c7 Batch RL (#238) Gal Leibovich 2019-03-19 18:07:09 +02:00
  • 4a8451ff02 tests: added new tests + utils code improved (#221) anabwan 2019-03-18 11:21:43 +02:00
  • d6158a5cfc restoring from a checkpoint file (#247) Gal Leibovich 2019-03-17 16:28:09 +02:00
  • f03bd7ad93 benchmark update (#250) shadiendrawis 2019-03-17 15:33:28 +02:00
  • 537b549e1d fixed broken url in README (#246) Nikhil Barhate 2019-03-14 11:08:33 +05:30
  • 9c449507e0 update CARLA install docs to note python client. (#234) Scott Leishman 2019-03-13 22:21:44 -07:00
  • 8be9ea5dc9 Update setup.py (#245) Gal Leibovich 2019-03-12 11:08:10 +02:00
  • c02333b1ba fix dashboard to allow connections from a remote machine. (#231) Gal Leibovich 2019-03-10 13:15:14 +02:00
  • 9a895a1ac7 bug-fix for l2_regularization not in use (#230) Gal Leibovich 2019-03-03 15:11:06 +02:00
  • 10220be9be Adding support for evaluation only mode with predefined number of steps (#225) Gal Novik 2019-03-03 10:03:45 +02:00
  • 2c1a9dbf20 Adding framework for multinode tests (#149) Ajay Deshpande 2019-02-26 13:53:12 -08:00
  • b461a1b8ab readme fix (#228) shadiendrawis 2019-02-24 13:46:21 +02:00
  • 2b5d1dabe6 ACER algorithm (#184) shadiendrawis 2019-02-20 23:52:34 +02:00
  • 7253f511ed tests: added new setup configuration + test args (#211) anabwan 2019-02-13 07:43:59 -05:00
  • 9d0fed84a3 run nightly golden tests sequentially. (#217) Scott Leishman 2019-02-04 07:18:35 -08:00
  • b4fd1b3c93 Create CONTRIBUTING.md (#210) Gal Novik 2019-01-29 23:47:22 +02:00
  • 135f02fb46 wxPython removal (#207) Gal Novik 2019-01-23 20:49:37 +02:00
  • 516547e3df ensure nightly test env containers exist. (#205) Scott Leishman 2019-01-18 13:43:42 -08:00
  • bf0a65eefd remove redundant observe ignore (#168) Cody Hsieh 2019-01-17 14:08:05 -08:00
  • a048024bf5 Add nightly running of golden and trace tests. (#202) Scott Leishman 2019-01-17 11:52:50 -08:00
  • 8672f8b542 Fix golden tests (#199) Zach Dwiel 2019-01-16 20:38:11 -05:00
  • fedb4cbd7c Cleanup and refactoring (#171) Zach Dwiel 2019-01-15 03:04:53 -05:00
  • cd812b0d25 more clear names for methods of Space (#181) Zach Dwiel 2019-01-14 15:02:53 -05:00
  • 0ccc333d77 raise value error if there is an invalid action space (#179) Zach Dwiel 2019-01-13 04:06:48 -05:00
  • 053adf0ca9 prevent long job CI timeouts owing to lack of EKS token refresh (#183) Scott Leishman 2019-01-09 15:12:00 -08:00
  • 0fa9d8e602 Update README.md (#182) Gal Novik 2019-01-08 13:48:17 +02:00
  • 8a1ea3d915 Merge pull request #161 from x77a1/master Ajay Deshpande 2019-01-03 21:15:04 -08:00
  • b1e9ea48d8 Refactored GlobalVariableSaver Gourav Roy 2019-01-03 15:08:34 -08:00
  • 619ea0944e Avoid Memory Leak in Rollout worker Gourav Roy 2019-01-02 23:06:44 -08:00
  • c377363e50 Revert "Changes to avoid memory leak in rollout worker" Gourav Roy 2019-01-02 22:37:12 -08:00
  • 779d3694b4 Revert "comment out the part of test in 'test_basic_rl_graph_manager_with_cartpole_dqn_and_repeated_checkpoint_restore' that run in infinite loop" Gourav Roy 2019-01-02 22:36:05 -08:00
  • 6dd7ae2343 Revert "Avoid Memory Leak in Rollout worker" Gourav Roy 2019-01-02 22:35:06 -08:00
  • 2461892c9e Revert "Updated comments" Gourav Roy 2019-01-02 22:33:37 -08:00
  • 740f7937cd Updated comments Gourav Roy 2018-12-25 21:52:07 -08:00
  • 73c4c850a5 Merge branch 'master' into master x77a1 2018-12-25 21:05:41 -08:00
  • c694766fad Avoid Memory Leak in Rollout worker Gourav Roy 2018-12-25 20:50:34 -08:00
  • 56735624ca Merge pull request #160 from NervanaSystems/tf_version_bump Gal Novik 2018-12-25 10:51:58 +02:00
  • 85fae0f626 Merge branch 'master' into tf_version_bump Gal Novik 2018-12-24 15:50:55 +02:00
  • d7c138342b Merge pull request #170 from NervanaSystems/ci_badge Gal Novik 2018-12-24 14:39:38 +02:00
  • 0823d30839 Merge branch 'master' into tf_version_bump Scott Leishman 2018-12-21 10:58:41 -05:00
  • 7cda5179c6 add CI status badge. Scott Leishman 2018-12-21 10:50:14 -05:00
  • 8e3ee818f8 update circle ci config to match new golden test presets (#167) Zach Dwiel 2018-12-21 10:10:31 -05:00
  • 02f2db1264 Merge branch 'master' into master x77a1 2018-12-17 12:44:27 -08:00
  • 4c914c057c fix for finding the right filter checkpoint to restore + do not update internal filter state when evaluating + fix SharedRunningStats checkpoint filenames (#147) Gal Leibovich 2018-12-17 21:36:27 +02:00
  • b4bc8a476c Bug fix: when enabling 'heatup_using_network_decisions', we should add the configured noise (#162) Neta Zmora 2018-12-17 10:08:54 +02:00
  • b8d21c73bf comment out the part of test in 'test_basic_rl_graph_manager_with_cartpole_dqn_and_repeated_checkpoint_restore' that run in infinite loop gouravr 2018-12-16 10:56:00 -08:00
  • 1f0980c448 Merge branch 'master' into master x77a1 2018-12-16 09:37:00 -08:00
  • f9ee526536 Fix for issue #128 - circular DQN import (#130) Gal Leibovich 2018-12-16 16:06:44 +02:00
  • 801aed5e10 Changes to avoid memory leak in rollout worker gouravr 2018-12-15 12:26:31 -08:00
  • aa1dfd7599 Bump intel optimized tensorflow to 1.12.0 Scott Leishman 2018-12-14 10:15:19 -05:00
  • e08accdc22 allow case insensitive selected level name matching zach dwiel 2018-12-10 19:43:57 -05:00
  • d0248e03c6 add meaningful error message in the event that the action space is not one that can be used (#151) Zach Dwiel 2018-12-11 02:09:24 -05:00
  • f12857a8c7 Docs changes - fixing blogpost links, removing importing all exploration policies (#139) Gal Leibovich 2018-12-05 23:16:16 +02:00
  • 155b78b995 Fix warning on import TF or MxNet, when only one of the frameworks is installed (#140) Sina Afrooze 2018-12-05 01:52:24 -08:00
  • 9e66bb653e Enable creating custom tensorflow heads, embedders, and middleware. (#135) Ryan Peach 2018-12-05 04:40:06 -05:00
  • 3c58ed740b 'CompositeAgent' object has no attribute 'handle_episode_ended' (#136) Ryan Peach 2018-12-05 04:28:16 -05:00
  • 436b16016e Added num_transitions to Memory interface (#137) Ryan Peach 2018-12-05 03:33:25 -05:00
  • 3e281b467b Update docs_raw README.md (#138) Gal Leibovich 2018-12-03 15:39:17 +02:00
  • 28e5b8b612 Minor bugfix on RewardFilter in Readme (#133) Ryan Peach 2018-11-30 19:02:08 -05:00
  • 3e67eac9e6 Merge pull request #131 from ryanpeach/patch-2 Scott Leishman 2018-11-30 15:55:34 -08:00
  • f678ae7cb8 NoOutputFilter isn't set in tutorial. Ryan Peach 2018-11-29 17:50:50 -05:00
  • 0dd39b20ca Removing badge Ajay Deshpande 2018-11-28 09:59:08 -08:00
  • 15fabf6ec3 Removing badge Ajay Deshpande 2018-11-28 09:19:32 -08:00
  • 533bb43720 Merge pull request #125 from NervanaSystems/0.11.0-release Gal Novik 2018-11-28 01:16:01 +02:00
  • e877920dd5 Merge pull request #126 from NervanaSystems/ci_updates Ajay Deshpande 2018-11-27 14:58:26 -08:00
  • 3601d9bc45 CI related updates Scott Leishman 2018-11-27 21:53:46 +00:00
  • 4e0d018d5f updated algorithms image in README Gal Novik 2018-11-27 23:12:13 +02:00
  • fc6604c09c added missing license headers Gal Novik 2018-11-27 22:43:40 +02:00
  • 1e618647ab adding .nojekyll file for github pages to function properly Gal Novik 2018-11-27 22:35:16 +02:00
  • 7e3aca22eb Documentation fix Gal Novik 2018-11-27 22:32:46 +02:00
  • 05c1005e94 Updated README and added .nojekyll file for github pages to work properly Gal Novik 2018-11-27 22:11:28 +02:00
  • d06197f663 Add documentation on distributed Coach. (#158) Balaji Subramaniam 2018-11-27 02:26:15 -08:00
  • e3ecf445e2 ensure we pull from main coach container layers as cache. (#106) Scott Leishman 2018-11-26 17:09:02 -08:00
  • 5674749ed5 workaround for resolving the issue of restoring a multi-node training checkpoint to single worker (#156) Gal Leibovich 2018-11-26 00:08:43 +02:00
  • ab10852ad9 hacky way to resolve the checkpointing issue (#154) Gal Leibovich 2018-11-25 16:14:15 +02:00
  • 11170d5ba3 fix dist. tf (#153) Gal Leibovich 2018-11-25 14:02:24 +02:00
  • 19a68812f6 Added ONNX compatible broadcast_like function (#152) Sina Afrooze 2018-11-25 01:23:18 -08:00
  • 8df425b6e1 Update how save checkpoint secs arg is handled in distributed Coach. (#151) Balaji Subramaniam 2018-11-25 00:05:24 -08:00
  • de9b707fe1 Changed run_multiple_seeds to support mxnet. And fix other bugs. (#122) Thom Lane 2018-11-24 22:33:09 -08:00
  • 77fb561668 Added code to fall back to CPU if GPU not available. (#150) Sina Afrooze 2018-11-24 22:32:26 -08:00
  • 7d25477942 Add observation_space_type to GymEnvironmentParameters so that it is possible to explicitly state that in presets. (#145) Sina Afrooze 2018-11-24 21:11:48 -08:00
  • bf2036b284 S3 optimization - save only the latest checkpoint. (#148) Balaji Subramaniam 2018-11-23 22:17:36 -08:00
  • 13d2679af4 Sync experiment dir, videos, gifs to S3. (#147) Balaji Subramaniam 2018-11-23 20:52:12 -08:00
  • 5332013bd1 Implement frame-work agnostic rollout and training workers (#137) Sina Afrooze 2018-11-23 18:05:44 -08:00