1
0
mirror of https://github.com/gryf/coach.git synced 2025-12-17 11:10:20 +01:00

Commit Graph

  • dea46ae0d2 Update 4. Batch Reinforcement Learning.ipynb master Gal Novik 2021-06-28 10:40:53 +03:00
  • 0633c32805 Disable nightly tests Guy Jacob 2021-06-03 09:07:39 +03:00
  • 0896f43097 Robosuite exploration (#478) shadiendrawis 2021-06-01 00:34:19 +03:00
  • 235a259223 Add Flatten layer to architectures + make flatten optional in embedders (#483) Guy Jacob 2021-05-12 11:11:10 +03:00
  • c369984c2e Update setuptools version in Dockerfile.base (#482) Guy Jacob 2021-05-09 09:34:33 +03:00
  • ba20396f63 Update Pillow version (#481) Guy Jacob 2021-05-09 09:29:48 +03:00
  • a1a2e67fbd logging screen output to file (#479) Guy Jacob 2021-05-06 18:02:27 +03:00
  • 9106b69227 Add is_on_policy property to agents (#480) Guy Jacob 2021-05-06 18:02:02 +03:00
  • 06bacd9de0 Fix Rust compiler build error (Kubernetes dependency) (#471) Guy Jacob 2021-02-09 15:54:44 +02:00
  • f52ff1784d Fix breaking change from minio update (#469) Guy Jacob 2020-12-15 10:02:16 +02:00
  • 59e08034c6 Update README.md Gal Novik 2020-11-09 10:25:05 +02:00
  • 57e809c094 Docs updates following github repo change Gal Novik 2020-11-08 11:54:38 +02:00
  • bc65f1f5fb Pin Vizdoom version - one more location (#468) Guy Jacob 2020-11-04 11:37:35 +02:00
  • 4318fea436 Update requirements.txt (#466) Gal Novik 2020-11-04 09:44:30 +02:00
  • fd765e7e38 Pin Vizdoom version (#467) Guy Jacob 2020-11-03 21:28:25 +02:00
  • 103d4477eb Disable NumPy and TF2 related warnings (#463) Guy Jacob 2020-09-24 15:11:45 +03:00
  • c9738280fd Require Python 3.6 + Changes to CI configuration (#452) Gal Novik 2020-07-26 16:11:22 +03:00
  • a6689b6036 Update cluster name in .circleci/config.yml (now all locations) Guy Jacob 2020-06-24 16:18:49 +03:00
  • 6658bfa429 Update cluster name in .circleci/config.yml Guy Jacob 2020-06-24 15:24:41 +03:00
  • f3ce685cb1 Upgrading Pillow version due to security vulnerability (#444) Gal Novik 2020-04-22 20:52:24 +03:00
  • 79b05a8105 Wolpertinger preset failure fix (#434) Gal Novik 2020-01-14 16:26:38 +02:00
  • 525a22cb5b Roll-back bokeh to version 1.0.4 (#431) Dan Elbaz 2019-12-23 09:33:53 +02:00
  • 0867d8d0fb Fixed typo: Nerual -> Neural (#425) Brian Broll 2019-11-16 13:13:24 -06:00
  • 188b86369a fix e-greedy in case action values were equal (#423) shadiendrawis 2019-11-10 17:20:44 +02:00
  • 6ca91b9090 add reset internal state to rollout worker (#421) shadiendrawis 2019-11-03 14:42:51 +02:00
  • e288a552dd Update requirements.txt (#422) Gal Leibovich 2019-10-28 18:30:48 +02:00
  • 66fada7f78 Remove assertion from BatchRLGraphManager Gal Leibovich 2019-10-22 11:54:14 +03:00
  • 6db695ad8a freeze tensorflow version to <= 1.14.0 (#416) shadiendrawis 2019-10-10 17:47:25 +03:00
  • 5ad5a58350 fix atari stack overflow (#412) shadiendrawis 2019-10-06 18:14:21 +03:00
  • 0a712ecc94 Fix numpy shared running stats to support images (#411) shadiendrawis 2019-10-06 12:16:38 +03:00
  • 79a4161eca Workaround for dumping gifs through the Python API (#405) Gal Leibovich 2019-09-26 12:21:25 +03:00
  • 9e82c06be3 importing heads parameters from the correct file on tutorial #1 (#403) Pi Esposito 2019-09-24 14:44:49 -03:00
  • 34bc292e60 Limiting intel-tensorflow version to 1.13.1 to re-enable CI; Updating nightly schedule to run on Saturdays as well Gal Novik 2019-09-23 12:52:00 +03:00
  • 0704260b5d Updating EKS cluster name Gal Novik 2019-09-20 16:12:35 +03:00
  • b5d66c0942 Removing CARLA docker file from README (#402) Gal Novik 2019-09-16 07:17:58 +03:00
  • c7949d7011 Fix Atari Schedule Heatup Gal Leibovich 2019-09-08 16:57:38 +03:00
  • 13a4a09f72 removing weekly tests (#398) Gal Novik 2019-09-08 14:04:24 +03:00
  • 138ced23ba RL in Large Discrete Action Spaces - Wolpertinger Agent (#394) Gal Leibovich 2019-09-08 12:53:49 +03:00
  • fc50398544 typo fix (#396) shadiendrawis 2019-09-04 12:40:23 +03:00
  • 7b0fccb041 Add RedisDataStore (#295) Zach Dwiel 2019-08-28 14:15:58 -04:00
  • 34e1c04f29 further CI cluster name updates. (#387) Scott Leishman 2019-08-06 00:18:08 -07:00
  • 92460736bc Updated tutorial and docs (#386) Gal Novik 2019-08-05 16:46:15 +03:00
  • c1d1fae342 Distiller's AMC induced changes (#359) Gal Leibovich 2019-08-05 10:24:58 +03:00
  • 7df67dafa3 update to point at new CI cluster. (#385) Scott Leishman 2019-08-04 03:55:04 -07:00
  • 2697142d5a Release 1.0.0 (#382) Gal Novik 2019-07-24 16:10:58 +03:00
  • 718597ce9a Fixes to Batch RL tutorial (#378) Gal Leibovich 2019-07-16 11:22:42 +03:00
  • 0a4cc7e081 Additional cmd line examples (#377) Gal Novik 2019-07-15 12:32:59 +03:00
  • 19ad2d60a7 Batch RL Tutorial (#372) Gal Leibovich 2019-07-14 18:43:48 +03:00
  • b82414138d Workaround the OSError due to bad address failure on the CI runs (#370) Gal Novik 2019-07-07 17:11:19 +03:00
  • 587b74e04a Remove double call to reset_internal_state() on gym environments (#364) Gal Leibovich 2019-07-02 13:43:23 +03:00
  • a576ab5659 tests: Removed mxnet from functional tests + minor fix on rewards (#362) anabwan 2019-06-27 18:52:29 +03:00
  • 30c64d0656 using gym=0.12.5 instead of latest (#360) anabwan 2019-06-24 10:34:28 +03:00
  • d6795bd524 batchnorm fixes + disabling batchnorm in DDPG (#353) Gal Leibovich 2019-06-23 11:28:22 +03:00
  • 7b5d6a3f03 tests: stabling functional tests (#355) anabwan 2019-06-20 15:30:47 +03:00
  • 8e812ef82f Coach as a library (#348) shadiendrawis 2019-06-19 18:05:03 +03:00
  • 1c90bc22a1 ci: using serial jobs in nightly (#350) anabwan 2019-06-17 10:53:36 +03:00
  • 7eb884c5b2 TD3 (#338) Gal Leibovich 2019-06-16 11:11:21 +03:00
  • 8df3c46756 Do not hardcode path to bash (#332) Timo Kaufmann 2019-06-10 19:10:28 +02:00
  • a1bb8eef89 DDPG Critic Head Bug Fix (#344) Gal Leibovich 2019-06-05 17:47:56 +03:00
  • 0aa5359d63 tests: added assert for cp param and changing test args order (#342) anabwan 2019-06-05 00:16:50 +03:00
  • e49aac05aa Update README.md (#341) Gal Novik 2019-06-04 11:35:34 +03:00
  • f6d5e60eff Added build base for nightly (#340) anabwan 2019-06-03 23:04:34 +03:00
  • 6e7e7f6d3d Update setup.py to 0.12.1 (#337) Gal Novik 2019-05-30 10:13:36 +03:00
  • 23df868d32 Removed unnecessary futures dependency (#336) anabwan 2019-05-29 14:34:48 +03:00
  • 4c996e147e applying filters for a csv loaded dataset + some bug-fixes in data loading (#319) Gal Leibovich 2019-05-28 15:44:55 +03:00
  • 6319387357 increase timeout for golden tests (#335) anabwan 2019-05-28 14:19:11 +03:00
  • f5ba14575c tests: print logs on failure + fix -cp param (#327) anabwan 2019-05-28 13:45:43 +03:00
  • 251dc9ccc0 Preset dependent number of csv read attempts in golden testing (#334) Gal Leibovich 2019-05-28 12:19:57 +03:00
  • ddffac8570 fixed release version (#333) anabwan 2019-05-28 11:11:15 +03:00
  • 9e9c4fd332 Create a dataset using an agent (#306) Gal Leibovich 2019-05-28 09:34:49 +03:00
  • 342b7184bc Enabling Coach Documentation to be run even when environments are not installed (#326) anabwan 2019-05-27 10:46:07 +03:00
  • 2b7d536da4 Add head regularization costs to tf.losses (#292) James Casbon 2019-05-26 15:15:42 +01:00
  • 3b6e413532 tests: fix traces and changing workflow jobs (#316) anabwan 2019-05-26 15:27:36 +03:00
  • b567091d2e removed timestep_limit due to gym version upgrade (#325) anabwan 2019-05-26 13:58:16 +03:00
  • 30c2b2fc45 moving to skimage.transform.resize (#321) Gal Leibovich 2019-05-23 13:38:01 +03:00
  • acceb03ac0 bug fixes for OPE (#311) Gal Leibovich 2019-05-21 16:39:11 +03:00
  • 85d70dd7d5 tests: fix traces export presets (#315) anabwan 2019-05-13 15:32:30 +03:00
  • f78bbbdbd1 tests: weekly deployment (#304) anabwan 2019-05-13 14:51:38 +03:00
  • deb0251367 bug fix following PR #191 (#313) Gal Leibovich 2019-05-12 23:42:45 +03:00
  • aa9f3cefaf Printing input size as part of network summary (#310) Gal Novik 2019-05-12 15:40:02 +03:00
  • ffb55b4142 tests: update traces (#302) anabwan 2019-05-07 10:04:05 +03:00
  • 740359587d tests: fixed nightly (#301) anabwan 2019-05-05 08:28:57 +03:00
  • 582921ffe3 OPE: Weighted Importance Sampling (#299) Gal Leibovich 2019-05-02 19:25:42 +03:00
  • 74db141d5e SAC algorithm (#282) guyk1971 2019-05-01 18:37:49 +03:00
  • 33dc29ee99 Uploading checkpoint if crd provided (#191) Ajay Deshpande 2019-04-26 12:27:33 -07:00
  • b3db9ce77d tests: fixed failed tests - stabling CI (#298) anabwan 2019-04-23 15:12:11 +03:00
  • 9f625c197b fix for fetch rendering (#297) Gal Leibovich 2019-04-21 17:37:14 +03:00
  • f14915cada tests: removed Starcraft from CI (#296) anabwan 2019-04-21 13:51:14 +03:00
  • 4741b0b916 BCQ variant on top of DDQN (#276) Gal Leibovich 2019-04-16 17:06:23 +03:00
  • bdb9b224a8 Include missing RegressionHead. (#263) Federico Andres Lois 2019-04-16 09:24:06 -03:00
  • 20a8dea0dd tests: minor fix for functional tests (#289) anabwan 2019-04-15 12:28:23 +03:00
  • 88f9c926ab update comment describing why the output filters don't modify Agent.last_action_info zach dwiel 2019-04-08 12:14:35 -04:00
  • fd2c210915 rename AgentInterface.emulate_observe_on_trainer or observe_transition and call from AgentInterface.observe zach dwiel 2019-04-05 12:11:21 -04:00
  • f8741522e4 merge AgentInterface.emulate_act_on_trainer and AgentInterface.act zach dwiel 2019-04-05 11:49:09 -04:00
  • f2fead57e5 change method interface: AgentInterface.emulate_act_on_trainer(transition: Transition) -> emulate_act_on_trainer(action: ActionType) zach dwiel 2019-04-05 10:53:03 -04:00
  • b20e795ce0 create method LevelManager.acting_agent() zach dwiel 2019-04-05 10:42:48 -04:00
  • 54fdfe2da8 simplify rollout worker steps with new magic methods on StepMethod zach dwiel 2019-04-04 16:13:56 -04:00
  • 2cb078b4c2 add __truediv__, __rtruediv__ and __eq__ to StepMethod zach dwiel 2019-04-04 16:11:07 -04:00
  • 83da5cde2f remove unnecessary parentheses zach dwiel 2019-04-04 14:57:08 -04:00
  • dddaefb210 fixed bug in rollout worker where total number of improved steps are not taken zach dwiel 2019-04-04 14:55:31 -04:00