Gal Leibovich
8f99409387
updating algorithms.png for README
2018-08-16 16:46:26 +03:00
Gal Leibovich
ab5a81c7ee
fix for dumping movies, without rendering, for pendulum_with_goals
2018-08-14 18:13:44 +03:00
Gal Leibovich
e783157b15
Update README.md
2018-08-14 16:16:41 +03:00
Itai Caspi
824fdeee59
Update README with new coach aliases
2018-08-14 14:36:41 +03:00
Gal Leibovich
7a76d63da4
Update README.md
2018-08-13 17:19:47 +03:00
Gal Novik
19ca5c24b1
pre-release 0.10.0
2018-08-13 17:11:34 +03:00
Itai Caspi
d44c329bb8
Update README.md
2018-06-25 17:46:01 +03:00
Itai Caspi
cfd4fe0faf
Update README.md
2018-06-25 17:43:15 +03:00
Gal Leibovich
2807c29f27
fix for measurements in the initial state (fix for DFP)
2018-05-29 16:47:38 +03:00
itaicaspi-intel
7725dabc86
checkpoints bug fix
2018-05-26 17:49:13 +03:00
itaicaspi-intel
462c6e314b
bug fix in nec checkpoint saving
2018-05-24 15:15:33 +03:00
Itai Caspi
d302168c8c
Parallel agents fixes ( #95 )
...
* Parallel agents related bug fixes: checkpoint restore, tensorboard integration.
Adding narrow networks support.
Reference code for unlimited number of checkpoints
2018-05-24 14:24:19 +03:00
itaicaspi-intel
6c0b59b4de
constraining gym installation to version 0.9.4
2018-05-22 11:01:58 +03:00
itaicaspi-intel
a57b7004a8
updating dashboard
2018-05-09 09:26:15 +03:00
Gal Novik
dafdb05a7c
bug fixes for clippedppo and checkpoints
2018-04-30 15:13:29 +03:00
Itai Caspi
f31159aad6
bug fixes for carla environment ( #93 )
2018-04-23 11:13:24 +03:00
Itai Caspi
52eb159f69
multiple bug fixes in dealing with measurements + CartPole_DFP preset ( #92 )
2018-04-23 10:44:46 +03:00
itaicaspi-intel
5d5562bf62
moving the docs to github
2018-04-23 09:14:20 +03:00
jtoy
cafa152382
update requirements to have valid tornado version ( #84 )
2018-04-02 14:21:35 +03:00
Itai Caspi
a7206ed702
Multiple improvements and bug fixes ( #66 )
...
* Multiple improvements and bug fixes:
* Using lazy stacking to save on memory when using a replay buffer
* Remove step counting for evaluation episodes
* Reset game between heatup and training
* Major bug fixes in NEC (is reproducing the paper results for pong now)
* Image input rescaling to 0-1 is now optional
* Change the terminal title to be the experiment name
* Observation cropping for atari is now optional
* Added random number of noop actions for gym to match the dqn paper
* Fixed a bug where the evaluation episodes won't start with the max possible ale lives
* Added a script for plotting the results of an experiment over all the atari games
2018-02-26 12:29:07 +02:00
Zach Dwiel
4fe9cba445
remove debug
2018-02-21 10:05:57 -05:00
Zach Dwiel
eba900067c
remove debug
2018-02-21 10:05:57 -05:00
Zach Dwiel
d1bf83047c
remove debug
2018-02-21 10:05:57 -05:00
Zach Dwiel
ef46e194af
remove unused commented code
2018-02-21 10:05:57 -05:00
Zach Dwiel
d9303e731e
remove python2 compatibility
2018-02-21 10:05:57 -05:00
Zach Dwiel
ec68bd4959
make sure that for now observation spaces all include an observation key
2018-02-21 10:05:57 -05:00
Zach Dwiel
0740ebcdac
by default assume state["observation"] is where the image for rendering can be found
2018-02-21 10:05:57 -05:00
Zach Dwiel
f9f92a42fd
cleanup debugging code
2018-02-21 10:05:57 -05:00
Zach Dwiel
86362683b1
comment
2018-02-21 10:05:57 -05:00
Zach Dwiel
8fc24a2bbe
fix bc_agent
2018-02-21 10:05:57 -05:00
Zach Dwiel
d8f5a35013
fix qr_dqn_agent
2018-02-21 10:05:57 -05:00
Zach Dwiel
e1ad86417f
fix n_step_q_agent
2018-02-21 10:05:57 -05:00
Zach Dwiel
5cf10e5f52
fix bug in ddpg
2018-02-21 10:05:57 -05:00
Zach Dwiel
8248caf35e
fix more agents
2018-02-21 10:05:57 -05:00
Zach Dwiel
98f57a0d87
fix ddpg
2018-02-21 10:05:57 -05:00
Zach Dwiel
943e41ba58
fix nec_agent
2018-02-21 10:05:57 -05:00
Zach Dwiel
ee6e0bdc3b
fix keep_dims -> keepdims
2018-02-21 10:05:57 -05:00
Zach Dwiel
39a28aba95
fix clipped ppo
2018-02-21 10:05:57 -05:00
Zach Dwiel
85afb86893
temp commit
2018-02-21 10:05:57 -05:00
Gal Leibovich
16c5032735
fix for tensorboard visualization slowing execution even when it is off
...
apparently tensorflow still collect summary data even when no summary FileWriter is defined.
2018-02-18 16:35:24 +02:00
Itai Caspi
72d34f4063
adding a flag to prevent summary
2018-02-15 13:47:14 +02:00
Itai Caspi
55c8c87afc
allow visualizing the observation + bug fixes to coach summary
2018-02-15 13:47:14 +02:00
Itai Caspi
5d1a2bc392
Adding a summary when exiting coach
2018-02-13 11:11:26 +02:00
Itai Caspi
ba96e585d2
appending csv's from logger instead of rewriting them
2018-02-12 14:52:50 +02:00
Itai Caspi
569ca39ce6
Dashboard color selection + removing old legend
2018-02-09 14:52:58 +02:00
Itai Caspi
8a4383e86f
Added an improved legend to dashboard
2018-02-08 16:48:46 +02:00
Itai Caspi
b071599cb0
updating intel optimized tensorflow to version 1.4
2018-02-08 09:20:29 +02:00
Itai Caspi
462fe9796b
several bug fixes in dashboard
2018-02-07 12:49:34 +02:00
galleibo-intel
4025496783
Setting tensorflow-gpu version to 1.4.1 (1.5.0 is not tested yet)
2018-02-05 15:48:00 +02:00
Gal Leibovich
7c8962c991
adding support in tensorboard ( #52 )
...
* bug-fix in architecture.py where additional fetches would acquire more entries than it should
* change in run_test to allow ignoring some test(s)
2018-02-05 15:21:49 +02:00