Ajay Deshpande
16b3e99f37
Setup basic CI flow ( #38 )
...
Adds automated running of unit, integration tests (and optionally longer running tests)
2018-10-24 18:27:58 -07:00
zach dwiel
3ba0df7d07
update GraphManager.act specified return type
2018-10-23 19:58:17 -04:00
Zach Dwiel
700a175902
rename save_checkpoint_secs -> checkpoint_save_secs
2018-10-23 17:10:58 -04:00
Zach Dwiel
9804b033a2
rename save_checkpoint_dir -> checkpoint_save_dir
2018-10-23 17:10:58 -04:00
Zach Dwiel
201a2237a1
restructure looping mechanism inGraphManager
2018-10-23 17:10:58 -04:00
Zach Dwiel
52560a2aae
introduce property GraphManager.current_step_counter
2018-10-23 17:10:04 -04:00
Zach Dwiel
776c94d551
reorder methods in GraphManager
2018-10-23 17:10:04 -04:00
Zach Dwiel
496a516de1
rename GraphManager.sync_graph -> sync
2018-10-23 17:08:29 -04:00
Zach Dwiel
5fee48dcfd
remove argument keep_networks_in_sync from GraphManager.act, and move this feature into the only place that activated it: GraphManager.train_and_act
2018-10-23 17:08:29 -04:00
Zach Dwiel
b2d864a5bd
remove out of date documentation
2018-10-23 17:08:29 -04:00
Zach Dwiel
d32d909238
move only invocation of GraphManager.handle_episode_ended inline
2018-10-23 17:08:29 -04:00
Zach Dwiel
18d84c5037
remove unnecessary timers from GraphManager
2018-10-23 16:58:17 -04:00
Zach Dwiel
cd30efe52e
remove unnecessary test result is None in GraphManager.act
2018-10-23 16:57:43 -04:00
Zach Dwiel
35d67cbd9b
use phase context in GraphManager.evaluate
2018-10-23 16:57:43 -04:00
Zach Dwiel
d3c341147e
simplify GraphManager.act by removing arguments: continue_until_game_over and return_on_game_over
2018-10-23 16:57:43 -04:00
Zach Dwiel
8be980912c
fixed typo from earlier commit
2018-10-23 16:57:43 -04:00
Zach Dwiel
517aac163a
introduce graph_manager.phase_context; make sure that calls to graph_manager.train automatically set training phase
2018-10-23 16:57:43 -04:00
Zach Dwiel
7382a142bb
remove unused steps parameter from GraphManager.train
2018-10-23 16:57:06 -04:00
Zach Dwiel
ad68fa263d
remove property GraphManager.training_start_time
2018-10-23 16:57:05 -04:00
Zach Dwiel
01f3a0594b
remove return values from GraphManager.act
2018-10-23 16:57:05 -04:00
Zach Dwiel
b02f269464
graph_manager:heatup uses total_steps_counters looping mechanism like other loops. graph_manager:act no longer needs to return any values
2018-10-23 16:57:05 -04:00
Ajay Deshpande
0e121c5762
Ignoring redis sub if testing
2018-10-23 16:55:37 -04:00
Ajay Deshpande
a7f5442015
Adding should_train helper and should_train in graph_manager
2018-10-23 16:54:43 -04:00
Balaji Subramaniam
844a5af831
Make distributed coach work end-to-end.
...
- With data store, memory backend and orchestrator interfaces.
2018-10-23 16:54:43 -04:00
Zach Dwiel
9f92064e67
cleanup graph_manager:act
2018-10-23 16:53:32 -04:00
Zach Dwiel
ed3a3b39be
add comments
2018-10-23 16:52:16 -04:00
Zach Dwiel
13d81f65b9
add redis options to training worker
2018-10-23 16:47:46 -04:00
Zach Dwiel
6541bc76b9
working checkpoints
2018-10-23 16:41:57 -04:00
Zach Dwiel
433bc3e27b
standardizing variable access
2018-10-23 16:40:33 -04:00
Gal Leibovich
5a8da90d32
bug-fix for dumping movies (+ small refactoring and rename 'VideoDumpMethod -> 'VideoDumpFilter')
2018-10-21 17:29:10 +03:00
Shadi Endrawis
364168490f
checkpointing fix
2018-10-07 20:06:08 +03:00
Shadi Endrawis
51726a5b80
network_imporvements branch merge
2018-10-02 13:43:36 +03:00
Zach Dwiel
673911ff7f
very minor cleanup
2018-09-12 10:51:56 -04:00
itaicaspi-intel
171fe97a3a
imitation related bug fixes
2018-09-12 15:26:16 +03:00
Itai Caspi
72a1d9d426
Itaicaspi/episode reset refactoring ( #105 )
...
* reordering of the episode reset operation and allowing to store episodes only when they are terminated
* reordering of the episode reset operation and allowing to store episodes only when they are terminated
* revert tensorflow-gpu to 1.9.0 + bug fix in should_train()
* tests readme file and refactoring of policy optimization agent train function
* Update README.md
* Update README.md
* additional policy optimization train function simplifications
* Updated the traces after the reordering of the environment reset
* docker and jenkins files
* updated the traces to the ones from within the docker container
* updated traces and added control suite to the docker
* updated jenkins file with the intel proxy + updated doom basic a3c test params
* updated line breaks in jenkins file
* added a missing line break in jenkins file
* refining trace tests ignored presets + adding a configurable beta entropy value
* switch the order of trace and golden tests in jenkins + fix golden tests processes not killed issue
* updated benchmarks for dueling ddqn breakout and pong
* allowing dynamic updates to the loss weights + bug fix in episode.update_returns
* remove docker and jenkins file
2018-09-04 15:07:54 +03:00
itaicaspi-intel
658b437079
removing datasets + imports optimization
2018-08-27 10:54:11 +03:00
Gal Leibovich
c1f428666e
bug-fix for checkpointing for single-worker algorithms
2018-08-19 20:17:15 +03:00
Itai Caspi
1de04d6fee
updated gifs in README + fix for multiworker crashes + improved Atari DQN and Dueling DDQN presets
2018-08-16 18:23:32 +03:00
Gal Novik
19ca5c24b1
pre-release 0.10.0
2018-08-13 17:11:34 +03:00