gryf/coach

mirror of https://github.com/gryf/coach.git synced 2026-02-11 03:05:57 +01:00

Files

Gal Leibovich 310d31c227 integration test changes to reach the train part (#254 )

* integration test changes to override heatup to 1000 steps +  run each preset for 30 sec (to make sure we reach the train part)

* fixes to failing presets uncovered with this change + changes in the golden testing to properly test BatchRL

* fix for rainbow dqn

* fix to gym_environment (due to a change in Gym 0.12.1) + fix for rainbow DQN + some bug-fix in utils.squeeze_list

* fix for NEC agent

2019-03-27 21:14:19 +02:00

agents

Added ability to switch between tensorflow and mxnet using -f commandline argument. (#48 )

2018-10-30 15:29:34 -07:00

architectures

Added ONNX compatible broadcast_like function (#152 )

2018-11-25 11:23:18 +02:00

environments

pre-release 0.10.0

2018-08-13 17:11:34 +03:00

exploration_policies

pre-release 0.10.0

2018-08-13 17:11:34 +03:00

filters

more clear names for methods of Space (#181 )

2019-01-14 15:02:53 -05:00

graph_managers

restoring from a checkpoint file (#247 )

2019-03-17 16:28:09 +02:00

memories

N-step returns for rainbow (#67 )

2018-11-07 18:33:08 +02:00

presets

integration test changes to reach the train part (#254 )

2019-03-27 21:14:19 +02:00

utils

tests: added new tests + utils code improved (#221 )

2019-03-18 11:21:43 +02:00

__init__.py

pre-release 0.10.0

2018-08-13 17:11:34 +03:00

conftest.py

tests: added new tests + utils code improved (#221 )

2019-03-18 11:21:43 +02:00

pytest.ini

ignore deprecation warnings in test logging

2018-10-23 16:51:48 -04:00

README.md

tests: added new setup configuration + test args (#211 )

2019-02-13 07:43:59 -05:00

test_checkpoint.py

tests: added new tests + utils code improved (#221 )

2019-03-18 11:21:43 +02:00

test_coach_args.py

tests: added new tests + utils code improved (#221 )

2019-03-18 11:21:43 +02:00

test_core_types.py

restructure looping mechanism inGraphManager

2018-10-23 17:10:58 -04:00

test_dist_coach.py

Adding framework for multinode tests (#149 )

2019-02-26 13:53:12 -08:00

test_eks.py

prevent long job CI timeouts owing to lack of EKS token refresh (#183 )

2019-01-09 15:12:00 -08:00

test_golden.py

integration test changes to reach the train part (#254 )

2019-03-27 21:14:19 +02:00

test_saver.py

Adding checkpointing framework (#74 )

2018-11-19 19:45:49 +02:00

test_schedules.py

pre-release 0.10.0

2018-08-13 17:11:34 +03:00

test_spaces.py

ACER algorithm (#184 )

2019-02-20 23:52:34 +02:00

trace_tests.py

create per environment Dockerfiles. (#70 )

2018-11-14 07:40:22 -08:00

README.md

Coach - Tests

Coach is a complex framework consisting of various features and running schemes. On top of that, reinforcement learning adds stochasticity in many places along the experiments, which makes getting the same results run-after-run is almost impossible. To address those issues, and ensure that Coach keeps working as expected, we separated our testing mechanism into several parts, each testing the framework in different areas and strictness.

Docker -

The docker image we supply checks Coach in terms of installation process, and verifies that all the components are installed correctly. To build the Docker image, use the command:
```
cd docker
make build_base && make build
make run
```
Unit tests -

The unit tests test sub components of Coach with different parameters and verifies that they work as expected. There are currently tens of tests and we keep adding new ones. We use pytest in order to run the tests, using the following command:
```
python3 -m pytest rl_coach/tests -m unit_test
```
Integration tests -

The integration tests make sure that all the presets are runnable. It's a static tests that does not check the performance at all. It only checks that the preset can start running with no import error or other bugs. To run the integration tests, use the following command:
```
python3 -m pytest rl_coach/tests -m integration_test
```
Golden tests -

The golden tests run a subset of the presets available in Coach, and verify that they pass a known score after a known amount of steps. The threshold for the tests are defined as part of each preset. The presets which are tested are presets that can be run in a short amount of time, and the requirements for passing are quite weak. The golden tests can be run using the following command:
```
python3 -m pytest rl_coach/tests -m golden_test
```
Trace tests -

The trace tests run all the presets available in Coach, and compare their csv output to traces we extracted after verifying each preset works correctly. The requirements for passing these tests are quite strict - all the values in the csv file should match the golden csv file exactly. The trace tests can be run in parallel to shorten the testing time. To run the tests in parallel use the following command:
```
python3 rl_coach/tests/trace_tests.py -prl
```
Optional PyTest Flags -

Using -k expr to select tests based on their name; The -k command line option to specify an expression which implements a substring match on the test names instead of the exact match on markers that -m provides. This makes it easy to select tests based on their names:
```
python3 -m pytest rl_coach/tests -k Doom
```
Using -v (--verbose) expr to show tests progress during running the tests, -v can be added with -m or with -k, to use -v see the following commands:
```
python3 -m pytest rl_coach/tests -v -m golden_test
OR
python3 -m pytest rl_coach/tests -v -k Doom
```