* Remove the use of daemon threads for Redis subscribe. * Emulate act and observe on trainer side to update internal vars.
* Adding initial interface for backend and redis pubsub * Addressing comments, adding super in all memories * Removing distributed experience replay