Discovery In Reinforcement Learning