Training the reinforcement learning agent at the Gym