Sample efficiency, transfer learning and interpretability for deep reinforcement learning