SMS dit: Trial without error: Towards safe reinforcement learning via human intervention