Trial without error: Towards safe reinforcement learning via human intervention

During training, model-free reinforcement learning (RL) systems can explore actions that lead to harmful or costly consequences. Having a human “in the loop” and ready to intervene at all times can prevent these mistakes, but is prohibitively expensive for current algorithms. We explore how human ov...

Full description

Bibliographic Details
Main Authors: Saunders, S, Sastry, G, Stuhlmüller, A, Evans, O
Format: Conference item
Published: ACM Digital Library 2018