Trial without error: Towards safe reinforcement learning via human intervention
During training, model-free reinforcement learning (RL) systems can explore actions that lead to harmful or costly consequences. Having a human “in the loop” and ready to intervene at all times can prevent these mistakes, but is prohibitively expensive for current algorithms. We explore how human ov...
Main Authors: | , , , |
---|---|
Format: | Conference item |
Published: |
ACM Digital Library
2018
|