Trial without error: Towards safe reinforcement learning via human intervention

During training, model-free reinforcement learning (RL) systems can explore actions that lead to harmful or costly consequences. Having a human “in the loop” and ready to intervene at all times can prevent these mistakes, but is prohibitively expensive for current algorithms. We explore how human ov...

Celý popis

Podrobná bibliografie
Hlavní autoři:	Saunders, S, Sastry, G, Stuhlmüller, A, Evans, O
Médium:	Conference item
Vydáno:	ACM Digital Library 2018

Trial without error: Towards safe reinforcement learning via human intervention

Podobné jednotky