Discovering blind spots in reinforcement learning

Agents trained in simulation may make errors in the real world due to mismatches between training and execution environments. These mistakes can be dangerous and difficult to discover because the agent cannot predict them a priori. We propose using oracle feedback to learn a predictive model of thes...

Full description

Bibliographic Details
Main Authors: Ramakrishnan, Ramya, Kamar, Ece, Dey, Debadeepta, Shah, Julie A, Horvitz, Eric
Other Authors: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Format: Article
Language:English
Published: 2020
Online Access:https://hdl.handle.net/1721.1/125874