Identifiability in inverse reinforcement learning

Inverse reinforcement learning attempts to reconstruct the reward function in a Markov decision problem, using observations of agent actions. As already observed in Russell [1998] the problem is ill-posed, and the reward function is not identifiable, even under the presence of perfect information ab...

Full description

Bibliographic Details
Main Authors: Cao, H, Cohen, S, Szpruch, L
Format: Conference item
Language:English
Published: Neural Information Processing Systems Foundation 2021