Inverse reinforcement learning with locally consistent reward functions

Existing inverse reinforcement learning (IRL) algorithms have assumed each expert’s demonstrated trajectory to be produced by only a single reward function. This paper presents a novel generalization of the IRL problem that allows each trajectory to be generated by multiple locally consistent reward...

Full description

Bibliographic Details
Main Authors:	Nguyen, Quoc Phong, Low, Bryan Kian Hsiang, Jaillet, Patrick
Other Authors:	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Format:	Article
Language:	en_US
Published:	Neural Information Processing Systems Foundation 2018
Online Access:	http://hdl.handle.net/1721.1/113094 https://orcid.org/0000-0002-8585-6566

Internet

http://hdl.handle.net/1721.1/113094
https://orcid.org/0000-0002-8585-6566

Inverse reinforcement learning with locally consistent reward functions

Internet

Similar Items