Offline Reward Learning from Human Demonstrations and Feedback: A Linear Programming Approach
In many complex sequential decision-making tasks, there is often no known explicit reward function, and the only information available is human demonstrations and feedback data. To infer and shape the underlying reward function from this data, two key methodologies have emerged: inverse reinforcemen...
Main Author: | Kim, Kihyun |
---|---|
Other Authors: | Ozdaglar, Asuman |
Format: | Thesis |
Published: | Massachusetts Institute of Technology, 2024 |
Online Access: | https://hdl.handle.net/1721.1/156337 |
Similar Items
- Scalable reward learning from demonstration
  by: Michini, Bernard J., et al.
  Published: (2015)
- Bayesian nonparametric reward learning from demonstration
  by: Michini, Bernard (Bernard J.)
  Published: (2014)
- Involvement of human basal ganglia in offline feedback control of voluntary movement.
  by: Brown, P, et al.
  Published: (2006)
- Robot programming by human demonstration
  by: Delson, Nathan Joseph
  Published: (2007)
- Human-centric dialog training via offline reinforcement learning
  by: Jaques, Natasha, et al.
  Published: (2022)