Offline Reward Learning from Human Demonstrations and Feedback: A Linear Programming Approach

Offline Reward Learning from Human Demonstrations and Feedback: A Linear Programming Approach

In many complex sequential decision-making tasks, there is often no known explicit reward function, and the only information available is human demonstrations and feedback data. To infer and shape the underlying reward function from this data, two key methodologies have emerged: inverse reinforcemen...

Full description

Bibliographic Details
Main Author:	Kim, Kihyun
Other Authors:	Ozdaglar, Asuman
Format:	Thesis
Published:	Massachusetts Institute of Technology 2024
Online Access:	https://hdl.handle.net/1721.1/156337

Similar Items

Scalable reward learning from demonstration
by: Michini, Bernard J., et al.
Published: (2015)

Bayesian nonparametric reward learning from demonstration
by: Michini, Bernard (Bernard J.)
Published: (2014)

Involvement of human basal ganglia in offline feedback control of voluntary movement.
by: Brown, P, et al.
Published: (2006)

Robot programming by human demonstration
by: Delson, Nathan Joseph
Published: (2007)

Human-centric dialog training via offline reinforcement learning
by: Jaques, Natasha, et al.
Published: (2022)

Social interaction for efficient agent learning from human reward
by: Li, Guangliang, et al.
Published: (2017)

Learning Emergent Gaits with Decentralized Phase Oscillators: on the role of Observations, Rewards, and Feedback
by: Zhang, Jenny L.
Published: (2024)

Online and offline learning in operations
by: Wang, Li
Published: (2021)

Offline programming for peening of complex geometry
by: Chua, Aldrich Yu Han
Published: (2019)

Learning air traffic controller strategies with demonstration-based and physiological feedback
by: Pham, Duc-Thinh, et al.
Published: (2020)

Demonstration of feedback control using LEGO NXT
by: Yeo, Kai Xiang.
Published: (2012)

Rank2Reward: Learning Robot Reward Functions from Passive Video
by: Yang, Daniel Xin
Published: (2023)

Using informative behavior to increase engagement while learning from human reward
by: Li, Guangliang, et al.
Published: (2016)

Sequential offline-online-offline measurement approach for high-frequency LCLC resonant converters in the TWTA applications
by: Zhao, Bin, et al.
Published: (2022)

Mega-reward: Achieving human-level play without extrinsic rewards
by: Song, Y, et al.
Published: (2020)

Learning from demonstration in the wild
by: Behbahani, F, et al.
Published: (2019)

Learning from the wizard: Programming social interaction through teleoperated demonstrations
by: Breazeal, Cynthia
Published: (2021)

Offline Pricing and Demand Learning with Censored Data
by: Bu, Jinzhi, et al.
Published: (2023)

Tracing curves in the plane: geometric-invariant learning from human demonstrations
by: Turlapati, Sri Harsha, et al.
Published: (2024)

The Benefits of Offline Merchandise in Brand Building
by: Kim, Saemi
Published: (2022)

Segregated encoding of reward-identity and stimulus-reward associations in human orbitofrontal cortex.
by: Klein-Flügge, M, et al.
Published: (2013)

The design of an intuitive teaching interface for robot programming by human demonstration
by: Pinkney, James Bassey
Published: (2008)

Program synthesis from execution traces and demonstrations
by: Yessenov, Kuat T
Published: (2016)

Data-Efficient Offline Reinforcement Learning with Heterogeneous Agents
by: Alumootil, Varkey
Published: (2022)

Demonstration of a Photonic-Based Linear Temperature Sensor
by: Tao, Ji Fang, et al.
Published: (2016)

Imitation learning from demonstration videos
by: Zeng, Jingbo
Published: (2024)

Living in an offline world
by: Teh, Boon Sung
Published: (2014)

Learning reward timing in cortex through reward dependent expression of synaptic plasticity
by: Gavornik, Jeffrey, et al.
Published: (2009)

Benchmarking Potential Based Rewards for Learning Humanoid Locomotion
by: Jeon, Se Hwan, et al.
Published: (2024)

Temporally dissociable contributions of human medial prefrontal subregions to reward-guided learning
by: Hauser, T, et al.
Published: (2015)

Evaluation of linear multiinput-multioutput feedback control for a human arm model
by: Massaquoi, Steven G
Published: (2005)

Efficient Model Learning from Joint-Action Demonstrations for Human-Robot Collaborative Tasks
by: Shah, Julie A, et al.
Published: (2017)

Learning optimal portfolios with intrinsic rewards
by: Guan, Zihang
Published: (2022)

Learning articulated motions from visual demonstration
by: Pillai, Sudeep
Published: (2014)

Stability of linear feedback systems.
by: Davis, Jon H., 1943-
Published: (2005)

Offline web subtitle editor
by: Tan, Yan Ling
Published: (2018)

Offline web subtitle editor
by: Yeo, Desmond Kok Leong
Published: (2021)

Offline Authentication of Untrusted Storage
by: Clarke, Dwaine, et al.
Published: (2023)

An evaluation on offline signature verification using artificial neural network approach
by: Khalifa, Othman Omran, et al.
Published: (2013)

Analysis Of Failure In Offline English Alphabet Recognition With Data Mining Approach
by: Munnian, Ruthrakumar
Published: (2019)