Rank2Reward: Learning Robot Reward Functions from Passive Video
Teaching robots novel skills with demonstrations via human-in-the-loop data collection techniques like kinesthetic teaching or teleoperation is a promising approach, but puts a heavy burden of data collection on human supervisors as well as instrumentation for inferring states and actions. In contra...
Main Author: | Yang, Daniel Xin |
---|---|
Other Authors: | Agrawal, Pulkit |
Format: | Thesis |
Published: |
Massachusetts Institute of Technology
2023
|
Online Access: | https://hdl.handle.net/1721.1/151463 |
Similar Items
-
Enhancing reward learning in the absence of an effect on reward
by: Browning, M
Published: (2023) -
Scalable reward learning from demonstration
by: Michini, Bernard J., et al.
Published: (2015) -
Learning reward timing in cortex through reward dependent expression of synaptic plasticity
by: Gavornik, Jeffrey, et al.
Published: (2009) -
Inverse reinforcement learning with locally consistent reward functions
by: Nguyen, Quoc Phong, et al.
Published: (2018) -
Bayesian nonparametric reward learning from demonstration
by: Michini, Bernard (Bernard J.)
Published: (2014)