Rank2Reward: Learning Robot Reward Functions from Passive Video

Teaching robots novel skills with demonstrations via human-in-the-loop data collection techniques like kinesthetic teaching or teleoperation is a promising approach, but puts a heavy burden of data collection on human supervisors as well as instrumentation for inferring states and actions. In contra...

Полное описание

Библиографические подробности
Главный автор: Yang, Daniel Xin
Другие авторы: Agrawal, Pulkit
Формат: Диссертация
Опубликовано: Massachusetts Institute of Technology 2023
Online-ссылка:https://hdl.handle.net/1721.1/151463