Rank2Reward: Learning Robot Reward Functions
from Passive Video

Rank2Reward: Learning Robot Reward Functions from Passive Video

Teaching robots novel skills with demonstrations via human-in-the-loop data collection techniques like kinesthetic teaching or teleoperation is a promising approach, but puts a heavy burden of data collection on human supervisors as well as instrumentation for inferring states and actions. In contra...

Full description

Bibliographic Details
Main Author:	Yang, Daniel Xin
Other Authors:	Agrawal, Pulkit
Format:	Thesis
Published:	Massachusetts Institute of Technology 2023
Online Access:	https://hdl.handle.net/1721.1/151463

Similar Items

Enhancing reward learning in the absence of an effect on reward
by: Browning, M
Published: (2023)

Scalable reward learning from demonstration
by: Michini, Bernard J., et al.
Published: (2015)

Learning reward timing in cortex through reward dependent expression of synaptic plasticity
by: Gavornik, Jeffrey, et al.
Published: (2009)

Inverse reinforcement learning with locally consistent reward functions
by: Nguyen, Quoc Phong, et al.
Published: (2018)

Bayesian nonparametric reward learning from demonstration
by: Michini, Bernard (Bernard J.)
Published: (2014)

There's reward in work
by: Chandran, Sheela
Published: (2014)

Antidepressants and reward
by: McCabe, C, et al.
Published: (2012)

Automatic shaping and decomposition of reward functions
by: Marthi, Bhaskara
Published: (2007)

The effects of rewards on executive function in preschoolers.
by: Lim, Huiqing.
Published: (2010)

Learning Reward Uncertainty in the Basal Ganglia
by: Bogacz, R, et al.
Published: (2016)

Learning optimal portfolios with intrinsic rewards
by: Guan, Zihang
Published: (2022)

The effect of noninstrumental information on reward learning
by: Embrey, JR, et al.
Published: (2024)

Coding of Reward Risk by Orbitofrontal Neurons Is Mostly Distinct from Coding of Reward Value
by: O'Neill, M, et al.
Published: (2010)

Mega-reward: Achieving human-level play without extrinsic rewards
by: Song, Y, et al.
Published: (2020)

Rewarding outstanding athletes
by: Ishak, Fadhli
Published: (2011)

The reward system of science
by: Paul-Hus, A, et al.
Published: (2017)

Differential impact of reward and punishment on functional connectivity after skill learning
by: Steel, A, et al.
Published: (2019)

A functional MRI study of the distributed neural circuitry of learning and reward
by: Awai, Alexandra F
Published: (2006)

An empowerment-based solution to robotic manipulation tasks with sparse rewards
by: Dai, Siyu, et al.
Published: (2023)

An Empowerment-based Solution to Robotic Manipulation Tasks with Sparse Rewards
by: Dai, Siyu, et al.
Published: (2022)

Dynamic Planning and Learning under Recovering Rewards
by: Simchi-Levi, David, et al.
Published: (2023)

Linking reward-learning and affect in health and depression
by: Halahakoon, DC
Published: (2023)

Do learning rates adapt to the distribution of rewards?
by: Gershman, Samuel J.
Published: (2016)

Reward-guided learning with and without causal attribution
by: Jocham, G, et al.
Published: (2016)

FUSION SPARSE AND SHAPING REWARD FUNCTION IN SOFT ACTOR-CRITIC DEEP REINFORCEMENT LEARNING FOR MOBILE ROBOT NAVIGATION
by: Abu Bakar, Mohamad Hafiz, et al.
Published: (2024)

Active Reward Learning for Co-Robotic Vision Based Exploration in Bandwidth Limited Environments
Published: (2021)

Active Reward Learning for Co-Robotic Vision Based Exploration in Bandwidth Limited Environments
by: Jamieson, Stewart Christopher., et al.
Published: (2021)

Segregated encoding of reward-identity and stimulus-reward associations in human orbitofrontal cortex.
by: Klein-Flügge, M, et al.
Published: (2013)

Reward Bases: A simple mechanism for adaptive acquisition of multiple reward types
by: Millidge, B, et al.
Published: (2024)

Social interaction for efficient agent learning from human reward
by: Li, Guangliang, et al.
Published: (2017)

Social interaction for efficient agent learning from human reward
by: Li, G, et al.
Published: (2017)

ANALISIS PERBANDINGAN KINERJA REKSA DANA SAHAM MENGGUNAKAN METODE REWARD TO VARIABILITY (RVAR), REWARD TO VOLATILITY (RVOL), DAN REWARD TO DIVERSIFICATION (RDIV)
by: , Rafina, et al.
Published: (2012)

Gaussian Process Planning with Lipschitz Continuous Reward Functions
by: Ling, Chun Kai, et al.
Published: (2017)

Risk sensitivity for amounts of and delay to rewards: Adaptation for uncertainty or by-product of reward rate maximising?
by: Shapiro, MS, et al.
Published: (2012)

Risk sensitivity for amounts of and delay to rewards: adaptation for uncertainty or by-product of reward rate maximising?
by: Shapiro, MS, et al.
Published: (2012)

Great result and reward scholarship
by: Sin Chiew, Daily
Published: (2011)

The Routledge companion to reward management
Published: (2018)

Reward for young green thinkers
by: The Star,
Published: (2013)

AI alignment and human reward
by: Butlin, P
Published: (2021)

Antisocial rewarding in structured populations
by: Dos Santos, M, et al.
Published: (2017)