Demonstration and offset augmented meta reinforcement learning with sparse rewards

Abstract This paper introduces DOAMRL, a novel meta-reinforcement learning (meta-RL) method that extends the Model-Agnostic Meta-Learning (MAML) framework. The method addresses a key limitation of existing meta-RL approaches, which struggle to effectively use suboptimal demonstrations to guide train...

Full description

Bibliographic Details
Main Authors: Haorui Li, Jiaqi Liang, Xiaoxuan Wang, Chengzhi Jiang, Linjing Li, Daniel Zeng
Format: Article
Language:English
Published: Springer 2025-02-01
Series:Complex & Intelligent Systems
Subjects:
Online Access:https://doi.org/10.1007/s40747-025-01785-0