A Framework for LLM-based Lifelong Learning in Robot Manipulation

While robotic agents have become increasingly adept at low-level manipulation skills, increasingly they are being guided by large language model planners that decompose complex tasks into subgoals. Recent works indicate that these language models may also be effective skill learners. We develop HaLP...

Full description

Bibliographic Details
Main Author:	Mao, Jerry W.
Other Authors:	Agrawal, Pulkit
Format:	Thesis
Published:	Massachusetts Institute of Technology 2024
Online Access:	https://hdl.handle.net/1721.1/153895

_version_	1826197145005326336
author	Mao, Jerry W.
author2	Agrawal, Pulkit
author_facet	Agrawal, Pulkit Mao, Jerry W.
author_sort	Mao, Jerry W.
collection	MIT
description	While robotic agents have become increasingly adept at low-level manipulation skills, increasingly they are being guided by large language model planners that decompose complex tasks into subgoals. Recent works indicate that these language models may also be effective skill learners. We develop HaLP 2.0, a modular and extensible framework for lifelong learning in human-assisted language planning, using GPT-4 to propose a curriculum of skills that is learned, used, and intelligently reused. Our system is designed for large-scale experiments, is equipped with a user-friendly interface, and is extensible to new skill learning frameworks. We demonstrate extensibility by comparing alternative implementations of our abstractions and improving overall performance by incorporating novel frameworks. Moreover, we conduct a focused study of GPT-4, using crowd-sourced scene and task datasets, finding that language models are capable agents of skill reuse and adaptation. We observe that while performance is dependent on language context, supplying optimized prompts can yield exceptional skill reuse behaviors. We envision that as manipulation primitives and large language models become more powerful, our system will be ready to synthesize their capabilities to create an autonomous system for lifelong learning, that can one day be deployed in the real world.
first_indexed	2024-09-23T10:43:29Z
format	Thesis
id	mit-1721.1/153895
institution	Massachusetts Institute of Technology
last_indexed	2024-09-23T10:43:29Z
publishDate	2024
publisher	Massachusetts Institute of Technology
record_format	dspace
spelling	mit-1721.1/1538952024-03-22T03:51:00Z A Framework for LLM-based Lifelong Learning in Robot Manipulation Mao, Jerry W. Agrawal, Pulkit Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science While robotic agents have become increasingly adept at low-level manipulation skills, increasingly they are being guided by large language model planners that decompose complex tasks into subgoals. Recent works indicate that these language models may also be effective skill learners. We develop HaLP 2.0, a modular and extensible framework for lifelong learning in human-assisted language planning, using GPT-4 to propose a curriculum of skills that is learned, used, and intelligently reused. Our system is designed for large-scale experiments, is equipped with a user-friendly interface, and is extensible to new skill learning frameworks. We demonstrate extensibility by comparing alternative implementations of our abstractions and improving overall performance by incorporating novel frameworks. Moreover, we conduct a focused study of GPT-4, using crowd-sourced scene and task datasets, finding that language models are capable agents of skill reuse and adaptation. We observe that while performance is dependent on language context, supplying optimized prompts can yield exceptional skill reuse behaviors. We envision that as manipulation primitives and large language models become more powerful, our system will be ready to synthesize their capabilities to create an autonomous system for lifelong learning, that can one day be deployed in the real world. M.Eng. 2024-03-21T19:14:18Z 2024-03-21T19:14:18Z 2024-02 2024-03-04T16:37:58.746Z Thesis https://hdl.handle.net/1721.1/153895 In Copyright - Educational Use Permitted Copyright retained by author(s) https://rightsstatements.org/page/InC-EDU/1.0/ application/pdf Massachusetts Institute of Technology
spellingShingle	Mao, Jerry W. A Framework for LLM-based Lifelong Learning in Robot Manipulation
title	A Framework for LLM-based Lifelong Learning in Robot Manipulation
title_full	A Framework for LLM-based Lifelong Learning in Robot Manipulation
title_fullStr	A Framework for LLM-based Lifelong Learning in Robot Manipulation
title_full_unstemmed	A Framework for LLM-based Lifelong Learning in Robot Manipulation
title_short	A Framework for LLM-based Lifelong Learning in Robot Manipulation
title_sort	framework for llm based lifelong learning in robot manipulation
url	https://hdl.handle.net/1721.1/153895
work_keys_str_mv	AT maojerryw aframeworkforllmbasedlifelonglearninginrobotmanipulation AT maojerryw frameworkforllmbasedlifelonglearninginrobotmanipulation

A Framework for LLM-based Lifelong Learning in Robot Manipulation

Similar Items