Model-Based Learning and Planning for Intelligent Manipulation Using Probabilistic Hybrid Models

While the rapid advancement of deep learning and grasp-affordance grasping has allowed the fast planning of grasping poses directly from visual inputs, it still commonly adopts an open-loop architecture that has made it slow to react and prone to failure, limiting its use in more complicated manipul...

Full description

Bibliographic Details
Main Author: Feng, Meng
Other Authors: Williams, Brian C.
Format: Thesis
Published: Massachusetts Institute of Technology 2022
Online Access:https://hdl.handle.net/1721.1/139602
_version_ 1826211826013044736
author Feng, Meng
author2 Williams, Brian C.
author_facet Williams, Brian C.
Feng, Meng
author_sort Feng, Meng
collection MIT
description While the rapid advancement of deep learning and grasp-affordance grasping has allowed the fast planning of grasping poses directly from visual inputs, it still commonly adopts an open-loop architecture that has made it slow to react and prone to failure, limiting its use in more complicated manipulation problems that require online adaptation and dexterous interactions. Although designing behavior tree may work as short-term hotfix for simple manipulation tasks, such approach is not scalable as the complexity of the desired skill increases. Modern deep reinforcement learning techniques have shown good performances in learning closed-loop policies for a selected sets of atomic manipulation skills, but they often require a prohibitive amount of training data and lack interpretability in the learned policy models. To reduce the cost of learning closed-loop manipulation controllers and facilitate more transparency, we propose to a model-based reinforcement learning algorithm. Our algorithm learns deep probabilistic hybrid automata (DPHA), a novel graphical model that learns to predict both low-level state evolution as well as high-level transitions among distinct modalities. We also show that the discrete modes that naturally arise when learning DPHA models can provide promising insights that reveal semantically meaningful intentions and discover potentially generalizable skills. We present a sampling-based model predictive control algorithm that leverages the DPHA model to plan for actions over spatially and temporally extended horizons. Our benchmark shows that these algorithms are capable of achieving comparable asymptotic performance with up to 10 times less training data compared to standard benchmark algorithms on pushing and grasping problems.
first_indexed 2024-09-23T15:11:59Z
format Thesis
id mit-1721.1/139602
institution Massachusetts Institute of Technology
last_indexed 2024-09-23T15:11:59Z
publishDate 2022
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/1396022022-01-15T03:11:10Z Model-Based Learning and Planning for Intelligent Manipulation Using Probabilistic Hybrid Models Feng, Meng Williams, Brian C. Massachusetts Institute of Technology. Department of Aeronautics and Astronautics While the rapid advancement of deep learning and grasp-affordance grasping has allowed the fast planning of grasping poses directly from visual inputs, it still commonly adopts an open-loop architecture that has made it slow to react and prone to failure, limiting its use in more complicated manipulation problems that require online adaptation and dexterous interactions. Although designing behavior tree may work as short-term hotfix for simple manipulation tasks, such approach is not scalable as the complexity of the desired skill increases. Modern deep reinforcement learning techniques have shown good performances in learning closed-loop policies for a selected sets of atomic manipulation skills, but they often require a prohibitive amount of training data and lack interpretability in the learned policy models. To reduce the cost of learning closed-loop manipulation controllers and facilitate more transparency, we propose to a model-based reinforcement learning algorithm. Our algorithm learns deep probabilistic hybrid automata (DPHA), a novel graphical model that learns to predict both low-level state evolution as well as high-level transitions among distinct modalities. We also show that the discrete modes that naturally arise when learning DPHA models can provide promising insights that reveal semantically meaningful intentions and discover potentially generalizable skills. We present a sampling-based model predictive control algorithm that leverages the DPHA model to plan for actions over spatially and temporally extended horizons. Our benchmark shows that these algorithms are capable of achieving comparable asymptotic performance with up to 10 times less training data compared to standard benchmark algorithms on pushing and grasping problems. S.M. 2022-01-14T15:22:39Z 2022-01-14T15:22:39Z 2021-06 2021-10-26T17:03:49.691Z Thesis https://hdl.handle.net/1721.1/139602 In Copyright - Educational Use Permitted Copyright MIT http://rightsstatements.org/page/InC-EDU/1.0/ application/pdf Massachusetts Institute of Technology
spellingShingle Feng, Meng
Model-Based Learning and Planning for Intelligent Manipulation Using Probabilistic Hybrid Models
title Model-Based Learning and Planning for Intelligent Manipulation Using Probabilistic Hybrid Models
title_full Model-Based Learning and Planning for Intelligent Manipulation Using Probabilistic Hybrid Models
title_fullStr Model-Based Learning and Planning for Intelligent Manipulation Using Probabilistic Hybrid Models
title_full_unstemmed Model-Based Learning and Planning for Intelligent Manipulation Using Probabilistic Hybrid Models
title_short Model-Based Learning and Planning for Intelligent Manipulation Using Probabilistic Hybrid Models
title_sort model based learning and planning for intelligent manipulation using probabilistic hybrid models
url https://hdl.handle.net/1721.1/139602
work_keys_str_mv AT fengmeng modelbasedlearningandplanningforintelligentmanipulationusingprobabilistichybridmodels