Model-Based Learning and Planning for Intelligent Manipulation Using Probabilistic Hybrid Models
While the rapid advancement of deep learning and grasp-affordance grasping has allowed the fast planning of grasping poses directly from visual inputs, it still commonly adopts an open-loop architecture that has made it slow to react and prone to failure, limiting its use in more complicated manipul...
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis |
Published: |
Massachusetts Institute of Technology
2022
|
Online Access: | https://hdl.handle.net/1721.1/139602 |
_version_ | 1826211826013044736 |
---|---|
author | Feng, Meng |
author2 | Williams, Brian C. |
author_facet | Williams, Brian C. Feng, Meng |
author_sort | Feng, Meng |
collection | MIT |
description | While the rapid advancement of deep learning and grasp-affordance grasping has allowed the fast planning of grasping poses directly from visual inputs, it still commonly adopts an open-loop architecture that has made it slow to react and prone to failure, limiting its use in more complicated manipulation problems that require online adaptation and dexterous interactions. Although designing behavior tree may work as short-term hotfix for simple manipulation tasks, such approach is not scalable as the complexity of the desired skill increases. Modern deep reinforcement learning techniques have shown good performances in learning closed-loop policies for a selected sets of atomic manipulation skills, but they often require a prohibitive amount of training data and lack interpretability in the learned policy models. To reduce the cost of learning closed-loop manipulation controllers and facilitate more transparency, we propose to a model-based reinforcement learning algorithm. Our algorithm learns deep probabilistic hybrid automata (DPHA), a novel graphical model that learns to predict both low-level state evolution as well as high-level transitions among distinct modalities. We also show that the discrete modes that naturally arise when learning DPHA models can provide promising insights that reveal semantically meaningful intentions and discover potentially generalizable skills. We present a sampling-based model predictive control algorithm that leverages the DPHA model to plan for actions over spatially and temporally extended horizons. Our benchmark shows that these algorithms are capable of achieving comparable asymptotic performance with up to 10 times less training data compared to standard benchmark algorithms on pushing and grasping problems. |
first_indexed | 2024-09-23T15:11:59Z |
format | Thesis |
id | mit-1721.1/139602 |
institution | Massachusetts Institute of Technology |
last_indexed | 2024-09-23T15:11:59Z |
publishDate | 2022 |
publisher | Massachusetts Institute of Technology |
record_format | dspace |
spelling | mit-1721.1/1396022022-01-15T03:11:10Z Model-Based Learning and Planning for Intelligent Manipulation Using Probabilistic Hybrid Models Feng, Meng Williams, Brian C. Massachusetts Institute of Technology. Department of Aeronautics and Astronautics While the rapid advancement of deep learning and grasp-affordance grasping has allowed the fast planning of grasping poses directly from visual inputs, it still commonly adopts an open-loop architecture that has made it slow to react and prone to failure, limiting its use in more complicated manipulation problems that require online adaptation and dexterous interactions. Although designing behavior tree may work as short-term hotfix for simple manipulation tasks, such approach is not scalable as the complexity of the desired skill increases. Modern deep reinforcement learning techniques have shown good performances in learning closed-loop policies for a selected sets of atomic manipulation skills, but they often require a prohibitive amount of training data and lack interpretability in the learned policy models. To reduce the cost of learning closed-loop manipulation controllers and facilitate more transparency, we propose to a model-based reinforcement learning algorithm. Our algorithm learns deep probabilistic hybrid automata (DPHA), a novel graphical model that learns to predict both low-level state evolution as well as high-level transitions among distinct modalities. We also show that the discrete modes that naturally arise when learning DPHA models can provide promising insights that reveal semantically meaningful intentions and discover potentially generalizable skills. We present a sampling-based model predictive control algorithm that leverages the DPHA model to plan for actions over spatially and temporally extended horizons. Our benchmark shows that these algorithms are capable of achieving comparable asymptotic performance with up to 10 times less training data compared to standard benchmark algorithms on pushing and grasping problems. S.M. 2022-01-14T15:22:39Z 2022-01-14T15:22:39Z 2021-06 2021-10-26T17:03:49.691Z Thesis https://hdl.handle.net/1721.1/139602 In Copyright - Educational Use Permitted Copyright MIT http://rightsstatements.org/page/InC-EDU/1.0/ application/pdf Massachusetts Institute of Technology |
spellingShingle | Feng, Meng Model-Based Learning and Planning for Intelligent Manipulation Using Probabilistic Hybrid Models |
title | Model-Based Learning and Planning for Intelligent Manipulation Using Probabilistic Hybrid Models |
title_full | Model-Based Learning and Planning for Intelligent Manipulation Using Probabilistic Hybrid Models |
title_fullStr | Model-Based Learning and Planning for Intelligent Manipulation Using Probabilistic Hybrid Models |
title_full_unstemmed | Model-Based Learning and Planning for Intelligent Manipulation Using Probabilistic Hybrid Models |
title_short | Model-Based Learning and Planning for Intelligent Manipulation Using Probabilistic Hybrid Models |
title_sort | model based learning and planning for intelligent manipulation using probabilistic hybrid models |
url | https://hdl.handle.net/1721.1/139602 |
work_keys_str_mv | AT fengmeng modelbasedlearningandplanningforintelligentmanipulationusingprobabilistichybridmodels |