Adversarial Actor-Critic Method for Task and Motion Planning Problems Using Planning Experience

We propose an actor-critic algorithm that uses past planning experience to improve the efficiency of solving robot task-and-motion planning (TAMP) problems. TAMP planners search for goal-achieving sequences of high-level operator instances specified by both discrete and continuous parameters. Our al...

Full description

Bibliographic Details
Main Authors: Kim, Beomjoon, Kaelbling, Leslie P, Lozano-Pérez, Tomás
Other Authors: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Format: Article
Published: Association for the Advancement of Artificial Intelligence (AAAI) 2021
Online Access:https://hdl.handle.net/1721.1/130053
_version_ 1826196412429238272
author Kim, Beomjoon
Kaelbling, Leslie P
Lozano-Pérez, Tomás
author2 Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
author_facet Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Kim, Beomjoon
Kaelbling, Leslie P
Lozano-Pérez, Tomás
author_sort Kim, Beomjoon
collection MIT
description We propose an actor-critic algorithm that uses past planning experience to improve the efficiency of solving robot task-and-motion planning (TAMP) problems. TAMP planners search for goal-achieving sequences of high-level operator instances specified by both discrete and continuous parameters. Our algorithm learns a policy for selecting the continuous parameters during search, using a small training set generated from the search trees of previously solved instances. We also introduce a novel fixed-length vector representation for world states with varying numbers of objects with different shapes, based on a set of key robot configurations. We demonstrate experimentally that our method learns more efficiently from less data than standard reinforcementlearning approaches and that using a learned policy to guide a planner results in the improvement of planning efficiency.
first_indexed 2024-09-23T10:26:27Z
format Article
id mit-1721.1/130053
institution Massachusetts Institute of Technology
last_indexed 2024-09-23T10:26:27Z
publishDate 2021
publisher Association for the Advancement of Artificial Intelligence (AAAI)
record_format dspace
spelling mit-1721.1/1300532021-09-10T19:55:28Z Adversarial Actor-Critic Method for Task and Motion Planning Problems Using Planning Experience Kim, Beomjoon Kaelbling, Leslie P Lozano-Pérez, Tomás Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science We propose an actor-critic algorithm that uses past planning experience to improve the efficiency of solving robot task-and-motion planning (TAMP) problems. TAMP planners search for goal-achieving sequences of high-level operator instances specified by both discrete and continuous parameters. Our algorithm learns a policy for selecting the continuous parameters during search, using a small training set generated from the search trees of previously solved instances. We also introduce a novel fixed-length vector representation for world states with varying numbers of objects with different shapes, based on a set of key robot configurations. We demonstrate experimentally that our method learns more efficiently from less data than standard reinforcementlearning approaches and that using a learned policy to guide a planner results in the improvement of planning efficiency. NSF (Grants 1523767 and 1723381) AFOSR (Grant FA9550-17-1-0165) ONR (Grant N00014-18-1-2847) 2021-03-02T19:31:01Z 2021-03-02T19:31:01Z 2019-07 Article http://purl.org/eprint/type/ConferencePaper 2374-3468 2159-5399 https://hdl.handle.net/1721.1/130053 Kim, Beomjoon et al. "Adversarial Actor-Critic Method for Task and Motion Planning Problems Using Planning Experience." Proceedings of the AAAI Conference on Artificial Intelligence 33, 1 (July 2019): 8017-8024 © 2019 Association for the Advancement of Artificial Intelligence http://dx.doi.org/10.1609/aaai.v33i01.33018017 Proceedings of the AAAI Conference on Artificial Intelligence Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by-nc-sa/4.0/ application/pdf Association for the Advancement of Artificial Intelligence (AAAI) MIT web domain
spellingShingle Kim, Beomjoon
Kaelbling, Leslie P
Lozano-Pérez, Tomás
Adversarial Actor-Critic Method for Task and Motion Planning Problems Using Planning Experience
title Adversarial Actor-Critic Method for Task and Motion Planning Problems Using Planning Experience
title_full Adversarial Actor-Critic Method for Task and Motion Planning Problems Using Planning Experience
title_fullStr Adversarial Actor-Critic Method for Task and Motion Planning Problems Using Planning Experience
title_full_unstemmed Adversarial Actor-Critic Method for Task and Motion Planning Problems Using Planning Experience
title_short Adversarial Actor-Critic Method for Task and Motion Planning Problems Using Planning Experience
title_sort adversarial actor critic method for task and motion planning problems using planning experience
url https://hdl.handle.net/1721.1/130053
work_keys_str_mv AT kimbeomjoon adversarialactorcriticmethodfortaskandmotionplanningproblemsusingplanningexperience
AT kaelblinglesliep adversarialactorcriticmethodfortaskandmotionplanningproblemsusingplanningexperience
AT lozanopereztomas adversarialactorcriticmethodfortaskandmotionplanningproblemsusingplanningexperience