Long-run multi-robot planning under uncertain action durations for persistent tasks

This paper presents an approach for multi-robot long-term planning under uncertainty over the duration of actions. The proposed methodology takes advantage of generalized stochastic Petri nets with rewards (GSPNR) to model multi-robot problems. A GSPNR allows for unified modeling of action selection...

Szczegółowa specyfikacja

Opis bibliograficzny
Główni autorzy: Azevedo, C, Lacerda, B, Hawes, N, Lima, P
Format: Conference item
Język:English
Wydane: IEEE 2021
_version_ 1826288459892916224
author Azevedo, C
Lacerda, B
Hawes, N
Lima, P
author_facet Azevedo, C
Lacerda, B
Hawes, N
Lima, P
author_sort Azevedo, C
collection OXFORD
description This paper presents an approach for multi-robot long-term planning under uncertainty over the duration of actions. The proposed methodology takes advantage of generalized stochastic Petri nets with rewards (GSPNR) to model multi-robot problems. A GSPNR allows for unified modeling of action selection, uncertainty on the duration of action execution, and for goal specification through the use of transition rewards and rewards per time unit. Our approach relies on the interpretation of the GSPNR model as an equivalent embedded Markov reward automaton (MRA). We then build on a state-of-the-art method to compute the long-run average reward over MRAs, extending it to enable the extraction of the optimal policy. We provide an empirical evaluation of the proposed approach on a simulated multi-robot monitoring problem, evaluating its performance and scalability. The results show that the synthesized policy outperforms a policy obtained from an infinite horizon discounted reward formulation as well as a carefully hand-crafted policy.
first_indexed 2024-03-07T02:14:01Z
format Conference item
id oxford-uuid:a19cd643-0e9a-42ee-9cc9-82487b69d30c
institution University of Oxford
language English
last_indexed 2024-03-07T02:14:01Z
publishDate 2021
publisher IEEE
record_format dspace
spelling oxford-uuid:a19cd643-0e9a-42ee-9cc9-82487b69d30c2022-03-27T02:14:24ZLong-run multi-robot planning under uncertain action durations for persistent tasksConference itemhttp://purl.org/coar/resource_type/c_5794uuid:a19cd643-0e9a-42ee-9cc9-82487b69d30cEnglishSymplectic ElementsIEEE2021Azevedo, CLacerda, BHawes, NLima, PThis paper presents an approach for multi-robot long-term planning under uncertainty over the duration of actions. The proposed methodology takes advantage of generalized stochastic Petri nets with rewards (GSPNR) to model multi-robot problems. A GSPNR allows for unified modeling of action selection, uncertainty on the duration of action execution, and for goal specification through the use of transition rewards and rewards per time unit. Our approach relies on the interpretation of the GSPNR model as an equivalent embedded Markov reward automaton (MRA). We then build on a state-of-the-art method to compute the long-run average reward over MRAs, extending it to enable the extraction of the optimal policy. We provide an empirical evaluation of the proposed approach on a simulated multi-robot monitoring problem, evaluating its performance and scalability. The results show that the synthesized policy outperforms a policy obtained from an infinite horizon discounted reward formulation as well as a carefully hand-crafted policy.
spellingShingle Azevedo, C
Lacerda, B
Hawes, N
Lima, P
Long-run multi-robot planning under uncertain action durations for persistent tasks
title Long-run multi-robot planning under uncertain action durations for persistent tasks
title_full Long-run multi-robot planning under uncertain action durations for persistent tasks
title_fullStr Long-run multi-robot planning under uncertain action durations for persistent tasks
title_full_unstemmed Long-run multi-robot planning under uncertain action durations for persistent tasks
title_short Long-run multi-robot planning under uncertain action durations for persistent tasks
title_sort long run multi robot planning under uncertain action durations for persistent tasks
work_keys_str_mv AT azevedoc longrunmultirobotplanningunderuncertainactiondurationsforpersistenttasks
AT lacerdab longrunmultirobotplanningunderuncertainactiondurationsforpersistenttasks
AT hawesn longrunmultirobotplanningunderuncertainactiondurationsforpersistenttasks
AT limap longrunmultirobotplanningunderuncertainactiondurationsforpersistenttasks