Intelligent Cooperative Control Architecture: A Framework for Performance Improvement Using Safe Learning

Planning for multi-agent systems such as task assignment for teams of limited-fuel unmanned aerial vehicles (UAVs) is challenging due to uncertainties in the assumed models and the very large size of the planning space. Researchers have developed fast cooperative planners based on simple models (e.g...

Full description

Bibliographic Details
Main Authors:	Geramifard, Alborz, Redding, Joshua, How, Jonathan P.
Other Authors:	Massachusetts Institute of Technology. Department of Aeronautics and Astronautics
Format:	Article
Language:	en_US
Published:	Springer-Verlag 2013
Online Access:	http://hdl.handle.net/1721.1/81483 https://orcid.org/0000-0002-2508-1957 https://orcid.org/0000-0001-8576-1930

_version_	1826209242210631680
author	Geramifard, Alborz Redding, Joshua How, Jonathan P.
author2	Massachusetts Institute of Technology. Department of Aeronautics and Astronautics
author_facet	Massachusetts Institute of Technology. Department of Aeronautics and Astronautics Geramifard, Alborz Redding, Joshua How, Jonathan P.
author_sort	Geramifard, Alborz
collection	MIT
description	Planning for multi-agent systems such as task assignment for teams of limited-fuel unmanned aerial vehicles (UAVs) is challenging due to uncertainties in the assumed models and the very large size of the planning space. Researchers have developed fast cooperative planners based on simple models (e.g., linear and deterministic dynamics), yet inaccuracies in assumed models will impact the resulting performance. Learning techniques are capable of adapting the model and providing better policies asymptotically compared to cooperative planners, yet they often violate the safety conditions of the system due to their exploratory nature. Moreover they frequently require an impractically large number of interactions to perform well. This paper introduces the intelligent Cooperative Control Architecture (iCCA) as a framework for combining cooperative planners and reinforcement learning techniques. iCCA improves the policy of the cooperative planner, while reduces the risk and sample complexity of the learner. Empirical results in gridworld and task assignment for fuel-limited UAV domains with problem sizes up to 9 billion state-action pairs verify the advantage of iCCA over pure learning and planning strategies.
first_indexed	2024-09-23T14:19:44Z
format	Article
id	mit-1721.1/81483
institution	Massachusetts Institute of Technology
language	en_US
last_indexed	2024-09-23T14:19:44Z
publishDate	2013
publisher	Springer-Verlag
record_format	dspace
spelling	mit-1721.1/814832022-10-01T20:36:51Z Intelligent Cooperative Control Architecture: A Framework for Performance Improvement Using Safe Learning Geramifard, Alborz Redding, Joshua How, Jonathan P. Massachusetts Institute of Technology. Department of Aeronautics and Astronautics Massachusetts Institute of Technology. Laboratory for Information and Decision Systems Geramifard, Alborz How, Jonathan P. Planning for multi-agent systems such as task assignment for teams of limited-fuel unmanned aerial vehicles (UAVs) is challenging due to uncertainties in the assumed models and the very large size of the planning space. Researchers have developed fast cooperative planners based on simple models (e.g., linear and deterministic dynamics), yet inaccuracies in assumed models will impact the resulting performance. Learning techniques are capable of adapting the model and providing better policies asymptotically compared to cooperative planners, yet they often violate the safety conditions of the system due to their exploratory nature. Moreover they frequently require an impractically large number of interactions to perform well. This paper introduces the intelligent Cooperative Control Architecture (iCCA) as a framework for combining cooperative planners and reinforcement learning techniques. iCCA improves the policy of the cooperative planner, while reduces the risk and sample complexity of the learner. Empirical results in gridworld and task assignment for fuel-limited UAV domains with problem sizes up to 9 billion state-action pairs verify the advantage of iCCA over pure learning and planning strategies. 2013-10-23T16:06:34Z 2013-10-23T16:06:34Z 2013-03 2012-08 Article http://purl.org/eprint/type/JournalArticle 0921-0296 1573-0409 http://hdl.handle.net/1721.1/81483 Geramifard, Alborz, Joshua Redding, and Jonathan P. How. “Intelligent Cooperative Control Architecture: A Framework for Performance Improvement Using Safe Learning.” Journal of Intelligent & Robotic Systems 72, no. 1 (October 13, 2013): 83-103. https://orcid.org/0000-0002-2508-1957 https://orcid.org/0000-0001-8576-1930 en_US http://dx.doi.org/10.1007/s10846-013-9826-6 Journal of Intelligent & Robotic Systems Creative Commons Attribution-Noncommercial-Share Alike 3.0 http://creativecommons.org/licenses/by-nc-sa/3.0/ application/pdf Springer-Verlag MIT web domain
spellingShingle	Geramifard, Alborz Redding, Joshua How, Jonathan P. Intelligent Cooperative Control Architecture: A Framework for Performance Improvement Using Safe Learning
title	Intelligent Cooperative Control Architecture: A Framework for Performance Improvement Using Safe Learning
title_full	Intelligent Cooperative Control Architecture: A Framework for Performance Improvement Using Safe Learning
title_fullStr	Intelligent Cooperative Control Architecture: A Framework for Performance Improvement Using Safe Learning
title_full_unstemmed	Intelligent Cooperative Control Architecture: A Framework for Performance Improvement Using Safe Learning
title_short	Intelligent Cooperative Control Architecture: A Framework for Performance Improvement Using Safe Learning
title_sort	intelligent cooperative control architecture a framework for performance improvement using safe learning
url	http://hdl.handle.net/1721.1/81483 https://orcid.org/0000-0002-2508-1957 https://orcid.org/0000-0001-8576-1930
work_keys_str_mv	AT geramifardalborz intelligentcooperativecontrolarchitectureaframeworkforperformanceimprovementusingsafelearning AT reddingjoshua intelligentcooperativecontrolarchitectureaframeworkforperformanceimprovementusingsafelearning AT howjonathanp intelligentcooperativecontrolarchitectureaframeworkforperformanceimprovementusingsafelearning

Intelligent Cooperative Control Architecture: A Framework for Performance Improvement Using Safe Learning

Similar Items