Collaborative Decision-Making Method for Multi-UAV Based on Multiagent Reinforcement Learning

The collaborative mission capability of multi-UAV has received more and more attention in recent years as the research on multi-UAV theories and applications has intensified. The artificial intelligence technology integrated into the multi-UAV collaborative decision-making system can effectively imp...

Full description

Bibliographic Details
Main Authors: Shaowei Li, Yuhong Jia, Fan Yang, Qingyang Qin, Hui Gao, Yaoming Zhou
Format: Article
Language:English
Published: IEEE 2022-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9857929/
_version_ 1828420289135902720
author Shaowei Li
Yuhong Jia
Fan Yang
Qingyang Qin
Hui Gao
Yaoming Zhou
author_facet Shaowei Li
Yuhong Jia
Fan Yang
Qingyang Qin
Hui Gao
Yaoming Zhou
author_sort Shaowei Li
collection DOAJ
description The collaborative mission capability of multi-UAV has received more and more attention in recent years as the research on multi-UAV theories and applications has intensified. The artificial intelligence technology integrated into the multi-UAV collaborative decision-making system can effectively improve the collaborative mission capability of multi-UAV. We propose a multi-agent reinforcement learning algorithm for multi-UAV collaborative decision-making. Our approach is based on the actor-critic algorithm, where each UAV is treated as an actor that collects data decentralized in the environment. A centralized critic provides evaluation information for each training step during the centralized training of these actors. We introduce a gate recurrent unit in the actor to enable the UAV to make reasonable decisions concerning historical decision information. Moreover, we use an attention mechanism to design the centralized critic, which can achieve better learning in a complex environment. Finally, the algorithm is trained and experimented in a multi-UAV air combat scenario developed in the collaborative decision-making environment. The experimental results show that our approach can learn collaborative decision-making strategies with excellent performance, while convergence performance is better compared to other algorithms.
first_indexed 2024-12-10T15:09:38Z
format Article
id doaj.art-da74bbf055594857b9d5af1cb7248d72
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-10T15:09:38Z
publishDate 2022-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-da74bbf055594857b9d5af1cb7248d722022-12-22T01:43:59ZengIEEEIEEE Access2169-35362022-01-0110913859139610.1109/ACCESS.2022.31990709857929Collaborative Decision-Making Method for Multi-UAV Based on Multiagent Reinforcement LearningShaowei Li0https://orcid.org/0000-0002-8168-9916Yuhong Jia1Fan Yang2Qingyang Qin3Hui Gao4Yaoming Zhou5https://orcid.org/0000-0003-0410-327XSchool of Aeronautic Science and Engineering, Beihang University, Beijing, ChinaSchool of Aeronautic Science and Engineering, Beihang University, Beijing, ChinaSchool of Aeronautic Science and Engineering, Beihang University, Beijing, ChinaSchool of Aeronautic Science and Engineering, Beihang University, Beijing, ChinaSchool of Aeronautic Science and Engineering, Beihang University, Beijing, ChinaSchool of Aeronautic Science and Engineering, Beihang University, Beijing, ChinaThe collaborative mission capability of multi-UAV has received more and more attention in recent years as the research on multi-UAV theories and applications has intensified. The artificial intelligence technology integrated into the multi-UAV collaborative decision-making system can effectively improve the collaborative mission capability of multi-UAV. We propose a multi-agent reinforcement learning algorithm for multi-UAV collaborative decision-making. Our approach is based on the actor-critic algorithm, where each UAV is treated as an actor that collects data decentralized in the environment. A centralized critic provides evaluation information for each training step during the centralized training of these actors. We introduce a gate recurrent unit in the actor to enable the UAV to make reasonable decisions concerning historical decision information. Moreover, we use an attention mechanism to design the centralized critic, which can achieve better learning in a complex environment. Finally, the algorithm is trained and experimented in a multi-UAV air combat scenario developed in the collaborative decision-making environment. The experimental results show that our approach can learn collaborative decision-making strategies with excellent performance, while convergence performance is better compared to other algorithms.https://ieeexplore.ieee.org/document/9857929/UAVmulti-UAVcollaborative decision-makingmulti-agent reinforcement learning
spellingShingle Shaowei Li
Yuhong Jia
Fan Yang
Qingyang Qin
Hui Gao
Yaoming Zhou
Collaborative Decision-Making Method for Multi-UAV Based on Multiagent Reinforcement Learning
IEEE Access
UAV
multi-UAV
collaborative decision-making
multi-agent reinforcement learning
title Collaborative Decision-Making Method for Multi-UAV Based on Multiagent Reinforcement Learning
title_full Collaborative Decision-Making Method for Multi-UAV Based on Multiagent Reinforcement Learning
title_fullStr Collaborative Decision-Making Method for Multi-UAV Based on Multiagent Reinforcement Learning
title_full_unstemmed Collaborative Decision-Making Method for Multi-UAV Based on Multiagent Reinforcement Learning
title_short Collaborative Decision-Making Method for Multi-UAV Based on Multiagent Reinforcement Learning
title_sort collaborative decision making method for multi uav based on multiagent reinforcement learning
topic UAV
multi-UAV
collaborative decision-making
multi-agent reinforcement learning
url https://ieeexplore.ieee.org/document/9857929/
work_keys_str_mv AT shaoweili collaborativedecisionmakingmethodformultiuavbasedonmultiagentreinforcementlearning
AT yuhongjia collaborativedecisionmakingmethodformultiuavbasedonmultiagentreinforcementlearning
AT fanyang collaborativedecisionmakingmethodformultiuavbasedonmultiagentreinforcementlearning
AT qingyangqin collaborativedecisionmakingmethodformultiuavbasedonmultiagentreinforcementlearning
AT huigao collaborativedecisionmakingmethodformultiuavbasedonmultiagentreinforcementlearning
AT yaomingzhou collaborativedecisionmakingmethodformultiuavbasedonmultiagentreinforcementlearning