Generalization strategy design of UAVs pursuit evasion game based on DDPG

UAVs pursuit evasion game is a research hotspot in the field of air combat. Traditional solutions have many limitations to this problem, such as the difficulty of the model to adapt to complex dynamic environments to quickly make decisions, and the poor generalization of different mission scenarios....

Full description

Bibliographic Details
Main Authors: FU Xiaowei, XU Zhe, WANG Hui
Format: Article
Language:zho
Published: EDP Sciences 2022-02-01
Series:Xibei Gongye Daxue Xuebao
Subjects:
Online Access:https://www.jnwpu.org/articles/jnwpu/full_html/2022/01/jnwpu2022401p47/jnwpu2022401p47.html
_version_ 1797641794978054144
author FU Xiaowei
XU Zhe
WANG Hui
author_facet FU Xiaowei
XU Zhe
WANG Hui
author_sort FU Xiaowei
collection DOAJ
description UAVs pursuit evasion game is a research hotspot in the field of air combat. Traditional solutions have many limitations to this problem, such as the difficulty of the model to adapt to complex dynamic environments to quickly make decisions, and the poor generalization of different mission scenarios. Based on the DDPG(deep deterministic policy gradient) algorithm, a mathematical model of UAVs pursuit and evasion countermeasures is established in this paper. On this basis, this research designs a variety of countermaneuver strategies for escaping UAV, and uses the training method of course learning ideas. In the training process, the intelligence of the escaping UAV is gradually improved, so as to progressively train the confrontation strategy of the chasing UAV. The simulation results show that compared with direct training, the pursuit strategy of the chasing UAV trained by the research method of course learning can converge faster, and can better perform the hunting mission of enemy aircraft, and can be applied to a variety of enemy aircraft with a variety of maneuvering strategies, which effectively improved the generalization of the UAV′s pursuit and escape confrontation decision model.
first_indexed 2024-03-11T13:50:50Z
format Article
id doaj.art-095e719d7e3544518ac5d64be588e42b
institution Directory Open Access Journal
issn 1000-2758
2609-7125
language zho
last_indexed 2024-03-11T13:50:50Z
publishDate 2022-02-01
publisher EDP Sciences
record_format Article
series Xibei Gongye Daxue Xuebao
spelling doaj.art-095e719d7e3544518ac5d64be588e42b2023-11-02T08:57:17ZzhoEDP SciencesXibei Gongye Daxue Xuebao1000-27582609-71252022-02-01401475510.1051/jnwpu/20224010047jnwpu2022401p47Generalization strategy design of UAVs pursuit evasion game based on DDPGFU Xiaowei0XU Zhe1WANG Hui2School of Electronics and Information, Northwestern Polytechnical UniversitySchool of Electronics and Information, Northwestern Polytechnical UniversitySchool of Electronics and Information, Northwestern Polytechnical UniversityUAVs pursuit evasion game is a research hotspot in the field of air combat. Traditional solutions have many limitations to this problem, such as the difficulty of the model to adapt to complex dynamic environments to quickly make decisions, and the poor generalization of different mission scenarios. Based on the DDPG(deep deterministic policy gradient) algorithm, a mathematical model of UAVs pursuit and evasion countermeasures is established in this paper. On this basis, this research designs a variety of countermaneuver strategies for escaping UAV, and uses the training method of course learning ideas. In the training process, the intelligence of the escaping UAV is gradually improved, so as to progressively train the confrontation strategy of the chasing UAV. The simulation results show that compared with direct training, the pursuit strategy of the chasing UAV trained by the research method of course learning can converge faster, and can better perform the hunting mission of enemy aircraft, and can be applied to a variety of enemy aircraft with a variety of maneuvering strategies, which effectively improved the generalization of the UAV′s pursuit and escape confrontation decision model.https://www.jnwpu.org/articles/jnwpu/full_html/2022/01/jnwpu2022401p47/jnwpu2022401p47.htmluavpursuit-evasion gamedeep reinforcement learningddpgcurriculum learning
spellingShingle FU Xiaowei
XU Zhe
WANG Hui
Generalization strategy design of UAVs pursuit evasion game based on DDPG
Xibei Gongye Daxue Xuebao
uav
pursuit-evasion game
deep reinforcement learning
ddpg
curriculum learning
title Generalization strategy design of UAVs pursuit evasion game based on DDPG
title_full Generalization strategy design of UAVs pursuit evasion game based on DDPG
title_fullStr Generalization strategy design of UAVs pursuit evasion game based on DDPG
title_full_unstemmed Generalization strategy design of UAVs pursuit evasion game based on DDPG
title_short Generalization strategy design of UAVs pursuit evasion game based on DDPG
title_sort generalization strategy design of uavs pursuit evasion game based on ddpg
topic uav
pursuit-evasion game
deep reinforcement learning
ddpg
curriculum learning
url https://www.jnwpu.org/articles/jnwpu/full_html/2022/01/jnwpu2022401p47/jnwpu2022401p47.html
work_keys_str_mv AT fuxiaowei generalizationstrategydesignofuavspursuitevasiongamebasedonddpg
AT xuzhe generalizationstrategydesignofuavspursuitevasiongamebasedonddpg
AT wanghui generalizationstrategydesignofuavspursuitevasiongamebasedonddpg