Generalization strategy design of UAVs pursuit evasion game based on DDPG
UAVs pursuit evasion game is a research hotspot in the field of air combat. Traditional solutions have many limitations to this problem, such as the difficulty of the model to adapt to complex dynamic environments to quickly make decisions, and the poor generalization of different mission scenarios....
Main Authors: | , , |
---|---|
Format: | Article |
Language: | zho |
Published: |
EDP Sciences
2022-02-01
|
Series: | Xibei Gongye Daxue Xuebao |
Subjects: | |
Online Access: | https://www.jnwpu.org/articles/jnwpu/full_html/2022/01/jnwpu2022401p47/jnwpu2022401p47.html |
_version_ | 1797641794978054144 |
---|---|
author | FU Xiaowei XU Zhe WANG Hui |
author_facet | FU Xiaowei XU Zhe WANG Hui |
author_sort | FU Xiaowei |
collection | DOAJ |
description | UAVs pursuit evasion game is a research hotspot in the field of air combat. Traditional solutions have many limitations to this problem, such as the difficulty of the model to adapt to complex dynamic environments to quickly make decisions, and the poor generalization of different mission scenarios. Based on the DDPG(deep deterministic policy gradient) algorithm, a mathematical model of UAVs pursuit and evasion countermeasures is established in this paper. On this basis, this research designs a variety of countermaneuver strategies for escaping UAV, and uses the training method of course learning ideas. In the training process, the intelligence of the escaping UAV is gradually improved, so as to progressively train the confrontation strategy of the chasing UAV. The simulation results show that compared with direct training, the pursuit strategy of the chasing UAV trained by the research method of course learning can converge faster, and can better perform the hunting mission of enemy aircraft, and can be applied to a variety of enemy aircraft with a variety of maneuvering strategies, which effectively improved the generalization of the UAV′s pursuit and escape confrontation decision model. |
first_indexed | 2024-03-11T13:50:50Z |
format | Article |
id | doaj.art-095e719d7e3544518ac5d64be588e42b |
institution | Directory Open Access Journal |
issn | 1000-2758 2609-7125 |
language | zho |
last_indexed | 2024-03-11T13:50:50Z |
publishDate | 2022-02-01 |
publisher | EDP Sciences |
record_format | Article |
series | Xibei Gongye Daxue Xuebao |
spelling | doaj.art-095e719d7e3544518ac5d64be588e42b2023-11-02T08:57:17ZzhoEDP SciencesXibei Gongye Daxue Xuebao1000-27582609-71252022-02-01401475510.1051/jnwpu/20224010047jnwpu2022401p47Generalization strategy design of UAVs pursuit evasion game based on DDPGFU Xiaowei0XU Zhe1WANG Hui2School of Electronics and Information, Northwestern Polytechnical UniversitySchool of Electronics and Information, Northwestern Polytechnical UniversitySchool of Electronics and Information, Northwestern Polytechnical UniversityUAVs pursuit evasion game is a research hotspot in the field of air combat. Traditional solutions have many limitations to this problem, such as the difficulty of the model to adapt to complex dynamic environments to quickly make decisions, and the poor generalization of different mission scenarios. Based on the DDPG(deep deterministic policy gradient) algorithm, a mathematical model of UAVs pursuit and evasion countermeasures is established in this paper. On this basis, this research designs a variety of countermaneuver strategies for escaping UAV, and uses the training method of course learning ideas. In the training process, the intelligence of the escaping UAV is gradually improved, so as to progressively train the confrontation strategy of the chasing UAV. The simulation results show that compared with direct training, the pursuit strategy of the chasing UAV trained by the research method of course learning can converge faster, and can better perform the hunting mission of enemy aircraft, and can be applied to a variety of enemy aircraft with a variety of maneuvering strategies, which effectively improved the generalization of the UAV′s pursuit and escape confrontation decision model.https://www.jnwpu.org/articles/jnwpu/full_html/2022/01/jnwpu2022401p47/jnwpu2022401p47.htmluavpursuit-evasion gamedeep reinforcement learningddpgcurriculum learning |
spellingShingle | FU Xiaowei XU Zhe WANG Hui Generalization strategy design of UAVs pursuit evasion game based on DDPG Xibei Gongye Daxue Xuebao uav pursuit-evasion game deep reinforcement learning ddpg curriculum learning |
title | Generalization strategy design of UAVs pursuit evasion game based on DDPG |
title_full | Generalization strategy design of UAVs pursuit evasion game based on DDPG |
title_fullStr | Generalization strategy design of UAVs pursuit evasion game based on DDPG |
title_full_unstemmed | Generalization strategy design of UAVs pursuit evasion game based on DDPG |
title_short | Generalization strategy design of UAVs pursuit evasion game based on DDPG |
title_sort | generalization strategy design of uavs pursuit evasion game based on ddpg |
topic | uav pursuit-evasion game deep reinforcement learning ddpg curriculum learning |
url | https://www.jnwpu.org/articles/jnwpu/full_html/2022/01/jnwpu2022401p47/jnwpu2022401p47.html |
work_keys_str_mv | AT fuxiaowei generalizationstrategydesignofuavspursuitevasiongamebasedonddpg AT xuzhe generalizationstrategydesignofuavspursuitevasiongamebasedonddpg AT wanghui generalizationstrategydesignofuavspursuitevasiongamebasedonddpg |