UAV's air combat decision-making based on deep deterministic policy gradient and prediction
To solve the enemy uncertain manipulation problem during a UAV's autonomous air combat maneuver decision-making, this paper proposes an autonomous air combat maneuver decision-making method that combines target maneuver command prediction with the deep deterministic policy algorithm. The situat...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | zho |
Published: |
EDP Sciences
2023-02-01
|
Series: | Xibei Gongye Daxue Xuebao |
Subjects: | |
Online Access: | https://www.jnwpu.org/articles/jnwpu/full_html/2023/01/jnwpu2023411p56/jnwpu2023411p56.html |
_version_ | 1797426454154182656 |
---|---|
author | LI Yongfeng LYU Yongxi SHI Jingping LI Weihua |
author_facet | LI Yongfeng LYU Yongxi SHI Jingping LI Weihua |
author_sort | LI Yongfeng |
collection | DOAJ |
description | To solve the enemy uncertain manipulation problem during a UAV's autonomous air combat maneuver decision-making, this paper proposes an autonomous air combat maneuver decision-making method that combines target maneuver command prediction with the deep deterministic policy algorithm. The situation data of both sides of air combat are effectively fused and processed, the UAV's six-degree-of-freedom model and maneuver library are built. In air combat, the target generates its corresponding maneuver library instructions through the deep Q network algorithm; at the same time, the UAV on our side gives the target maneuver prediction results through the probabilistic neural network. A deep deterministic policy gradient reinforcement learning method that considers both the situation information of two aircraft and the prediction results of enemy aircraft is proposed, so that the UAV can choose the appropriate maneuver decision according to the current air combat situation. The simulation results show that the method can effectively use the air combat situation information and target maneuver prediction information so that it can improve the effectiveness of the reinforcement learning method for UAV's autonomous air combat decision-making on the premise of ensuring convergence. |
first_indexed | 2024-03-09T08:31:17Z |
format | Article |
id | doaj.art-1d5931f4cbc84ea1ba6fc387662d2a05 |
institution | Directory Open Access Journal |
issn | 1000-2758 2609-7125 |
language | zho |
last_indexed | 2024-03-09T08:31:17Z |
publishDate | 2023-02-01 |
publisher | EDP Sciences |
record_format | Article |
series | Xibei Gongye Daxue Xuebao |
spelling | doaj.art-1d5931f4cbc84ea1ba6fc387662d2a052023-12-02T19:50:06ZzhoEDP SciencesXibei Gongye Daxue Xuebao1000-27582609-71252023-02-01411566410.1051/jnwpu/20234110056jnwpu2023411p56UAV's air combat decision-making based on deep deterministic policy gradient and predictionLI Yongfeng0LYU Yongxi1SHI Jingping2LI Weihua3School of Automation, Northwestern Polytechnical UniversitySchool of Automation, Northwestern Polytechnical UniversitySchool of Automation, Northwestern Polytechnical UniversitySchool of Automation, Northwestern Polytechnical UniversityTo solve the enemy uncertain manipulation problem during a UAV's autonomous air combat maneuver decision-making, this paper proposes an autonomous air combat maneuver decision-making method that combines target maneuver command prediction with the deep deterministic policy algorithm. The situation data of both sides of air combat are effectively fused and processed, the UAV's six-degree-of-freedom model and maneuver library are built. In air combat, the target generates its corresponding maneuver library instructions through the deep Q network algorithm; at the same time, the UAV on our side gives the target maneuver prediction results through the probabilistic neural network. A deep deterministic policy gradient reinforcement learning method that considers both the situation information of two aircraft and the prediction results of enemy aircraft is proposed, so that the UAV can choose the appropriate maneuver decision according to the current air combat situation. The simulation results show that the method can effectively use the air combat situation information and target maneuver prediction information so that it can improve the effectiveness of the reinforcement learning method for UAV's autonomous air combat decision-making on the premise of ensuring convergence.https://www.jnwpu.org/articles/jnwpu/full_html/2023/01/jnwpu2023411p56/jnwpu2023411p56.html无人机空战机动决策预测深度确定性策略梯度 |
spellingShingle | LI Yongfeng LYU Yongxi SHI Jingping LI Weihua UAV's air combat decision-making based on deep deterministic policy gradient and prediction Xibei Gongye Daxue Xuebao 无人机 空战机动决策 预测 深度确定性策略梯度 |
title | UAV's air combat decision-making based on deep deterministic policy gradient and prediction |
title_full | UAV's air combat decision-making based on deep deterministic policy gradient and prediction |
title_fullStr | UAV's air combat decision-making based on deep deterministic policy gradient and prediction |
title_full_unstemmed | UAV's air combat decision-making based on deep deterministic policy gradient and prediction |
title_short | UAV's air combat decision-making based on deep deterministic policy gradient and prediction |
title_sort | uav s air combat decision making based on deep deterministic policy gradient and prediction |
topic | 无人机 空战机动决策 预测 深度确定性策略梯度 |
url | https://www.jnwpu.org/articles/jnwpu/full_html/2023/01/jnwpu2023411p56/jnwpu2023411p56.html |
work_keys_str_mv | AT liyongfeng uavsaircombatdecisionmakingbasedondeepdeterministicpolicygradientandprediction AT lyuyongxi uavsaircombatdecisionmakingbasedondeepdeterministicpolicygradientandprediction AT shijingping uavsaircombatdecisionmakingbasedondeepdeterministicpolicygradientandprediction AT liweihua uavsaircombatdecisionmakingbasedondeepdeterministicpolicygradientandprediction |