UAV's air combat decision-making based on deep deterministic policy gradient and prediction

To solve the enemy uncertain manipulation problem during a UAV's autonomous air combat maneuver decision-making, this paper proposes an autonomous air combat maneuver decision-making method that combines target maneuver command prediction with the deep deterministic policy algorithm. The situat...

Full description

Bibliographic Details
Main Authors: LI Yongfeng, LYU Yongxi, SHI Jingping, LI Weihua
Format: Article
Language:zho
Published: EDP Sciences 2023-02-01
Series:Xibei Gongye Daxue Xuebao
Subjects:
Online Access:https://www.jnwpu.org/articles/jnwpu/full_html/2023/01/jnwpu2023411p56/jnwpu2023411p56.html
_version_ 1797426454154182656
author LI Yongfeng
LYU Yongxi
SHI Jingping
LI Weihua
author_facet LI Yongfeng
LYU Yongxi
SHI Jingping
LI Weihua
author_sort LI Yongfeng
collection DOAJ
description To solve the enemy uncertain manipulation problem during a UAV's autonomous air combat maneuver decision-making, this paper proposes an autonomous air combat maneuver decision-making method that combines target maneuver command prediction with the deep deterministic policy algorithm. The situation data of both sides of air combat are effectively fused and processed, the UAV's six-degree-of-freedom model and maneuver library are built. In air combat, the target generates its corresponding maneuver library instructions through the deep Q network algorithm; at the same time, the UAV on our side gives the target maneuver prediction results through the probabilistic neural network. A deep deterministic policy gradient reinforcement learning method that considers both the situation information of two aircraft and the prediction results of enemy aircraft is proposed, so that the UAV can choose the appropriate maneuver decision according to the current air combat situation. The simulation results show that the method can effectively use the air combat situation information and target maneuver prediction information so that it can improve the effectiveness of the reinforcement learning method for UAV's autonomous air combat decision-making on the premise of ensuring convergence.
first_indexed 2024-03-09T08:31:17Z
format Article
id doaj.art-1d5931f4cbc84ea1ba6fc387662d2a05
institution Directory Open Access Journal
issn 1000-2758
2609-7125
language zho
last_indexed 2024-03-09T08:31:17Z
publishDate 2023-02-01
publisher EDP Sciences
record_format Article
series Xibei Gongye Daxue Xuebao
spelling doaj.art-1d5931f4cbc84ea1ba6fc387662d2a052023-12-02T19:50:06ZzhoEDP SciencesXibei Gongye Daxue Xuebao1000-27582609-71252023-02-01411566410.1051/jnwpu/20234110056jnwpu2023411p56UAV's air combat decision-making based on deep deterministic policy gradient and predictionLI Yongfeng0LYU Yongxi1SHI Jingping2LI Weihua3School of Automation, Northwestern Polytechnical UniversitySchool of Automation, Northwestern Polytechnical UniversitySchool of Automation, Northwestern Polytechnical UniversitySchool of Automation, Northwestern Polytechnical UniversityTo solve the enemy uncertain manipulation problem during a UAV's autonomous air combat maneuver decision-making, this paper proposes an autonomous air combat maneuver decision-making method that combines target maneuver command prediction with the deep deterministic policy algorithm. The situation data of both sides of air combat are effectively fused and processed, the UAV's six-degree-of-freedom model and maneuver library are built. In air combat, the target generates its corresponding maneuver library instructions through the deep Q network algorithm; at the same time, the UAV on our side gives the target maneuver prediction results through the probabilistic neural network. A deep deterministic policy gradient reinforcement learning method that considers both the situation information of two aircraft and the prediction results of enemy aircraft is proposed, so that the UAV can choose the appropriate maneuver decision according to the current air combat situation. The simulation results show that the method can effectively use the air combat situation information and target maneuver prediction information so that it can improve the effectiveness of the reinforcement learning method for UAV's autonomous air combat decision-making on the premise of ensuring convergence.https://www.jnwpu.org/articles/jnwpu/full_html/2023/01/jnwpu2023411p56/jnwpu2023411p56.html无人机空战机动决策预测深度确定性策略梯度
spellingShingle LI Yongfeng
LYU Yongxi
SHI Jingping
LI Weihua
UAV's air combat decision-making based on deep deterministic policy gradient and prediction
Xibei Gongye Daxue Xuebao
无人机
空战机动决策
预测
深度确定性策略梯度
title UAV's air combat decision-making based on deep deterministic policy gradient and prediction
title_full UAV's air combat decision-making based on deep deterministic policy gradient and prediction
title_fullStr UAV's air combat decision-making based on deep deterministic policy gradient and prediction
title_full_unstemmed UAV's air combat decision-making based on deep deterministic policy gradient and prediction
title_short UAV's air combat decision-making based on deep deterministic policy gradient and prediction
title_sort uav s air combat decision making based on deep deterministic policy gradient and prediction
topic 无人机
空战机动决策
预测
深度确定性策略梯度
url https://www.jnwpu.org/articles/jnwpu/full_html/2023/01/jnwpu2023411p56/jnwpu2023411p56.html
work_keys_str_mv AT liyongfeng uavsaircombatdecisionmakingbasedondeepdeterministicpolicygradientandprediction
AT lyuyongxi uavsaircombatdecisionmakingbasedondeepdeterministicpolicygradientandprediction
AT shijingping uavsaircombatdecisionmakingbasedondeepdeterministicpolicygradientandprediction
AT liweihua uavsaircombatdecisionmakingbasedondeepdeterministicpolicygradientandprediction