UAV's air combat decision-making based on deep deterministic policy gradient and prediction

To solve the enemy uncertain manipulation problem during a UAV's autonomous air combat maneuver decision-making, this paper proposes an autonomous air combat maneuver decision-making method that combines target maneuver command prediction with the deep deterministic policy algorithm. The situat...

Full description

Bibliographic Details
Main Authors:	LI Yongfeng, LYU Yongxi, SHI Jingping, LI Weihua
Format:	Article
Language:	zho
Published:	EDP Sciences 2023-02-01
Series:	Xibei Gongye Daxue Xuebao
Subjects:	无人机空战机动决策预测深度确定性策略梯度
Online Access:	https://www.jnwpu.org/articles/jnwpu/full_html/2023/01/jnwpu2023411p56/jnwpu2023411p56.html

_version_	1797426454154182656
author	LI Yongfeng LYU Yongxi SHI Jingping LI Weihua
author_facet	LI Yongfeng LYU Yongxi SHI Jingping LI Weihua
author_sort	LI Yongfeng
collection	DOAJ
description	To solve the enemy uncertain manipulation problem during a UAV's autonomous air combat maneuver decision-making, this paper proposes an autonomous air combat maneuver decision-making method that combines target maneuver command prediction with the deep deterministic policy algorithm. The situation data of both sides of air combat are effectively fused and processed, the UAV's six-degree-of-freedom model and maneuver library are built. In air combat, the target generates its corresponding maneuver library instructions through the deep Q network algorithm; at the same time, the UAV on our side gives the target maneuver prediction results through the probabilistic neural network. A deep deterministic policy gradient reinforcement learning method that considers both the situation information of two aircraft and the prediction results of enemy aircraft is proposed, so that the UAV can choose the appropriate maneuver decision according to the current air combat situation. The simulation results show that the method can effectively use the air combat situation information and target maneuver prediction information so that it can improve the effectiveness of the reinforcement learning method for UAV's autonomous air combat decision-making on the premise of ensuring convergence.
first_indexed	2024-03-09T08:31:17Z
format	Article
id	doaj.art-1d5931f4cbc84ea1ba6fc387662d2a05
institution	Directory Open Access Journal
issn	1000-2758 2609-7125
language	zho
last_indexed	2024-03-09T08:31:17Z
publishDate	2023-02-01
publisher	EDP Sciences
record_format	Article
series	Xibei Gongye Daxue Xuebao
spelling	doaj.art-1d5931f4cbc84ea1ba6fc387662d2a052023-12-02T19:50:06ZzhoEDP SciencesXibei Gongye Daxue Xuebao1000-27582609-71252023-02-01411566410.1051/jnwpu/20234110056jnwpu2023411p56UAV's air combat decision-making based on deep deterministic policy gradient and predictionLI Yongfeng0LYU Yongxi1SHI Jingping2LI Weihua3School of Automation, Northwestern Polytechnical UniversitySchool of Automation, Northwestern Polytechnical UniversitySchool of Automation, Northwestern Polytechnical UniversitySchool of Automation, Northwestern Polytechnical UniversityTo solve the enemy uncertain manipulation problem during a UAV's autonomous air combat maneuver decision-making, this paper proposes an autonomous air combat maneuver decision-making method that combines target maneuver command prediction with the deep deterministic policy algorithm. The situation data of both sides of air combat are effectively fused and processed, the UAV's six-degree-of-freedom model and maneuver library are built. In air combat, the target generates its corresponding maneuver library instructions through the deep Q network algorithm; at the same time, the UAV on our side gives the target maneuver prediction results through the probabilistic neural network. A deep deterministic policy gradient reinforcement learning method that considers both the situation information of two aircraft and the prediction results of enemy aircraft is proposed, so that the UAV can choose the appropriate maneuver decision according to the current air combat situation. The simulation results show that the method can effectively use the air combat situation information and target maneuver prediction information so that it can improve the effectiveness of the reinforcement learning method for UAV's autonomous air combat decision-making on the premise of ensuring convergence.https://www.jnwpu.org/articles/jnwpu/full_html/2023/01/jnwpu2023411p56/jnwpu2023411p56.html无人机空战机动决策预测深度确定性策略梯度
spellingShingle	LI Yongfeng LYU Yongxi SHI Jingping LI Weihua UAV's air combat decision-making based on deep deterministic policy gradient and prediction Xibei Gongye Daxue Xuebao 无人机空战机动决策预测深度确定性策略梯度
title	UAV's air combat decision-making based on deep deterministic policy gradient and prediction
title_full	UAV's air combat decision-making based on deep deterministic policy gradient and prediction
title_fullStr	UAV's air combat decision-making based on deep deterministic policy gradient and prediction
title_full_unstemmed	UAV's air combat decision-making based on deep deterministic policy gradient and prediction
title_short	UAV's air combat decision-making based on deep deterministic policy gradient and prediction
title_sort	uav s air combat decision making based on deep deterministic policy gradient and prediction
topic	无人机空战机动决策预测深度确定性策略梯度
url	https://www.jnwpu.org/articles/jnwpu/full_html/2023/01/jnwpu2023411p56/jnwpu2023411p56.html
work_keys_str_mv	AT liyongfeng uavsaircombatdecisionmakingbasedondeepdeterministicpolicygradientandprediction AT lyuyongxi uavsaircombatdecisionmakingbasedondeepdeterministicpolicygradientandprediction AT shijingping uavsaircombatdecisionmakingbasedondeepdeterministicpolicygradientandprediction AT liweihua uavsaircombatdecisionmakingbasedondeepdeterministicpolicygradientandprediction

UAV's air combat decision-making based on deep deterministic policy gradient and prediction

Similar Items