Research on Maneuvering Decision Algorithm Based on Improved Deep Deterministic Policy Gradient

Bibliographic Details
Main Authors:	Xianyong Jing, Manyi Hou, Gaolong Wu, Zongcheng Ma, Zhongxiang Tao
Format:	Article
Language:	English
Published:	IEEE 2022-01-01
Series:	IEEE Access
Subjects:	Unmanned aerial vehicle (UAV); maneuvering decision; deep deterministic policy gradient (DDPG); short-range air combat; reinforcement learning (RL)
Online Access:	https://ieeexplore.ieee.org/document/9869808/

Description
Summary:	Autonomous maneuvering decisions for unmanned aerial vehicles (UAVs) in short-range air combat remain a challenging research topic. A decision method based on an improved deep deterministic policy gradient (DDPG) algorithm is proposed. First, the problem model is improved from the perspective of energy air combat, and a decision model with engine thrust, angle of attack, and roll angle as control variables is established. The normal and tangential overloads are determined by these control variables, and the decision is constrained by flight stability and threshold ranges. Subsequently, the learning algorithm for the maneuver command is designed based on the DDPG framework. In line with energy air combat, speed is introduced into the return function in some states to make the return value more realistic. To address the slow learning speed of the DDPG algorithm, the winning rate is introduced into the $\varepsilon$-greedy strategy to adjust the exploration and exploitation probabilities in real time. To address the loss of computational efficiency caused by the large amount of experience data, similar experiences are excluded based on vector distance. The simulation results show that the DDPG-based algorithm realizes autonomous decisions on engine thrust, roll angle, and angle of attack under constraints, and comparative simulations show that the improvement measures are effective.
ISSN:	2169-3536
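
The summary names two modifications to DDPG without giving formulas: an $\varepsilon$-greedy strategy adjusted by the winning rate, and exclusion of similar experiences based on vector distance. The Python sketch below illustrates one possible reading of each idea. Everything in it is an assumption made for illustration and is not taken from the article: the linear mapping from win rate to $\varepsilon$, the bounds `eps_min` and `eps_max`, the Euclidean distance over concatenated state and action vectors, the `min_distance` threshold, and all function and class names.

```python
import math
import random


def epsilon_from_win_rate(win_rate, eps_min=0.05, eps_max=0.9):
    """Hypothetical schedule: explore more while the recent win rate is low.

    The linear form and the bounds are assumptions, not the paper's formula.
    """
    win_rate = min(max(win_rate, 0.0), 1.0)  # clamp to [0, 1]
    return eps_max - (eps_max - eps_min) * win_rate


def choose_action(policy_action, random_action, win_rate, rng=random):
    """With probability epsilon explore, otherwise use the actor's action.

    policy_action and random_action are caller-supplied zero-argument
    callables (hypothetical placeholders for the actor network and a sampler).
    """
    eps = epsilon_from_win_rate(win_rate)
    return random_action() if rng.random() < eps else policy_action()


class DistanceFilteredBuffer:
    """Replay buffer that rejects experiences too close to stored ones.

    Similarity is measured as Euclidean distance between concatenated
    (state, action) vectors; this metric is an assumption.
    """

    def __init__(self, capacity=10000, min_distance=0.1):
        self.capacity = capacity
        self.min_distance = min_distance
        self.items = []  # list of (key_vector, transition) pairs

    def add(self, state, action, reward, next_state):
        key = tuple(state) + tuple(action)
        # Linear scan for clarity; a spatial index would scale better.
        for old_key, _ in self.items:
            if math.dist(key, old_key) < self.min_distance:
                return False  # a similar experience is already stored
        if len(self.items) >= self.capacity:
            self.items.pop(0)  # drop the oldest transition
        self.items.append((key, (state, action, reward, next_state)))
        return True

    def sample(self, batch_size, rng=random):
        """Uniformly sample stored transitions without replacement."""
        chosen = rng.sample(self.items, min(batch_size, len(self.items)))
        return [transition for _, transition in chosen]
```

In this sketch a low win rate keeps $\varepsilon$ near `eps_max`, so the agent explores more, and $\varepsilon$ decays toward `eps_min` as the win rate rises. The buffer's `add` method returns `False` when a new experience lies within `min_distance` of a stored one, which keeps the buffer smaller at the cost of a linear scan per insertion.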

Similar Items