Situational continuity-based air combat autonomous maneuvering decision-making

In order to improve the performance of UAV’s autonomous maneuvering decision-making, this paper proposes a decision-making method based on situational continuity. The algorithm in this paper designs a situation evaluation function with strong guidance, then trains the Long Short-Term Memory (LSTM) u...

Full description

Bibliographic Details
Main Authors: Jian-dong Zhang, Yi-fei Yu, Li-hui Zheng, Qi-ming Yang, Guo-qing Shi, Yong Wu
Format: Article
Language:English
Published: KeAi Communications Co., Ltd. 2023-11-01
Series:Defence Technology
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2214914722001842
Description
Summary:In order to improve the performance of UAV’s autonomous maneuvering decision-making, this paper proposes a decision-making method based on situational continuity. The algorithm in this paper designs a situation evaluation function with strong guidance, then trains the Long Short-Term Memory (LSTM) under the framework of Deep Q Network (DQN) for air combat maneuvering decision-making. Considering the continuity between adjacent situations, the method takes multiple consecutive situations as one input of the neural network. To reflect the difference between adjacent situations, the method takes the difference of situation evaluation value as the reward of reinforcement learning. In different scenarios, the algorithm proposed in this paper is compared with the algorithm based on the Fully Neural Network (FNN) and the algorithm based on statistical principles respectively. The results show that, compared with the FNN algorithm, the algorithm proposed in this paper is more accurate and forward-looking. Compared with the algorithm based on the statistical principles, the decision-making of the algorithm proposed in this paper is more efficient and its real-time performance is better.
ISSN:2214-9147