Study on UAV obstacle avoidance algorithm based on deep recurrent double Q network

The traditional reinforcement learning method has the problems of overestimation of value function and partial observability in the field of machine motion planning, especially in the obstacle avoidance problem of UAV, which lead to long training time and difficult convergence in the process of netw...

Full description

Bibliographic Details
Main Authors: WEI Yao, LIU Zhicheng, CAI Bin, CHEN Jiaxin, YANG Yao, ZHANG Kai
Format: Article
Language:zho
Published: EDP Sciences 2022-10-01
Series:Xibei Gongye Daxue Xuebao
Subjects:
Online Access:https://www.jnwpu.org/articles/jnwpu/full_html/2022/05/jnwpu2022405p970/jnwpu2022405p970.html