Study on UAV obstacle avoidance algorithm based on deep recurrent double Q network
Traditional reinforcement learning methods suffer from value-function overestimation and partial observability in machine motion planning, especially in UAV obstacle avoidance, which lead to long training times and difficult convergence in the process of netw...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | Chinese (zho) |
Published: | EDP Sciences, 2022-10-01 |
Series: | Xibei Gongye Daxue Xuebao |
Online Access: | https://www.jnwpu.org/articles/jnwpu/full_html/2022/05/jnwpu2022405p970/jnwpu2022405p970.html |