Multiple Unmanned Aerial Vehicle Autonomous Path Planning Algorithm Based on Whale-Inspired Deep Q-Network

In emergency rescue missions, rescue teams can use UAVs and efficient path planning strategies to provide flexible rescue services for trapped people, which can improve rescue efficiency and reduce personnel risks. However, since the task environment of UAVs is usually complex, uncertain, and commun...

Full description

Bibliographic Details
Main Authors:	Wenshan Wang, Guoyin Zhang, Qingan Da, Dan Lu, Yingnan Zhao, Sizhao Li, Dapeng Lang
Format:	Article
Language:	English
Published:	MDPI AG 2023-09-01
Series:	Drones
Subjects:	deep reinforcement learning deep Q-network whale optimization algorithm multi-UAV path planning
Online Access:	https://www.mdpi.com/2504-446X/7/9/572

_version_	1797580511789449216
author	Wenshan Wang Guoyin Zhang Qingan Da Dan Lu Yingnan Zhao Sizhao Li Dapeng Lang
author_facet	Wenshan Wang Guoyin Zhang Qingan Da Dan Lu Yingnan Zhao Sizhao Li Dapeng Lang
author_sort	Wenshan Wang
collection	DOAJ
description	In emergency rescue missions, rescue teams can use UAVs and efficient path planning strategies to provide flexible rescue services for trapped people, which can improve rescue efficiency and reduce personnel risks. However, since the task environment of UAVs is usually complex, uncertain, and communication-limited, traditional path planning methods may not be able to meet practical needs. In this paper, we introduce a whale optimization algorithm into a deep Q-network and propose a path planning algorithm based on a whale-inspired deep Q-network, which enables UAVs to search for targets faster and safer in uncertain and complex environments. In particular, we first transform the UAV path planning problem into a Markov decision process. Then, we design a comprehensive reward function considering the three factors of path length, obstacle avoidance, and energy consumption. Next, we use the main framework of the deep Q-network to approximate the Q-value function by training a deep neural network. During the training phase, the whale optimization algorithm is introduced for path exploration to generate a richer action decision experience. Finally, experiments show that the proposed algorithm can enable the UAV to autonomously plan a collision-free feasible path in an uncertain environment. And compared with classic reinforcement learning algorithms, the proposed algorithm has a better performance in learning efficiency, path planning success rate, and path length.
first_indexed	2024-03-10T22:52:04Z
format	Article
id	doaj.art-57d1b29371584b279f90bfc025691c5b
institution	Directory Open Access Journal
issn	2504-446X
language	English
last_indexed	2024-03-10T22:52:04Z
publishDate	2023-09-01
publisher	MDPI AG
record_format	Article
series	Drones
spelling	doaj.art-57d1b29371584b279f90bfc025691c5b2023-11-19T10:17:10ZengMDPI AGDrones2504-446X2023-09-017957210.3390/drones7090572Multiple Unmanned Aerial Vehicle Autonomous Path Planning Algorithm Based on Whale-Inspired Deep Q-NetworkWenshan Wang0Guoyin Zhang1Qingan Da2Dan Lu3Yingnan Zhao4Sizhao Li5Dapeng Lang6College of Computer Science and Technology, Harbin Engineering University, Harbin 150001, ChinaCollege of Computer Science and Technology, Harbin Engineering University, Harbin 150001, ChinaCollege of Computer Science and Technology, Harbin Engineering University, Harbin 150001, ChinaCollege of Computer Science and Technology, Harbin Engineering University, Harbin 150001, ChinaCollege of Computer Science and Technology, Harbin Engineering University, Harbin 150001, ChinaCollege of Computer Science and Technology, Harbin Engineering University, Harbin 150001, ChinaCollege of Computer Science and Technology, Harbin Engineering University, Harbin 150001, ChinaIn emergency rescue missions, rescue teams can use UAVs and efficient path planning strategies to provide flexible rescue services for trapped people, which can improve rescue efficiency and reduce personnel risks. However, since the task environment of UAVs is usually complex, uncertain, and communication-limited, traditional path planning methods may not be able to meet practical needs. In this paper, we introduce a whale optimization algorithm into a deep Q-network and propose a path planning algorithm based on a whale-inspired deep Q-network, which enables UAVs to search for targets faster and safer in uncertain and complex environments. In particular, we first transform the UAV path planning problem into a Markov decision process. Then, we design a comprehensive reward function considering the three factors of path length, obstacle avoidance, and energy consumption. Next, we use the main framework of the deep Q-network to approximate the Q-value function by training a deep neural network. During the training phase, the whale optimization algorithm is introduced for path exploration to generate a richer action decision experience. Finally, experiments show that the proposed algorithm can enable the UAV to autonomously plan a collision-free feasible path in an uncertain environment. And compared with classic reinforcement learning algorithms, the proposed algorithm has a better performance in learning efficiency, path planning success rate, and path length.https://www.mdpi.com/2504-446X/7/9/572deep reinforcement learningdeep Q-networkwhale optimization algorithmmulti-UAVpath planning
spellingShingle	Wenshan Wang Guoyin Zhang Qingan Da Dan Lu Yingnan Zhao Sizhao Li Dapeng Lang Multiple Unmanned Aerial Vehicle Autonomous Path Planning Algorithm Based on Whale-Inspired Deep Q-Network Drones deep reinforcement learning deep Q-network whale optimization algorithm multi-UAV path planning
title	Multiple Unmanned Aerial Vehicle Autonomous Path Planning Algorithm Based on Whale-Inspired Deep Q-Network
title_full	Multiple Unmanned Aerial Vehicle Autonomous Path Planning Algorithm Based on Whale-Inspired Deep Q-Network
title_fullStr	Multiple Unmanned Aerial Vehicle Autonomous Path Planning Algorithm Based on Whale-Inspired Deep Q-Network
title_full_unstemmed	Multiple Unmanned Aerial Vehicle Autonomous Path Planning Algorithm Based on Whale-Inspired Deep Q-Network
title_short	Multiple Unmanned Aerial Vehicle Autonomous Path Planning Algorithm Based on Whale-Inspired Deep Q-Network
title_sort	multiple unmanned aerial vehicle autonomous path planning algorithm based on whale inspired deep q network
topic	deep reinforcement learning deep Q-network whale optimization algorithm multi-UAV path planning
url	https://www.mdpi.com/2504-446X/7/9/572
work_keys_str_mv	AT wenshanwang multipleunmannedaerialvehicleautonomouspathplanningalgorithmbasedonwhaleinspireddeepqnetwork AT guoyinzhang multipleunmannedaerialvehicleautonomouspathplanningalgorithmbasedonwhaleinspireddeepqnetwork AT qinganda multipleunmannedaerialvehicleautonomouspathplanningalgorithmbasedonwhaleinspireddeepqnetwork AT danlu multipleunmannedaerialvehicleautonomouspathplanningalgorithmbasedonwhaleinspireddeepqnetwork AT yingnanzhao multipleunmannedaerialvehicleautonomouspathplanningalgorithmbasedonwhaleinspireddeepqnetwork AT sizhaoli multipleunmannedaerialvehicleautonomouspathplanningalgorithmbasedonwhaleinspireddeepqnetwork AT dapenglang multipleunmannedaerialvehicleautonomouspathplanningalgorithmbasedonwhaleinspireddeepqnetwork

Multiple Unmanned Aerial Vehicle Autonomous Path Planning Algorithm Based on Whale-Inspired Deep Q-Network

Similar Items