Adaptive Data Collection and Offloading in Multi-UAV-Assisted Maritime IoT Systems: A Deep Reinforcement Learning Approach

This paper studies the integration of data collection and offloading for maritime Internet of Things (IoT) systems with multiple unmanned aerial vehicles (UAVs). In the considered multi-UAV maritime IoT system, the UAVs act as the aerial base stations to complete the missions of data collection from...

Full description

Bibliographic Details
Main Authors:	Ziyi Liang, Yanpeng Dai, Ling Lyu, Bin Lin
Format:	Article
Language:	English
Published:	MDPI AG 2023-01-01
Series:	Remote Sensing
Subjects:	maritime IoT deep reinforcement learning UAV trajectory optimization hybrid proximal policy optimization
Online Access:	https://www.mdpi.com/2072-4292/15/2/292

_version_	1797437518707163136
author	Ziyi Liang Yanpeng Dai Ling Lyu Bin Lin
author_facet	Ziyi Liang Yanpeng Dai Ling Lyu Bin Lin
author_sort	Ziyi Liang
collection	DOAJ
description	This paper studies the integration of data collection and offloading for maritime Internet of Things (IoT) systems with multiple unmanned aerial vehicles (UAVs). In the considered multi-UAV maritime IoT system, the UAVs act as the aerial base stations to complete the missions of data collection from buoys and data offloading to the offshore base station (OBS). In this case, the UAVs need to adaptively select the mission mode between data collection and data offloading according to the network resources and mission requirements. In this paper, we aimed to minimize the completion time of data collection and offloading missions for all UAVs by jointly optimizing the UAV trajectories, mission mode selection, transmit power of buoys, and association relationships between the UAVs and buoy/OBS. In order to solve the mixed-integer non-convex minimization problem, we first designed a multi-agent deep reinforcement learning algorithm based on a hybrid discrete and continuous action space to preliminarily obtain the UAV trajectories, mission mode selection, and the transmit power of buoys. Then, we propose an algorithm based on the stable marriage problem to determine the buoy–UAV and UAV–OBS association relationships. Finally, the simulation results show that the proposed algorithms can effectively shorten the total mission completion time of data collection and offloading for the multi-UAV-assisted maritime IoT system.
first_indexed	2024-03-09T11:21:30Z
format	Article
id	doaj.art-b1e16a2e755c47a7a917f8d610a8d7aa
institution	Directory Open Access Journal
issn	2072-4292
language	English
last_indexed	2024-03-09T11:21:30Z
publishDate	2023-01-01
publisher	MDPI AG
record_format	Article
series	Remote Sensing
spelling	doaj.art-b1e16a2e755c47a7a917f8d610a8d7aa2023-12-01T00:18:22ZengMDPI AGRemote Sensing2072-42922023-01-0115229210.3390/rs15020292Adaptive Data Collection and Offloading in Multi-UAV-Assisted Maritime IoT Systems: A Deep Reinforcement Learning ApproachZiyi Liang0Yanpeng Dai1Ling Lyu2Bin Lin3School of Information Science and Technology, Dalian Maritime University, Dalian 116026, ChinaSchool of Information Science and Technology, Dalian Maritime University, Dalian 116026, ChinaSchool of Information Science and Technology, Dalian Maritime University, Dalian 116026, ChinaSchool of Information Science and Technology, Dalian Maritime University, Dalian 116026, ChinaThis paper studies the integration of data collection and offloading for maritime Internet of Things (IoT) systems with multiple unmanned aerial vehicles (UAVs). In the considered multi-UAV maritime IoT system, the UAVs act as the aerial base stations to complete the missions of data collection from buoys and data offloading to the offshore base station (OBS). In this case, the UAVs need to adaptively select the mission mode between data collection and data offloading according to the network resources and mission requirements. In this paper, we aimed to minimize the completion time of data collection and offloading missions for all UAVs by jointly optimizing the UAV trajectories, mission mode selection, transmit power of buoys, and association relationships between the UAVs and buoy/OBS. In order to solve the mixed-integer non-convex minimization problem, we first designed a multi-agent deep reinforcement learning algorithm based on a hybrid discrete and continuous action space to preliminarily obtain the UAV trajectories, mission mode selection, and the transmit power of buoys. Then, we propose an algorithm based on the stable marriage problem to determine the buoy–UAV and UAV–OBS association relationships. Finally, the simulation results show that the proposed algorithms can effectively shorten the total mission completion time of data collection and offloading for the multi-UAV-assisted maritime IoT system.https://www.mdpi.com/2072-4292/15/2/292maritime IoTdeep reinforcement learningUAV trajectory optimizationhybrid proximal policy optimization
spellingShingle	Ziyi Liang Yanpeng Dai Ling Lyu Bin Lin Adaptive Data Collection and Offloading in Multi-UAV-Assisted Maritime IoT Systems: A Deep Reinforcement Learning Approach Remote Sensing maritime IoT deep reinforcement learning UAV trajectory optimization hybrid proximal policy optimization
title	Adaptive Data Collection and Offloading in Multi-UAV-Assisted Maritime IoT Systems: A Deep Reinforcement Learning Approach
title_full	Adaptive Data Collection and Offloading in Multi-UAV-Assisted Maritime IoT Systems: A Deep Reinforcement Learning Approach
title_fullStr	Adaptive Data Collection and Offloading in Multi-UAV-Assisted Maritime IoT Systems: A Deep Reinforcement Learning Approach
title_full_unstemmed	Adaptive Data Collection and Offloading in Multi-UAV-Assisted Maritime IoT Systems: A Deep Reinforcement Learning Approach
title_short	Adaptive Data Collection and Offloading in Multi-UAV-Assisted Maritime IoT Systems: A Deep Reinforcement Learning Approach
title_sort	adaptive data collection and offloading in multi uav assisted maritime iot systems a deep reinforcement learning approach
topic	maritime IoT deep reinforcement learning UAV trajectory optimization hybrid proximal policy optimization
url	https://www.mdpi.com/2072-4292/15/2/292
work_keys_str_mv	AT ziyiliang adaptivedatacollectionandoffloadinginmultiuavassistedmaritimeiotsystemsadeepreinforcementlearningapproach AT yanpengdai adaptivedatacollectionandoffloadinginmultiuavassistedmaritimeiotsystemsadeepreinforcementlearningapproach AT linglyu adaptivedatacollectionandoffloadinginmultiuavassistedmaritimeiotsystemsadeepreinforcementlearningapproach AT binlin adaptivedatacollectionandoffloadinginmultiuavassistedmaritimeiotsystemsadeepreinforcementlearningapproach

Adaptive Data Collection and Offloading in Multi-UAV-Assisted Maritime IoT Systems: A Deep Reinforcement Learning Approach

Similar Items