Reinforcement Learning Placement Algorithm for Optimization of UAV Network in Wireless Communication

Unmanned aerial vehicles (UAVs) or drones have attracted much attention in wireless communication networks because of their agility, unique flexibility, low cost of implementation, and the high strength of the line-of-sight (LoS) channel. They are widely used in different scenarios. In many environm...

Full description

Bibliographic Details
Main Authors: Sahar Baghdady, Seyed Masoud Mirrezaei, Rashid Mirzavand
Format: Article
Language:English
Published: IEEE 2024-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10462115/
_version_ 1797243290255360000
author Sahar Baghdady
Seyed Masoud Mirrezaei
Rashid Mirzavand
author_facet Sahar Baghdady
Seyed Masoud Mirrezaei
Rashid Mirzavand
author_sort Sahar Baghdady
collection DOAJ
description Unmanned aerial vehicles (UAVs) or drones have attracted much attention in wireless communication networks because of their agility, unique flexibility, low cost of implementation, and the high strength of the line-of-sight (LoS) channel. They are widely used in different scenarios. In many environments with complex geographical conditions or in situations where areas are affected by natural disasters, UAVs can be used as base stations (BSs) for downlink ground users. The article proposes a communication system using multiple UAV-mounted BSs to improve coverage rate and minimize the number of required UAVs. The problem is formulated as a mixed-integer programming problem with constraints on the quality of service (QoS) and serviceability of each UAV. A three-step method is developed to solve the problem, which includes deriving the maximum service radius of UAVs using the Karush-Kuhn-Tucker (KKT) method, minimizing the number of required UAVs using reinforcement learning (RL) algorithm, and designing the three-dimensional (3D) position and frequency band of each UAV to increase signal power and reduce interference. The simulation results show that the RLP algorithm outperforms other algorithms in terms of coverage rate, user clustering, increased signal, reduced interference, and processing time required to find the optimal solution.
first_indexed 2024-04-24T18:52:46Z
format Article
id doaj.art-5958803ef5434670b4df1abd7fbefef3
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-04-24T18:52:46Z
publishDate 2024-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-5958803ef5434670b4df1abd7fbefef32024-03-26T17:48:44ZengIEEEIEEE Access2169-35362024-01-0112379193793610.1109/ACCESS.2024.337438410462115Reinforcement Learning Placement Algorithm for Optimization of UAV Network in Wireless CommunicationSahar Baghdady0https://orcid.org/0009-0000-7540-506XSeyed Masoud Mirrezaei1https://orcid.org/0000-0001-8493-2072Rashid Mirzavand2https://orcid.org/0000-0001-8736-3418Electrical Engineering Department, Shahrood University of Technology, Shahrood, IranElectrical Engineering Department, Shahrood University of Technology, Shahrood, IranElectrical and Computer Engineering Department, University of Alberta, Edmonton, CanadaUnmanned aerial vehicles (UAVs) or drones have attracted much attention in wireless communication networks because of their agility, unique flexibility, low cost of implementation, and the high strength of the line-of-sight (LoS) channel. They are widely used in different scenarios. In many environments with complex geographical conditions or in situations where areas are affected by natural disasters, UAVs can be used as base stations (BSs) for downlink ground users. The article proposes a communication system using multiple UAV-mounted BSs to improve coverage rate and minimize the number of required UAVs. The problem is formulated as a mixed-integer programming problem with constraints on the quality of service (QoS) and serviceability of each UAV. A three-step method is developed to solve the problem, which includes deriving the maximum service radius of UAVs using the Karush-Kuhn-Tucker (KKT) method, minimizing the number of required UAVs using reinforcement learning (RL) algorithm, and designing the three-dimensional (3D) position and frequency band of each UAV to increase signal power and reduce interference. The simulation results show that the RLP algorithm outperforms other algorithms in terms of coverage rate, user clustering, increased signal, reduced interference, and processing time required to find the optimal solution.https://ieeexplore.ieee.org/document/10462115/Wireless communicationunmanned aerial vehicles (UAVs)base stations (BSs)three-dimensional (3D) deploymentuser clustering
spellingShingle Sahar Baghdady
Seyed Masoud Mirrezaei
Rashid Mirzavand
Reinforcement Learning Placement Algorithm for Optimization of UAV Network in Wireless Communication
IEEE Access
Wireless communication
unmanned aerial vehicles (UAVs)
base stations (BSs)
three-dimensional (3D) deployment
user clustering
title Reinforcement Learning Placement Algorithm for Optimization of UAV Network in Wireless Communication
title_full Reinforcement Learning Placement Algorithm for Optimization of UAV Network in Wireless Communication
title_fullStr Reinforcement Learning Placement Algorithm for Optimization of UAV Network in Wireless Communication
title_full_unstemmed Reinforcement Learning Placement Algorithm for Optimization of UAV Network in Wireless Communication
title_short Reinforcement Learning Placement Algorithm for Optimization of UAV Network in Wireless Communication
title_sort reinforcement learning placement algorithm for optimization of uav network in wireless communication
topic Wireless communication
unmanned aerial vehicles (UAVs)
base stations (BSs)
three-dimensional (3D) deployment
user clustering
url https://ieeexplore.ieee.org/document/10462115/
work_keys_str_mv AT saharbaghdady reinforcementlearningplacementalgorithmforoptimizationofuavnetworkinwirelesscommunication
AT seyedmasoudmirrezaei reinforcementlearningplacementalgorithmforoptimizationofuavnetworkinwirelesscommunication
AT rashidmirzavand reinforcementlearningplacementalgorithmforoptimizationofuavnetworkinwirelesscommunication