Deep Reinforcement Learning Evolution Algorithm for Dynamic Antenna Control in Multi-Cell Configuration HAPS System

In this paper, we propose a novel Deep Reinforcement Learning Evolution Algorithm (DRLEA) method to control the antenna parameters of the High-Altitude Platform Station (HAPS) mobile to reduce the number of low-throughput users. Considering the random movement of the HAPS caused by the winds, the th...

Full description

Bibliographic Details
Main Authors: Siyuan Yang, Mondher Bouazizi, Tomoaki Ohtsuki, Yohei Shibata, Wataru Takabatake, Kenji Hoshino, Atsushi Nagate
Format: Article
Language:English
Published: MDPI AG 2023-01-01
Series:Future Internet
Subjects:
Online Access:https://www.mdpi.com/1999-5903/15/1/34
_version_ 1797442265390514176
author Siyuan Yang
Mondher Bouazizi
Tomoaki Ohtsuki
Yohei Shibata
Wataru Takabatake
Kenji Hoshino
Atsushi Nagate
author_facet Siyuan Yang
Mondher Bouazizi
Tomoaki Ohtsuki
Yohei Shibata
Wataru Takabatake
Kenji Hoshino
Atsushi Nagate
author_sort Siyuan Yang
collection DOAJ
description In this paper, we propose a novel Deep Reinforcement Learning Evolution Algorithm (DRLEA) method to control the antenna parameters of the High-Altitude Platform Station (HAPS) mobile to reduce the number of low-throughput users. Considering the random movement of the HAPS caused by the winds, the throughput of the users might decrease. Therefore, we propose a method that can dynamically adjust the antenna parameters based on the throughput of the users in the coverage area to reduce the number of low-throughput users by improving the users’ throughput. Different from other model-based reinforcement learning methods, such as the Deep <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mi mathvariant="script">Q</mi></semantics></math></inline-formula> Network (DQN), the proposed method combines the Evolution Algorithm (EA) with Reinforcement Learning (RL) to avoid the sub-optimal solutions in each state. Moreover, we consider non-uniform user distribution scenarios, which are common in the real world, rather than ideal uniform user distribution scenarios. To evaluate the proposed method, we do the simulations under four different real user distribution scenarios and compare the proposed method with the conventional EA and RL methods. The simulation results show that the proposed method effectively reduces the number of low throughput users after the HAPS moves.
first_indexed 2024-03-09T12:39:20Z
format Article
id doaj.art-e655c987d84b4879bc428929d38e46e8
institution Directory Open Access Journal
issn 1999-5903
language English
last_indexed 2024-03-09T12:39:20Z
publishDate 2023-01-01
publisher MDPI AG
record_format Article
series Future Internet
spelling doaj.art-e655c987d84b4879bc428929d38e46e82023-11-30T22:20:51ZengMDPI AGFuture Internet1999-59032023-01-011513410.3390/fi15010034Deep Reinforcement Learning Evolution Algorithm for Dynamic Antenna Control in Multi-Cell Configuration HAPS SystemSiyuan Yang0Mondher Bouazizi1Tomoaki Ohtsuki2Yohei Shibata3Wataru Takabatake4Kenji Hoshino5Atsushi Nagate6Graduate School of Science and Technology, Keio University, Yokohama 223-8522, JapanDepartment of Information and Computer Science, Faculty of Science and Technology, Keio University, Yokohama 223-8522, JapanDepartment of Information and Computer Science, Faculty of Science and Technology, Keio University, Yokohama 223-8522, JapanSoftBank Corp. Technology Research Laboratory, Tokyo 135-0064, JapanSoftBank Corp. Technology Research Laboratory, Tokyo 135-0064, JapanSoftBank Corp. Technology Research Laboratory, Tokyo 135-0064, JapanSoftBank Corp. Technology Research Laboratory, Tokyo 135-0064, JapanIn this paper, we propose a novel Deep Reinforcement Learning Evolution Algorithm (DRLEA) method to control the antenna parameters of the High-Altitude Platform Station (HAPS) mobile to reduce the number of low-throughput users. Considering the random movement of the HAPS caused by the winds, the throughput of the users might decrease. Therefore, we propose a method that can dynamically adjust the antenna parameters based on the throughput of the users in the coverage area to reduce the number of low-throughput users by improving the users’ throughput. Different from other model-based reinforcement learning methods, such as the Deep <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mi mathvariant="script">Q</mi></semantics></math></inline-formula> Network (DQN), the proposed method combines the Evolution Algorithm (EA) with Reinforcement Learning (RL) to avoid the sub-optimal solutions in each state. Moreover, we consider non-uniform user distribution scenarios, which are common in the real world, rather than ideal uniform user distribution scenarios. To evaluate the proposed method, we do the simulations under four different real user distribution scenarios and compare the proposed method with the conventional EA and RL methods. The simulation results show that the proposed method effectively reduces the number of low throughput users after the HAPS moves.https://www.mdpi.com/1999-5903/15/1/34HAPSantenna controlreinforcement learningevolution algorithm
spellingShingle Siyuan Yang
Mondher Bouazizi
Tomoaki Ohtsuki
Yohei Shibata
Wataru Takabatake
Kenji Hoshino
Atsushi Nagate
Deep Reinforcement Learning Evolution Algorithm for Dynamic Antenna Control in Multi-Cell Configuration HAPS System
Future Internet
HAPS
antenna control
reinforcement learning
evolution algorithm
title Deep Reinforcement Learning Evolution Algorithm for Dynamic Antenna Control in Multi-Cell Configuration HAPS System
title_full Deep Reinforcement Learning Evolution Algorithm for Dynamic Antenna Control in Multi-Cell Configuration HAPS System
title_fullStr Deep Reinforcement Learning Evolution Algorithm for Dynamic Antenna Control in Multi-Cell Configuration HAPS System
title_full_unstemmed Deep Reinforcement Learning Evolution Algorithm for Dynamic Antenna Control in Multi-Cell Configuration HAPS System
title_short Deep Reinforcement Learning Evolution Algorithm for Dynamic Antenna Control in Multi-Cell Configuration HAPS System
title_sort deep reinforcement learning evolution algorithm for dynamic antenna control in multi cell configuration haps system
topic HAPS
antenna control
reinforcement learning
evolution algorithm
url https://www.mdpi.com/1999-5903/15/1/34
work_keys_str_mv AT siyuanyang deepreinforcementlearningevolutionalgorithmfordynamicantennacontrolinmulticellconfigurationhapssystem
AT mondherbouazizi deepreinforcementlearningevolutionalgorithmfordynamicantennacontrolinmulticellconfigurationhapssystem
AT tomoakiohtsuki deepreinforcementlearningevolutionalgorithmfordynamicantennacontrolinmulticellconfigurationhapssystem
AT yoheishibata deepreinforcementlearningevolutionalgorithmfordynamicantennacontrolinmulticellconfigurationhapssystem
AT watarutakabatake deepreinforcementlearningevolutionalgorithmfordynamicantennacontrolinmulticellconfigurationhapssystem
AT kenjihoshino deepreinforcementlearningevolutionalgorithmfordynamicantennacontrolinmulticellconfigurationhapssystem
AT atsushinagate deepreinforcementlearningevolutionalgorithmfordynamicantennacontrolinmulticellconfigurationhapssystem