Deep Reinforcement Learning Based Resource Allocation for Network Slicing With Massive MIMO

Network slicing is a critical technology for fifth-generation (5G) networks, owing to its merits in meeting the diversified requirements of users. Effective resource allocation for network slicing in Radio Access Networks (RAN) is still challenging owing to dynamic service requirements. Therein, aut...

Full description

Bibliographic Details
Main Authors: Dandan Yan, Benjamin K. Ng, Wei Ke, Chan-Tong Lam
Format: Article
Language:English
Published: IEEE 2023-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10186882/
_version_ 1797768419698802688
author Dandan Yan
Benjamin K. Ng
Wei Ke
Chan-Tong Lam
author_facet Dandan Yan
Benjamin K. Ng
Wei Ke
Chan-Tong Lam
author_sort Dandan Yan
collection DOAJ
description Network slicing is a critical technology for fifth-generation (5G) networks, owing to its merits in meeting the diversified requirements of users. Effective resource allocation for network slicing in Radio Access Networks (RAN) is still challenging owing to dynamic service requirements. Therein, automatic resource allocation based on environmental changes is of significant importance for network slicing. In this study, we used deep reinforcement learning (DRL) to allocate resources for network slicing in a RAN with the aid of massive multiple-input multiple-output (MIMO). The DRL agent interacts with the environment to execute autonomous resource allocation. We considered a two-level scheduling framework that aims to maximize the quality of experience (QoE) and spectrum efficiency (SE) of slices. The proposed algorithm can find a near-optimal solution. We used the standard DRL advantage actor-critic (A2C) algorithm to implement upper-level inter-slice bandwidth resource allocation that considers service traffic dynamics in a large timescale. Lower-level scheduling is a mixed-integer stochastic optimization problem with several constraints. We combined the proportional fair scheduling algorithm and the water filling algorithm to perform resource block (RB) and power allocation in a small timescale. The results show that the QoE and SE of all slices using the A2C algorithm achieved a significant performance improvement over the other algorithms. The efficiency of the proposed method was supported by the simulation results.
first_indexed 2024-03-12T20:53:56Z
format Article
id doaj.art-8aa126b2107e4832a40703ddd8952566
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-03-12T20:53:56Z
publishDate 2023-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-8aa126b2107e4832a40703ddd89525662023-07-31T23:00:43ZengIEEEIEEE Access2169-35362023-01-0111758997591110.1109/ACCESS.2023.329685110186882Deep Reinforcement Learning Based Resource Allocation for Network Slicing With Massive MIMODandan Yan0https://orcid.org/0009-0008-2220-0085Benjamin K. Ng1https://orcid.org/0000-0001-5901-5694Wei Ke2https://orcid.org/0000-0003-0952-0961Chan-Tong Lam3https://orcid.org/0000-0002-8022-7744Faculty of Applied Sciences, Macao Polytechnic University, Macau, ChinaFaculty of Applied Sciences, Macao Polytechnic University, Macau, ChinaFaculty of Applied Sciences, Macao Polytechnic University, Macau, ChinaFaculty of Applied Sciences, Macao Polytechnic University, Macau, ChinaNetwork slicing is a critical technology for fifth-generation (5G) networks, owing to its merits in meeting the diversified requirements of users. Effective resource allocation for network slicing in Radio Access Networks (RAN) is still challenging owing to dynamic service requirements. Therein, automatic resource allocation based on environmental changes is of significant importance for network slicing. In this study, we used deep reinforcement learning (DRL) to allocate resources for network slicing in a RAN with the aid of massive multiple-input multiple-output (MIMO). The DRL agent interacts with the environment to execute autonomous resource allocation. We considered a two-level scheduling framework that aims to maximize the quality of experience (QoE) and spectrum efficiency (SE) of slices. The proposed algorithm can find a near-optimal solution. We used the standard DRL advantage actor-critic (A2C) algorithm to implement upper-level inter-slice bandwidth resource allocation that considers service traffic dynamics in a large timescale. Lower-level scheduling is a mixed-integer stochastic optimization problem with several constraints. We combined the proportional fair scheduling algorithm and the water filling algorithm to perform resource block (RB) and power allocation in a small timescale. The results show that the QoE and SE of all slices using the A2C algorithm achieved a significant performance improvement over the other algorithms. The efficiency of the proposed method was supported by the simulation results.https://ieeexplore.ieee.org/document/10186882/Network slicingresource allocationradio access networks (RAN)massive MIMOadvantage actor critic (A2C)
spellingShingle Dandan Yan
Benjamin K. Ng
Wei Ke
Chan-Tong Lam
Deep Reinforcement Learning Based Resource Allocation for Network Slicing With Massive MIMO
IEEE Access
Network slicing
resource allocation
radio access networks (RAN)
massive MIMO
advantage actor critic (A2C)
title Deep Reinforcement Learning Based Resource Allocation for Network Slicing With Massive MIMO
title_full Deep Reinforcement Learning Based Resource Allocation for Network Slicing With Massive MIMO
title_fullStr Deep Reinforcement Learning Based Resource Allocation for Network Slicing With Massive MIMO
title_full_unstemmed Deep Reinforcement Learning Based Resource Allocation for Network Slicing With Massive MIMO
title_short Deep Reinforcement Learning Based Resource Allocation for Network Slicing With Massive MIMO
title_sort deep reinforcement learning based resource allocation for network slicing with massive mimo
topic Network slicing
resource allocation
radio access networks (RAN)
massive MIMO
advantage actor critic (A2C)
url https://ieeexplore.ieee.org/document/10186882/
work_keys_str_mv AT dandanyan deepreinforcementlearningbasedresourceallocationfornetworkslicingwithmassivemimo
AT benjaminkng deepreinforcementlearningbasedresourceallocationfornetworkslicingwithmassivemimo
AT weike deepreinforcementlearningbasedresourceallocationfornetworkslicingwithmassivemimo
AT chantonglam deepreinforcementlearningbasedresourceallocationfornetworkslicingwithmassivemimo