Dynamic Network Slice Scaling Assisted by Attention-Based Prediction in 5G Core Network

Network slicing is a key technology in fifth-generation (5G) networks that allows network operators to create multiple logical networks over a shared physical infrastructure to meet the requirements of diverse use cases. Among core functions to implement network slicing, resource management and scal...

Full description

Bibliographic Details
Main Authors: Chien-Nguyen Nhu, Minho Park
Format: Article
Language:English
Published: IEEE 2022-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9828381/
_version_ 1811218364911058944
author Chien-Nguyen Nhu
Minho Park
author_facet Chien-Nguyen Nhu
Minho Park
author_sort Chien-Nguyen Nhu
collection DOAJ
description Network slicing is a key technology in fifth-generation (5G) networks that allows network operators to create multiple logical networks over a shared physical infrastructure to meet the requirements of diverse use cases. Among core functions to implement network slicing, resource management and scaling are difficult challenges. Network operators must ensure the Service Level Agreement (SLA) requirements for latency, bandwidth, resources, etc for each network slice while utilizing the limited resources efficiently, i.e., optimal resource assignment and dynamic resource scaling for each network slice. Existing resource scaling approaches can be classified into reactive and proactive types. The former makes a resource scaling decision when the resource usage of virtual network functions (VNFs) exceeds a predefined threshold, and the latter forecasts the future resource usage of VNFs in network slices by utilizing classical statistical models or deep learning models. However, both have a trade-off between assurance and efficiency. For instance, the lower threshold in the reactive approach or more marginal prediction in the proactive approach can meet the requirements more certainly, but it may cause unnecessary resource wastage. To overcome the trade-off, we first propose a novel and efficient proactive resource forecasting algorithm. The proposed algorithm introduces an attention-based encoder-decoder model for multivariate time series forecasting to achieve high short-term and long-term prediction accuracies. It helps network slices be scaled up and down effectively and reduces the costs of SLA violations and resource overprovisioning. Using the attention mechanism, the model attends to every hidden state of the sequential input at every time step to select the most important time steps affecting the prediction results. We also designed an automated resource configuration mechanism responsible for monitoring resources and automatically adding or removing VNF instances of network slices, which helps network operators satisfy service requirements even when the traffic of end-user requests changes dynamically. Comprehensive experiments demonstrate that our proposed solution outperforms other solutions in terms of short-term and long-term predictions while reducing the cost of SLA violations and resource overprovisioning and enhancing the delay quality of network slices.
first_indexed 2024-04-12T07:08:09Z
format Article
id doaj.art-39acec8f42e746c2a46e16e3961b01e0
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-04-12T07:08:09Z
publishDate 2022-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-39acec8f42e746c2a46e16e3961b01e02022-12-22T03:42:43ZengIEEEIEEE Access2169-35362022-01-0110729557297210.1109/ACCESS.2022.31906409828381Dynamic Network Slice Scaling Assisted by Attention-Based Prediction in 5G Core NetworkChien-Nguyen Nhu0https://orcid.org/0000-0002-5118-4212Minho Park1https://orcid.org/0000-0003-3033-192XDepartment of Information Communication Convergence Technology, Soongsil University, Seoul, South KoreaSchool of Electronic Engineering, Soongsil University, Seoul, South KoreaNetwork slicing is a key technology in fifth-generation (5G) networks that allows network operators to create multiple logical networks over a shared physical infrastructure to meet the requirements of diverse use cases. Among core functions to implement network slicing, resource management and scaling are difficult challenges. Network operators must ensure the Service Level Agreement (SLA) requirements for latency, bandwidth, resources, etc for each network slice while utilizing the limited resources efficiently, i.e., optimal resource assignment and dynamic resource scaling for each network slice. Existing resource scaling approaches can be classified into reactive and proactive types. The former makes a resource scaling decision when the resource usage of virtual network functions (VNFs) exceeds a predefined threshold, and the latter forecasts the future resource usage of VNFs in network slices by utilizing classical statistical models or deep learning models. However, both have a trade-off between assurance and efficiency. For instance, the lower threshold in the reactive approach or more marginal prediction in the proactive approach can meet the requirements more certainly, but it may cause unnecessary resource wastage. To overcome the trade-off, we first propose a novel and efficient proactive resource forecasting algorithm. The proposed algorithm introduces an attention-based encoder-decoder model for multivariate time series forecasting to achieve high short-term and long-term prediction accuracies. It helps network slices be scaled up and down effectively and reduces the costs of SLA violations and resource overprovisioning. Using the attention mechanism, the model attends to every hidden state of the sequential input at every time step to select the most important time steps affecting the prediction results. We also designed an automated resource configuration mechanism responsible for monitoring resources and automatically adding or removing VNF instances of network slices, which helps network operators satisfy service requirements even when the traffic of end-user requests changes dynamically. Comprehensive experiments demonstrate that our proposed solution outperforms other solutions in terms of short-term and long-term predictions while reducing the cost of SLA violations and resource overprovisioning and enhancing the delay quality of network slices.https://ieeexplore.ieee.org/document/9828381/Network slicingauto scalingresource predictiondeep learningattention-based encoder-decoder
spellingShingle Chien-Nguyen Nhu
Minho Park
Dynamic Network Slice Scaling Assisted by Attention-Based Prediction in 5G Core Network
IEEE Access
Network slicing
auto scaling
resource prediction
deep learning
attention-based encoder-decoder
title Dynamic Network Slice Scaling Assisted by Attention-Based Prediction in 5G Core Network
title_full Dynamic Network Slice Scaling Assisted by Attention-Based Prediction in 5G Core Network
title_fullStr Dynamic Network Slice Scaling Assisted by Attention-Based Prediction in 5G Core Network
title_full_unstemmed Dynamic Network Slice Scaling Assisted by Attention-Based Prediction in 5G Core Network
title_short Dynamic Network Slice Scaling Assisted by Attention-Based Prediction in 5G Core Network
title_sort dynamic network slice scaling assisted by attention based prediction in 5g core network
topic Network slicing
auto scaling
resource prediction
deep learning
attention-based encoder-decoder
url https://ieeexplore.ieee.org/document/9828381/
work_keys_str_mv AT chiennguyennhu dynamicnetworkslicescalingassistedbyattentionbasedpredictionin5gcorenetwork
AT minhopark dynamicnetworkslicescalingassistedbyattentionbasedpredictionin5gcorenetwork