On Meeting a Maximum Delay Constraint Using Reinforcement Learning

Several emerging applications in wireless communications are required to achieve low latency, but also high traffic rates and reliabilities. From a latency point of view, most of the state-of-the-art techniques consider the average latency which may not directly apply to scenarios with stringent lat...

Full description

Bibliographic Details
Main Authors: Hossein Shafieirad, Raviraj S. Adve
Format: Article
Language:English
Published: IEEE 2022-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9887962/
_version_ 1798000468395294720
author Hossein Shafieirad
Raviraj S. Adve
author_facet Hossein Shafieirad
Raviraj S. Adve
author_sort Hossein Shafieirad
collection DOAJ
description Several emerging applications in wireless communications are required to achieve low latency, but also high traffic rates and reliabilities. From a latency point of view, most of the state-of-the-art techniques consider the average latency which may not directly apply to scenarios with stringent latency constraints. In this paper, we consider scheduling under a max-delay constraint; this is an NP-hard problem. We propose a novel approach to tackle the scheduling problem by directly addressing the constraint. We consider the downlink of a multi-cell wireless communication network with nodes communicating with users each facing their own delay constraint on randomly arrived packets. Packets must be scheduled to meet the users’ delay constraints. Our main contributions are first, proposing a new search approach, Super State Monte-Carlo Tree Search (SS-MCTS), as a version of regular MCTS modified for large-scale probabilistic environments; second, developing trained value and policy networks to reduce computational complexity, and finally, addressing the scheduling problem through a reinforcement learning framework. Our numerical results demonstrate that the proposed approach significantly improves the packet delivery rate over a baseline approach while meeting the max-delay constraint, and addressing the scalability as the main issues in large action-state spaces.
first_indexed 2024-04-11T11:20:45Z
format Article
id doaj.art-9877bc201e44404b8aac2d8c968e386a
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-04-11T11:20:45Z
publishDate 2022-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-9877bc201e44404b8aac2d8c968e386a2022-12-22T04:27:04ZengIEEEIEEE Access2169-35362022-01-0110978979791110.1109/ACCESS.2022.32060359887962On Meeting a Maximum Delay Constraint Using Reinforcement LearningHossein Shafieirad0https://orcid.org/0000-0002-0633-404XRaviraj S. Adve1https://orcid.org/0000-0003-0224-2209Department of Electrical and Computer Engineering, University of Toronto, Toronto, ON, CanadaDepartment of Electrical and Computer Engineering, University of Toronto, Toronto, ON, CanadaSeveral emerging applications in wireless communications are required to achieve low latency, but also high traffic rates and reliabilities. From a latency point of view, most of the state-of-the-art techniques consider the average latency which may not directly apply to scenarios with stringent latency constraints. In this paper, we consider scheduling under a max-delay constraint; this is an NP-hard problem. We propose a novel approach to tackle the scheduling problem by directly addressing the constraint. We consider the downlink of a multi-cell wireless communication network with nodes communicating with users each facing their own delay constraint on randomly arrived packets. Packets must be scheduled to meet the users’ delay constraints. Our main contributions are first, proposing a new search approach, Super State Monte-Carlo Tree Search (SS-MCTS), as a version of regular MCTS modified for large-scale probabilistic environments; second, developing trained value and policy networks to reduce computational complexity, and finally, addressing the scheduling problem through a reinforcement learning framework. Our numerical results demonstrate that the proposed approach significantly improves the packet delivery rate over a baseline approach while meeting the max-delay constraint, and addressing the scalability as the main issues in large action-state spaces.https://ieeexplore.ieee.org/document/9887962/Monte-Carlo tree searchschedulingreinforcement learningmax-delay constraints
spellingShingle Hossein Shafieirad
Raviraj S. Adve
On Meeting a Maximum Delay Constraint Using Reinforcement Learning
IEEE Access
Monte-Carlo tree search
scheduling
reinforcement learning
max-delay constraints
title On Meeting a Maximum Delay Constraint Using Reinforcement Learning
title_full On Meeting a Maximum Delay Constraint Using Reinforcement Learning
title_fullStr On Meeting a Maximum Delay Constraint Using Reinforcement Learning
title_full_unstemmed On Meeting a Maximum Delay Constraint Using Reinforcement Learning
title_short On Meeting a Maximum Delay Constraint Using Reinforcement Learning
title_sort on meeting a maximum delay constraint using reinforcement learning
topic Monte-Carlo tree search
scheduling
reinforcement learning
max-delay constraints
url https://ieeexplore.ieee.org/document/9887962/
work_keys_str_mv AT hosseinshafieirad onmeetingamaximumdelayconstraintusingreinforcementlearning
AT ravirajsadve onmeetingamaximumdelayconstraintusingreinforcementlearning