On Meeting a Maximum Delay Constraint Using Reinforcement Learning

Several emerging applications in wireless communications are required to achieve low latency, but also high traffic rates and reliabilities. From a latency point of view, most of the state-of-the-art techniques consider the average latency which may not directly apply to scenarios with stringent lat...

Full description

Bibliographic Details
Main Authors:	Hossein Shafieirad, Raviraj S. Adve
Format:	Article
Language:	English
Published:	IEEE 2022-01-01
Series:	IEEE Access
Subjects:	Monte-Carlo tree search scheduling reinforcement learning max-delay constraints
Online Access:	https://ieeexplore.ieee.org/document/9887962/

_version_	1798000468395294720
author	Hossein Shafieirad Raviraj S. Adve
author_facet	Hossein Shafieirad Raviraj S. Adve
author_sort	Hossein Shafieirad
collection	DOAJ
description	Several emerging applications in wireless communications are required to achieve low latency, but also high traffic rates and reliabilities. From a latency point of view, most of the state-of-the-art techniques consider the average latency which may not directly apply to scenarios with stringent latency constraints. In this paper, we consider scheduling under a max-delay constraint; this is an NP-hard problem. We propose a novel approach to tackle the scheduling problem by directly addressing the constraint. We consider the downlink of a multi-cell wireless communication network with nodes communicating with users each facing their own delay constraint on randomly arrived packets. Packets must be scheduled to meet the users’ delay constraints. Our main contributions are first, proposing a new search approach, Super State Monte-Carlo Tree Search (SS-MCTS), as a version of regular MCTS modified for large-scale probabilistic environments; second, developing trained value and policy networks to reduce computational complexity, and finally, addressing the scheduling problem through a reinforcement learning framework. Our numerical results demonstrate that the proposed approach significantly improves the packet delivery rate over a baseline approach while meeting the max-delay constraint, and addressing the scalability as the main issues in large action-state spaces.
first_indexed	2024-04-11T11:20:45Z
format	Article
id	doaj.art-9877bc201e44404b8aac2d8c968e386a
institution	Directory Open Access Journal
issn	2169-3536
language	English
last_indexed	2024-04-11T11:20:45Z
publishDate	2022-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj.art-9877bc201e44404b8aac2d8c968e386a2022-12-22T04:27:04ZengIEEEIEEE Access2169-35362022-01-0110978979791110.1109/ACCESS.2022.32060359887962On Meeting a Maximum Delay Constraint Using Reinforcement LearningHossein Shafieirad0https://orcid.org/0000-0002-0633-404XRaviraj S. Adve1https://orcid.org/0000-0003-0224-2209Department of Electrical and Computer Engineering, University of Toronto, Toronto, ON, CanadaDepartment of Electrical and Computer Engineering, University of Toronto, Toronto, ON, CanadaSeveral emerging applications in wireless communications are required to achieve low latency, but also high traffic rates and reliabilities. From a latency point of view, most of the state-of-the-art techniques consider the average latency which may not directly apply to scenarios with stringent latency constraints. In this paper, we consider scheduling under a max-delay constraint; this is an NP-hard problem. We propose a novel approach to tackle the scheduling problem by directly addressing the constraint. We consider the downlink of a multi-cell wireless communication network with nodes communicating with users each facing their own delay constraint on randomly arrived packets. Packets must be scheduled to meet the users’ delay constraints. Our main contributions are first, proposing a new search approach, Super State Monte-Carlo Tree Search (SS-MCTS), as a version of regular MCTS modified for large-scale probabilistic environments; second, developing trained value and policy networks to reduce computational complexity, and finally, addressing the scheduling problem through a reinforcement learning framework. Our numerical results demonstrate that the proposed approach significantly improves the packet delivery rate over a baseline approach while meeting the max-delay constraint, and addressing the scalability as the main issues in large action-state spaces.https://ieeexplore.ieee.org/document/9887962/Monte-Carlo tree searchschedulingreinforcement learningmax-delay constraints
spellingShingle	Hossein Shafieirad Raviraj S. Adve On Meeting a Maximum Delay Constraint Using Reinforcement Learning IEEE Access Monte-Carlo tree search scheduling reinforcement learning max-delay constraints
title	On Meeting a Maximum Delay Constraint Using Reinforcement Learning
title_full	On Meeting a Maximum Delay Constraint Using Reinforcement Learning
title_fullStr	On Meeting a Maximum Delay Constraint Using Reinforcement Learning
title_full_unstemmed	On Meeting a Maximum Delay Constraint Using Reinforcement Learning
title_short	On Meeting a Maximum Delay Constraint Using Reinforcement Learning
title_sort	on meeting a maximum delay constraint using reinforcement learning
topic	Monte-Carlo tree search scheduling reinforcement learning max-delay constraints
url	https://ieeexplore.ieee.org/document/9887962/
work_keys_str_mv	AT hosseinshafieirad onmeetingamaximumdelayconstraintusingreinforcementlearning AT ravirajsadve onmeetingamaximumdelayconstraintusingreinforcementlearning

On Meeting a Maximum Delay Constraint Using Reinforcement Learning

Similar Items