A deep reinforcement learning based research for optimal offloading decision

Currently, a concern about power resource constraints in the distribution environment is being voiced increasingly, where the increase of power consumption devices overwhelms the terminal load unaffordable and the quality of power consumption cannot be guaranteed. How to acquire the optimal offloadi...

Full description

Bibliographic Details
Main Authors: Jianji Ren, Donghao Yang, Yongliang Yuan, Huihui Wei, Zhenxi Wang
Format: Article
Language:English
Published: AIP Publishing LLC 2023-08-01
Series:AIP Advances
Online Access:http://dx.doi.org/10.1063/5.0157491
_version_ 1797689658845429760
author Jianji Ren
Donghao Yang
Yongliang Yuan
Huihui Wei
Zhenxi Wang
author_facet Jianji Ren
Donghao Yang
Yongliang Yuan
Huihui Wei
Zhenxi Wang
author_sort Jianji Ren
collection DOAJ
description Currently, a concern about power resource constraints in the distribution environment is being voiced increasingly, where the increase of power consumption devices overwhelms the terminal load unaffordable and the quality of power consumption cannot be guaranteed. How to acquire the optimal offloading decision of power resources has become a problem that needs to be addressed urgently. To tackle this challenge, a novel reinforcement learning algorithm named Deep Q Network with a partial offloading strategy (DQNP) is proposed to optimize power resource allocation for high computational demands. In the DQNP, a coupled coordination degree model and Lyapunov algorithm are introduced, which trade-offs and decouples the relationships between local-edge and latency–energy consumption. To derive the optimal offloading decision, the resource computation utility function is selected as the objective function. In addition, model pruning is availed to further improve the training time and inference results. Results show that the proposed offloading mechanism can significantly decrease the function value and decline the weighted sum of latency and energy consumption by an average of 3.61%–7.31% relative to other state-of-the-art algorithms. Additionally, the energy loss in the power distribution process is successfully mitigated; furthermore, the effectiveness of the proposed algorithm is also verified.
first_indexed 2024-03-12T01:48:30Z
format Article
id doaj.art-98854fac853e4e35aeac2831d7430623
institution Directory Open Access Journal
issn 2158-3226
language English
last_indexed 2024-03-12T01:48:30Z
publishDate 2023-08-01
publisher AIP Publishing LLC
record_format Article
series AIP Advances
spelling doaj.art-98854fac853e4e35aeac2831d74306232023-09-08T16:03:30ZengAIP Publishing LLCAIP Advances2158-32262023-08-01138085204085204-910.1063/5.0157491A deep reinforcement learning based research for optimal offloading decisionJianji Ren0Donghao Yang1Yongliang Yuan2Huihui Wei3Zhenxi Wang4School of Software, Henan Polytechnic University, Jiaozuo 454003, Henan, ChinaSchool of Software, Henan Polytechnic University, Jiaozuo 454003, Henan, ChinaSchool of Mechanical and Power Engineering, Henan Polytechnic University, Jiaozuo 454003, Henan, ChinaSchool of Software, Henan Polytechnic University, Jiaozuo 454003, Henan, ChinaSchool of Software, Henan Polytechnic University, Jiaozuo 454003, Henan, ChinaCurrently, a concern about power resource constraints in the distribution environment is being voiced increasingly, where the increase of power consumption devices overwhelms the terminal load unaffordable and the quality of power consumption cannot be guaranteed. How to acquire the optimal offloading decision of power resources has become a problem that needs to be addressed urgently. To tackle this challenge, a novel reinforcement learning algorithm named Deep Q Network with a partial offloading strategy (DQNP) is proposed to optimize power resource allocation for high computational demands. In the DQNP, a coupled coordination degree model and Lyapunov algorithm are introduced, which trade-offs and decouples the relationships between local-edge and latency–energy consumption. To derive the optimal offloading decision, the resource computation utility function is selected as the objective function. In addition, model pruning is availed to further improve the training time and inference results. Results show that the proposed offloading mechanism can significantly decrease the function value and decline the weighted sum of latency and energy consumption by an average of 3.61%–7.31% relative to other state-of-the-art algorithms. Additionally, the energy loss in the power distribution process is successfully mitigated; furthermore, the effectiveness of the proposed algorithm is also verified.http://dx.doi.org/10.1063/5.0157491
spellingShingle Jianji Ren
Donghao Yang
Yongliang Yuan
Huihui Wei
Zhenxi Wang
A deep reinforcement learning based research for optimal offloading decision
AIP Advances
title A deep reinforcement learning based research for optimal offloading decision
title_full A deep reinforcement learning based research for optimal offloading decision
title_fullStr A deep reinforcement learning based research for optimal offloading decision
title_full_unstemmed A deep reinforcement learning based research for optimal offloading decision
title_short A deep reinforcement learning based research for optimal offloading decision
title_sort deep reinforcement learning based research for optimal offloading decision
url http://dx.doi.org/10.1063/5.0157491
work_keys_str_mv AT jianjiren adeepreinforcementlearningbasedresearchforoptimaloffloadingdecision
AT donghaoyang adeepreinforcementlearningbasedresearchforoptimaloffloadingdecision
AT yongliangyuan adeepreinforcementlearningbasedresearchforoptimaloffloadingdecision
AT huihuiwei adeepreinforcementlearningbasedresearchforoptimaloffloadingdecision
AT zhenxiwang adeepreinforcementlearningbasedresearchforoptimaloffloadingdecision
AT jianjiren deepreinforcementlearningbasedresearchforoptimaloffloadingdecision
AT donghaoyang deepreinforcementlearningbasedresearchforoptimaloffloadingdecision
AT yongliangyuan deepreinforcementlearningbasedresearchforoptimaloffloadingdecision
AT huihuiwei deepreinforcementlearningbasedresearchforoptimaloffloadingdecision
AT zhenxiwang deepreinforcementlearningbasedresearchforoptimaloffloadingdecision