Deep reinforcement learning based offloading decision algorithm for vehicular edge computing

Task offloading decision is one of the core technologies of vehicular edge computing. Efficient offloading decision can not only meet the requirements of complex vehicle tasks in terms of time, energy consumption and computing performance, but also reduce the competition and consumption of network r...

Full description

Bibliographic Details
Main Authors: Xi Hu, Yang Huang
Format: Article
Language:English
Published: PeerJ Inc. 2022-10-01
Series:PeerJ Computer Science
Subjects:
Online Access:https://peerj.com/articles/cs-1126.pdf
_version_ 1798020183257776128
author Xi Hu
Yang Huang
author_facet Xi Hu
Yang Huang
author_sort Xi Hu
collection DOAJ
description Task offloading decision is one of the core technologies of vehicular edge computing. Efficient offloading decision can not only meet the requirements of complex vehicle tasks in terms of time, energy consumption and computing performance, but also reduce the competition and consumption of network resources. Traditional distributed task offloading decision is made by vehicles based on local states and can’t maximize the resource utilization of Mobile Edge Computing (MEC) server. Moreover, the mobility of vehicles is rarely taken into consideration for simplification. This article proposes a deep reinforcement learning based task offloading decision algorithm, namely Deep Reinforcement learning based offloading decision (DROD) for Vehicular Edge Computing (VEC). In this work, the mobility of vehicles and the signal blocking commonly in VEC circumstance are considered in our optimal problem of minimizing the system overhead. For resolving the optimal problem, the DROD employs Markov decision process to model the interactions between vehicles and MEC server, and an improved deep deterministic policy gradient algorithm called NLDDPG to train the model iteratively to obtain the optimal decision. The NLDDPG takes the normalized state space as input and introduces LSTM structure into the actor-critic network for improving the efficiency of learning. Finally, two series of experiments are conducted to explore DROD. Firstly, the influences of core hyper-parameters on the performances of DROD are discussed, and the optimal values are determined. Secondly, the DROD is compared with some other baseline algorithms, and the results show that DROD is 25% better than DQN, 10% better than NLDQN and 130% better than DDDPG.
first_indexed 2024-04-11T16:53:29Z
format Article
id doaj.art-abc496fe7d3347f5b2b491231814d362
institution Directory Open Access Journal
issn 2376-5992
language English
last_indexed 2024-04-11T16:53:29Z
publishDate 2022-10-01
publisher PeerJ Inc.
record_format Article
series PeerJ Computer Science
spelling doaj.art-abc496fe7d3347f5b2b491231814d3622022-12-22T04:13:22ZengPeerJ Inc.PeerJ Computer Science2376-59922022-10-018e112610.7717/peerj-cs.1126Deep reinforcement learning based offloading decision algorithm for vehicular edge computingXi HuYang HuangTask offloading decision is one of the core technologies of vehicular edge computing. Efficient offloading decision can not only meet the requirements of complex vehicle tasks in terms of time, energy consumption and computing performance, but also reduce the competition and consumption of network resources. Traditional distributed task offloading decision is made by vehicles based on local states and can’t maximize the resource utilization of Mobile Edge Computing (MEC) server. Moreover, the mobility of vehicles is rarely taken into consideration for simplification. This article proposes a deep reinforcement learning based task offloading decision algorithm, namely Deep Reinforcement learning based offloading decision (DROD) for Vehicular Edge Computing (VEC). In this work, the mobility of vehicles and the signal blocking commonly in VEC circumstance are considered in our optimal problem of minimizing the system overhead. For resolving the optimal problem, the DROD employs Markov decision process to model the interactions between vehicles and MEC server, and an improved deep deterministic policy gradient algorithm called NLDDPG to train the model iteratively to obtain the optimal decision. The NLDDPG takes the normalized state space as input and introduces LSTM structure into the actor-critic network for improving the efficiency of learning. Finally, two series of experiments are conducted to explore DROD. Firstly, the influences of core hyper-parameters on the performances of DROD are discussed, and the optimal values are determined. Secondly, the DROD is compared with some other baseline algorithms, and the results show that DROD is 25% better than DQN, 10% better than NLDQN and 130% better than DDDPG.https://peerj.com/articles/cs-1126.pdfVehicular edge computingOffloading decisionMarkov decision processDeep reinforcement learningSystem overhead
spellingShingle Xi Hu
Yang Huang
Deep reinforcement learning based offloading decision algorithm for vehicular edge computing
PeerJ Computer Science
Vehicular edge computing
Offloading decision
Markov decision process
Deep reinforcement learning
System overhead
title Deep reinforcement learning based offloading decision algorithm for vehicular edge computing
title_full Deep reinforcement learning based offloading decision algorithm for vehicular edge computing
title_fullStr Deep reinforcement learning based offloading decision algorithm for vehicular edge computing
title_full_unstemmed Deep reinforcement learning based offloading decision algorithm for vehicular edge computing
title_short Deep reinforcement learning based offloading decision algorithm for vehicular edge computing
title_sort deep reinforcement learning based offloading decision algorithm for vehicular edge computing
topic Vehicular edge computing
Offloading decision
Markov decision process
Deep reinforcement learning
System overhead
url https://peerj.com/articles/cs-1126.pdf
work_keys_str_mv AT xihu deepreinforcementlearningbasedoffloadingdecisionalgorithmforvehicularedgecomputing
AT yanghuang deepreinforcementlearningbasedoffloadingdecisionalgorithmforvehicularedgecomputing