Multi-Task Vehicle Platoon Control: A Deep Deterministic Policy Gradient Approach

Several issues in designing a vehicle platoon control system must be considered; among them, the speed consensus and space/gap regulation between the vehicles play the primary role. In addition, reliable and fast gap-closing/opening actions are highly recommended for establishing a platoon system. N...

Full description

Bibliographic Details
Main Authors: Mehran Berahman, Majid Rostami-Shahrbabaki, Klaus Bogenberger
Format: Article
Language:English
Published: MDPI AG 2022-12-01
Series:Future Transportation
Subjects:
Online Access:https://www.mdpi.com/2673-7590/2/4/57
_version_ 1797458106526990336
author Mehran Berahman
Majid Rostami-Shahrbabaki
Klaus Bogenberger
author_facet Mehran Berahman
Majid Rostami-Shahrbabaki
Klaus Bogenberger
author_sort Mehran Berahman
collection DOAJ
description Several issues in designing a vehicle platoon control system must be considered; among them, the speed consensus and space/gap regulation between the vehicles play the primary role. In addition, reliable and fast gap-closing/opening actions are highly recommended for establishing a platoon system. Nonetheless, the lack of research on designing a single algorithm capable of simultaneously coping with speed-tracking and maintaining a secure headway, as well as the gap-closing/opening problems, is apparent. As deep reinforcement learning (DRL) applications in driving strategies are promising, this paper develops a multi-task deep deterministic policy gradient (DDPG) car-following algorithm in a platoon system. The proposed approach combines gap closing/opening with a unified platoon control strategy; as such, an effective virtual inter-vehicle distance is employed in the developed DRL-based platoon controller reward. This innovative new distance definition, which is based on the action taken by the ego-vehicle, leads to a precise comprehension of the agent’s actions. Moreover, by imposing a specific constraint on a variation of the ego-vehicle’s relative speed with respect to its predecessor, the speed chattering of the ego-vehicle is reduced. The developed algorithm is implemented in the realistic traffic simulator, SUMO (Simulation of Urban Mobility), and the performance of the developed control strategy is evaluated under different traffic scenarios.
first_indexed 2024-03-09T16:33:21Z
format Article
id doaj.art-70d7c1d324124af6be9615490823e013
institution Directory Open Access Journal
issn 2673-7590
language English
last_indexed 2024-03-09T16:33:21Z
publishDate 2022-12-01
publisher MDPI AG
record_format Article
series Future Transportation
spelling doaj.art-70d7c1d324124af6be9615490823e0132023-11-24T14:59:28ZengMDPI AGFuture Transportation2673-75902022-12-01241028104610.3390/futuretransp2040057Multi-Task Vehicle Platoon Control: A Deep Deterministic Policy Gradient ApproachMehran Berahman0Majid Rostami-Shahrbabaki1Klaus Bogenberger2Department of Electrical and Computer Engineering, Shiraz University, Shiraz 51154-71348, IranChair of Traffic Engineering and Control, Technical University of Munich, 80333 Munich, GermanyChair of Traffic Engineering and Control, Technical University of Munich, 80333 Munich, GermanySeveral issues in designing a vehicle platoon control system must be considered; among them, the speed consensus and space/gap regulation between the vehicles play the primary role. In addition, reliable and fast gap-closing/opening actions are highly recommended for establishing a platoon system. Nonetheless, the lack of research on designing a single algorithm capable of simultaneously coping with speed-tracking and maintaining a secure headway, as well as the gap-closing/opening problems, is apparent. As deep reinforcement learning (DRL) applications in driving strategies are promising, this paper develops a multi-task deep deterministic policy gradient (DDPG) car-following algorithm in a platoon system. The proposed approach combines gap closing/opening with a unified platoon control strategy; as such, an effective virtual inter-vehicle distance is employed in the developed DRL-based platoon controller reward. This innovative new distance definition, which is based on the action taken by the ego-vehicle, leads to a precise comprehension of the agent’s actions. Moreover, by imposing a specific constraint on a variation of the ego-vehicle’s relative speed with respect to its predecessor, the speed chattering of the ego-vehicle is reduced. The developed algorithm is implemented in the realistic traffic simulator, SUMO (Simulation of Urban Mobility), and the performance of the developed control strategy is evaluated under different traffic scenarios.https://www.mdpi.com/2673-7590/2/4/57deep reinforcement learningdeep deterministic policy gradientconnected and automated vehiclesvehicular platooneffective inter-vehicle distance
spellingShingle Mehran Berahman
Majid Rostami-Shahrbabaki
Klaus Bogenberger
Multi-Task Vehicle Platoon Control: A Deep Deterministic Policy Gradient Approach
Future Transportation
deep reinforcement learning
deep deterministic policy gradient
connected and automated vehicles
vehicular platoon
effective inter-vehicle distance
title Multi-Task Vehicle Platoon Control: A Deep Deterministic Policy Gradient Approach
title_full Multi-Task Vehicle Platoon Control: A Deep Deterministic Policy Gradient Approach
title_fullStr Multi-Task Vehicle Platoon Control: A Deep Deterministic Policy Gradient Approach
title_full_unstemmed Multi-Task Vehicle Platoon Control: A Deep Deterministic Policy Gradient Approach
title_short Multi-Task Vehicle Platoon Control: A Deep Deterministic Policy Gradient Approach
title_sort multi task vehicle platoon control a deep deterministic policy gradient approach
topic deep reinforcement learning
deep deterministic policy gradient
connected and automated vehicles
vehicular platoon
effective inter-vehicle distance
url https://www.mdpi.com/2673-7590/2/4/57
work_keys_str_mv AT mehranberahman multitaskvehicleplatooncontroladeepdeterministicpolicygradientapproach
AT majidrostamishahrbabaki multitaskvehicleplatooncontroladeepdeterministicpolicygradientapproach
AT klausbogenberger multitaskvehicleplatooncontroladeepdeterministicpolicygradientapproach