Multi-Task Vehicle Platoon Control: A Deep Deterministic Policy Gradient Approach

Several issues in designing a vehicle platoon control system must be considered; among them, the speed consensus and space/gap regulation between the vehicles play the primary role. In addition, reliable and fast gap-closing/opening actions are highly recommended for establishing a platoon system. N...

Full description

Bibliographic Details
Main Authors:	Mehran Berahman, Majid Rostami-Shahrbabaki, Klaus Bogenberger
Format:	Article
Language:	English
Published:	MDPI AG 2022-12-01
Series:	Future Transportation
Subjects:	deep reinforcement learning deep deterministic policy gradient connected and automated vehicles vehicular platoon effective inter-vehicle distance
Online Access:	https://www.mdpi.com/2673-7590/2/4/57

_version_	1797458106526990336
author	Mehran Berahman Majid Rostami-Shahrbabaki Klaus Bogenberger
author_facet	Mehran Berahman Majid Rostami-Shahrbabaki Klaus Bogenberger
author_sort	Mehran Berahman
collection	DOAJ
description	Several issues in designing a vehicle platoon control system must be considered; among them, the speed consensus and space/gap regulation between the vehicles play the primary role. In addition, reliable and fast gap-closing/opening actions are highly recommended for establishing a platoon system. Nonetheless, the lack of research on designing a single algorithm capable of simultaneously coping with speed-tracking and maintaining a secure headway, as well as the gap-closing/opening problems, is apparent. As deep reinforcement learning (DRL) applications in driving strategies are promising, this paper develops a multi-task deep deterministic policy gradient (DDPG) car-following algorithm in a platoon system. The proposed approach combines gap closing/opening with a unified platoon control strategy; as such, an effective virtual inter-vehicle distance is employed in the developed DRL-based platoon controller reward. This innovative new distance definition, which is based on the action taken by the ego-vehicle, leads to a precise comprehension of the agent’s actions. Moreover, by imposing a specific constraint on a variation of the ego-vehicle’s relative speed with respect to its predecessor, the speed chattering of the ego-vehicle is reduced. The developed algorithm is implemented in the realistic traffic simulator, SUMO (Simulation of Urban Mobility), and the performance of the developed control strategy is evaluated under different traffic scenarios.
first_indexed	2024-03-09T16:33:21Z
format	Article
id	doaj.art-70d7c1d324124af6be9615490823e013
institution	Directory Open Access Journal
issn	2673-7590
language	English
last_indexed	2024-03-09T16:33:21Z
publishDate	2022-12-01
publisher	MDPI AG
record_format	Article
series	Future Transportation
spelling	doaj.art-70d7c1d324124af6be9615490823e0132023-11-24T14:59:28ZengMDPI AGFuture Transportation2673-75902022-12-01241028104610.3390/futuretransp2040057Multi-Task Vehicle Platoon Control: A Deep Deterministic Policy Gradient ApproachMehran Berahman0Majid Rostami-Shahrbabaki1Klaus Bogenberger2Department of Electrical and Computer Engineering, Shiraz University, Shiraz 51154-71348, IranChair of Traffic Engineering and Control, Technical University of Munich, 80333 Munich, GermanyChair of Traffic Engineering and Control, Technical University of Munich, 80333 Munich, GermanySeveral issues in designing a vehicle platoon control system must be considered; among them, the speed consensus and space/gap regulation between the vehicles play the primary role. In addition, reliable and fast gap-closing/opening actions are highly recommended for establishing a platoon system. Nonetheless, the lack of research on designing a single algorithm capable of simultaneously coping with speed-tracking and maintaining a secure headway, as well as the gap-closing/opening problems, is apparent. As deep reinforcement learning (DRL) applications in driving strategies are promising, this paper develops a multi-task deep deterministic policy gradient (DDPG) car-following algorithm in a platoon system. The proposed approach combines gap closing/opening with a unified platoon control strategy; as such, an effective virtual inter-vehicle distance is employed in the developed DRL-based platoon controller reward. This innovative new distance definition, which is based on the action taken by the ego-vehicle, leads to a precise comprehension of the agent’s actions. Moreover, by imposing a specific constraint on a variation of the ego-vehicle’s relative speed with respect to its predecessor, the speed chattering of the ego-vehicle is reduced. The developed algorithm is implemented in the realistic traffic simulator, SUMO (Simulation of Urban Mobility), and the performance of the developed control strategy is evaluated under different traffic scenarios.https://www.mdpi.com/2673-7590/2/4/57deep reinforcement learningdeep deterministic policy gradientconnected and automated vehiclesvehicular platooneffective inter-vehicle distance
spellingShingle	Mehran Berahman Majid Rostami-Shahrbabaki Klaus Bogenberger Multi-Task Vehicle Platoon Control: A Deep Deterministic Policy Gradient Approach Future Transportation deep reinforcement learning deep deterministic policy gradient connected and automated vehicles vehicular platoon effective inter-vehicle distance
title	Multi-Task Vehicle Platoon Control: A Deep Deterministic Policy Gradient Approach
title_full	Multi-Task Vehicle Platoon Control: A Deep Deterministic Policy Gradient Approach
title_fullStr	Multi-Task Vehicle Platoon Control: A Deep Deterministic Policy Gradient Approach
title_full_unstemmed	Multi-Task Vehicle Platoon Control: A Deep Deterministic Policy Gradient Approach
title_short	Multi-Task Vehicle Platoon Control: A Deep Deterministic Policy Gradient Approach
title_sort	multi task vehicle platoon control a deep deterministic policy gradient approach
topic	deep reinforcement learning deep deterministic policy gradient connected and automated vehicles vehicular platoon effective inter-vehicle distance
url	https://www.mdpi.com/2673-7590/2/4/57
work_keys_str_mv	AT mehranberahman multitaskvehicleplatooncontroladeepdeterministicpolicygradientapproach AT majidrostamishahrbabaki multitaskvehicleplatooncontroladeepdeterministicpolicygradientapproach AT klausbogenberger multitaskvehicleplatooncontroladeepdeterministicpolicygradientapproach

Multi-Task Vehicle Platoon Control: A Deep Deterministic Policy Gradient Approach

Similar Items