Trustworthy autonomous driving via defense-aware robust reinforcement learning against worst-case observational perturbations

Despite the substantial advancements in reinforcement learning (RL) in recent years, ensuring trustworthiness remains a formidable challenge when applying this technology to safety-critical autonomous driving domains. One pivotal bottleneck is that well-trained driving policy models may be particula...

Full description

Bibliographic Details
Main Authors:	He, Xiangkun, Huang, Wenhui, Lv, Chen
Other Authors:	School of Mechanical and Aerospace Engineering
Format:	Journal Article
Language:	English
Published:	2024
Subjects:	Engineering Autonomous vehicle Traffic safety
Online Access:	https://hdl.handle.net/10356/179385

_version_	1826117298067341312
author	He, Xiangkun Huang, Wenhui Lv, Chen
author2	School of Mechanical and Aerospace Engineering
author_facet	School of Mechanical and Aerospace Engineering He, Xiangkun Huang, Wenhui Lv, Chen
author_sort	He, Xiangkun
collection	NTU
description	Despite the substantial advancements in reinforcement learning (RL) in recent years, ensuring trustworthiness remains a formidable challenge when applying this technology to safety-critical autonomous driving domains. One pivotal bottleneck is that well-trained driving policy models may be particularly vulnerable to observational perturbations or perceptual uncertainties, potentially leading to severe failures. In view of this, we present a novel defense-aware robust RL approach tailored for ensuring the robustness and safety of autonomous vehicles in the face of worst-case attacks on observations. The proposed paradigm primarily comprises two crucial modules: an adversarial attacker and a robust defender. Specifically, the adversarial attacker is devised to approximate the worst-case observational perturbations that attempt to induce safety violations (e.g., collisions) in the RL-driven autonomous vehicle. Additionally, the robust defender is developed to facilitate the safe RL agent to learn robust optimal policies that maximize the return while constraining the policy and cost perturbed by the adversarial attacker within specified bounds. Finally, the proposed technique is assessed across three distinct traffic scenarios: highway, on-ramp, and intersection. The simulation and experimental results indicate that our scheme enables the agent to execute trustworthy driving policies, even in the presence of the worst-case observational perturbations.
first_indexed	2024-10-01T04:25:19Z
format	Journal Article
id	ntu-10356/179385
institution	Nanyang Technological University
language	English
last_indexed	2024-10-01T04:25:19Z
publishDate	2024
record_format	dspace
spelling	ntu-10356/1793852024-07-29T05:25:00Z Trustworthy autonomous driving via defense-aware robust reinforcement learning against worst-case observational perturbations He, Xiangkun Huang, Wenhui Lv, Chen School of Mechanical and Aerospace Engineering Engineering Autonomous vehicle Traffic safety Despite the substantial advancements in reinforcement learning (RL) in recent years, ensuring trustworthiness remains a formidable challenge when applying this technology to safety-critical autonomous driving domains. One pivotal bottleneck is that well-trained driving policy models may be particularly vulnerable to observational perturbations or perceptual uncertainties, potentially leading to severe failures. In view of this, we present a novel defense-aware robust RL approach tailored for ensuring the robustness and safety of autonomous vehicles in the face of worst-case attacks on observations. The proposed paradigm primarily comprises two crucial modules: an adversarial attacker and a robust defender. Specifically, the adversarial attacker is devised to approximate the worst-case observational perturbations that attempt to induce safety violations (e.g., collisions) in the RL-driven autonomous vehicle. Additionally, the robust defender is developed to facilitate the safe RL agent to learn robust optimal policies that maximize the return while constraining the policy and cost perturbed by the adversarial attacker within specified bounds. Finally, the proposed technique is assessed across three distinct traffic scenarios: highway, on-ramp, and intersection. The simulation and experimental results indicate that our scheme enables the agent to execute trustworthy driving policies, even in the presence of the worst-case observational perturbations. Agency for Science, Technology and Research (ASTAR) Ministry of Education (MOE) National Research Foundation (NRF) This work was supported in part by the Agency for Science, Technology and Research (ASTAR), Singapore, under Advanced Manufacturing and Engineering (AME) Young Individual Research under Grant A2084c0156, the MTC Individual Research Grant (M22K2c0079), the ANR-NRF Joint Grant (No. NRF2021-NRF-ANR003 HM Science), and the Ministry of Education (MOE), Singapore, under the Tier 2 Grant (MOE-T2EP50222-0002). 2024-07-29T05:25:00Z 2024-07-29T05:25:00Z 2024 Journal Article He, X., Huang, W. & Lv, C. (2024). Trustworthy autonomous driving via defense-aware robust reinforcement learning against worst-case observational perturbations. Transportation Research Part C: Emerging Technologies, 163, 104632-. https://dx.doi.org/10.1016/j.trc.2024.104632 0968-090X https://hdl.handle.net/10356/179385 10.1016/j.trc.2024.104632 2-s2.0-85191985184 163 104632 en A2084c0156 M22K2c0079 NRF2021-NRF-ANR003 HM Science MOE-T2EP50222-0002 Transportation Research Part C: Emerging Technologies © 2024 Elsevier Ltd. All rights reserved.
spellingShingle	Engineering Autonomous vehicle Traffic safety He, Xiangkun Huang, Wenhui Lv, Chen Trustworthy autonomous driving via defense-aware robust reinforcement learning against worst-case observational perturbations
title	Trustworthy autonomous driving via defense-aware robust reinforcement learning against worst-case observational perturbations
title_full	Trustworthy autonomous driving via defense-aware robust reinforcement learning against worst-case observational perturbations
title_fullStr	Trustworthy autonomous driving via defense-aware robust reinforcement learning against worst-case observational perturbations
title_full_unstemmed	Trustworthy autonomous driving via defense-aware robust reinforcement learning against worst-case observational perturbations
title_short	Trustworthy autonomous driving via defense-aware robust reinforcement learning against worst-case observational perturbations
title_sort	trustworthy autonomous driving via defense aware robust reinforcement learning against worst case observational perturbations
topic	Engineering Autonomous vehicle Traffic safety
url	https://hdl.handle.net/10356/179385
work_keys_str_mv	AT hexiangkun trustworthyautonomousdrivingviadefenseawarerobustreinforcementlearningagainstworstcaseobservationalperturbations AT huangwenhui trustworthyautonomousdrivingviadefenseawarerobustreinforcementlearningagainstworstcaseobservationalperturbations AT lvchen trustworthyautonomousdrivingviadefenseawarerobustreinforcementlearningagainstworstcaseobservationalperturbations

Trustworthy autonomous driving via defense-aware robust reinforcement learning against worst-case observational perturbations

Similar Items