Online Personalized Preference Learning Method Based on In-Formative Query for Lane Centering Control Trajectory

The personalization of autonomous vehicles or advanced driver assistance systems has been a widely researched topic, with many proposals aiming to achieve human-like or driver-imitating methods. However, these approaches rely on an implicit assumption that all drivers prefer the vehicle to drive lik...

Full description

Bibliographic Details
Main Authors: Wei Ran, Hui Chen, Taokai Xia, Yosuke Nishimura, Chaopeng Guo, Youyu Yin
Format: Article
Language:English
Published: MDPI AG 2023-05-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/23/11/5246
_version_ 1797596688884432896
author Wei Ran
Hui Chen
Taokai Xia
Yosuke Nishimura
Chaopeng Guo
Youyu Yin
author_facet Wei Ran
Hui Chen
Taokai Xia
Yosuke Nishimura
Chaopeng Guo
Youyu Yin
author_sort Wei Ran
collection DOAJ
description The personalization of autonomous vehicles or advanced driver assistance systems has been a widely researched topic, with many proposals aiming to achieve human-like or driver-imitating methods. However, these approaches rely on an implicit assumption that all drivers prefer the vehicle to drive like themselves, which may not hold true for all drivers. To address this issue, this study proposes an online personalized preference learning method (OPPLM) that utilizes a pairwise comparison group preference query and the Bayesian approach. The proposed OPPLM adopts a two-layer hierarchical structure model based on utility theory to represent driver preferences on the trajectory. To improve the accuracy of learning, the uncertainty of driver query answers is modeled. In addition, informative query and greedy query selection methods are used to improve learning speed. To determine when the driver’s preferred trajectory has been found, a convergence criterion is proposed. To evaluate the effectiveness of the OPPLM, a user study is conducted to learn the driver’s preferred trajectory in the curve of the lane centering control (LCC) system. The results show that the OPPLM can converge quickly, requiring only about 11 queries on average. Moreover, it accurately learned the driver’s favorite trajectory, and the estimated utility of the driver preference model is highly consistent with the subject evaluation score.
first_indexed 2024-03-11T02:56:35Z
format Article
id doaj.art-4c28e4d1e5a04793b1d34f3ec7b8b648
institution Directory Open Access Journal
issn 1424-8220
language English
last_indexed 2024-03-11T02:56:35Z
publishDate 2023-05-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj.art-4c28e4d1e5a04793b1d34f3ec7b8b6482023-11-18T08:34:32ZengMDPI AGSensors1424-82202023-05-012311524610.3390/s23115246Online Personalized Preference Learning Method Based on In-Formative Query for Lane Centering Control TrajectoryWei Ran0Hui Chen1Taokai Xia2Yosuke Nishimura3Chaopeng Guo4Youyu Yin5School of Automotive Studies, Tongji University, Shanghai 201804, ChinaSchool of Automotive Studies, Tongji University, Shanghai 201804, ChinaSchool of Automotive Studies, Tongji University, Shanghai 201804, ChinaJTEKT Corporation, Nara 634-8555, JapanJTEKT Corporation, Nara 634-8555, JapanJTEKT Research and Development Center (WUXI) Co., Ltd., Wuxi 214161, ChinaThe personalization of autonomous vehicles or advanced driver assistance systems has been a widely researched topic, with many proposals aiming to achieve human-like or driver-imitating methods. However, these approaches rely on an implicit assumption that all drivers prefer the vehicle to drive like themselves, which may not hold true for all drivers. To address this issue, this study proposes an online personalized preference learning method (OPPLM) that utilizes a pairwise comparison group preference query and the Bayesian approach. The proposed OPPLM adopts a two-layer hierarchical structure model based on utility theory to represent driver preferences on the trajectory. To improve the accuracy of learning, the uncertainty of driver query answers is modeled. In addition, informative query and greedy query selection methods are used to improve learning speed. To determine when the driver’s preferred trajectory has been found, a convergence criterion is proposed. To evaluate the effectiveness of the OPPLM, a user study is conducted to learn the driver’s preferred trajectory in the curve of the lane centering control (LCC) system. The results show that the OPPLM can converge quickly, requiring only about 11 queries on average. Moreover, it accurately learned the driver’s favorite trajectory, and the estimated utility of the driver preference model is highly consistent with the subject evaluation score.https://www.mdpi.com/1424-8220/23/11/5246online learningpreference learningutility theoryBayesian approachLCC trajectory
spellingShingle Wei Ran
Hui Chen
Taokai Xia
Yosuke Nishimura
Chaopeng Guo
Youyu Yin
Online Personalized Preference Learning Method Based on In-Formative Query for Lane Centering Control Trajectory
Sensors
online learning
preference learning
utility theory
Bayesian approach
LCC trajectory
title Online Personalized Preference Learning Method Based on In-Formative Query for Lane Centering Control Trajectory
title_full Online Personalized Preference Learning Method Based on In-Formative Query for Lane Centering Control Trajectory
title_fullStr Online Personalized Preference Learning Method Based on In-Formative Query for Lane Centering Control Trajectory
title_full_unstemmed Online Personalized Preference Learning Method Based on In-Formative Query for Lane Centering Control Trajectory
title_short Online Personalized Preference Learning Method Based on In-Formative Query for Lane Centering Control Trajectory
title_sort online personalized preference learning method based on in formative query for lane centering control trajectory
topic online learning
preference learning
utility theory
Bayesian approach
LCC trajectory
url https://www.mdpi.com/1424-8220/23/11/5246
work_keys_str_mv AT weiran onlinepersonalizedpreferencelearningmethodbasedoninformativequeryforlanecenteringcontroltrajectory
AT huichen onlinepersonalizedpreferencelearningmethodbasedoninformativequeryforlanecenteringcontroltrajectory
AT taokaixia onlinepersonalizedpreferencelearningmethodbasedoninformativequeryforlanecenteringcontroltrajectory
AT yosukenishimura onlinepersonalizedpreferencelearningmethodbasedoninformativequeryforlanecenteringcontroltrajectory
AT chaopengguo onlinepersonalizedpreferencelearningmethodbasedoninformativequeryforlanecenteringcontroltrajectory
AT youyuyin onlinepersonalizedpreferencelearningmethodbasedoninformativequeryforlanecenteringcontroltrajectory