Pedestrian Trajectory Prediction for Real-Time Autonomous Systems via Context-Augmented Transformer Networks

Forecasting the trajectory of pedestrians in shared urban traffic environments from non-invasive sensor modalities is still considered one of the challenging problems facing the development of autonomous vehicles (AVs). In the literature, this problem is often tackled using recurrent neural networks...

Full description

Bibliographic Details
Main Author:	Khaled Saleh
Format:	Article
Language:	English
Published:	MDPI AG 2022-10-01
Series:	Sensors
Subjects:	pedestrian trajectory intent autonomous vehicles
Online Access:	https://www.mdpi.com/1424-8220/22/19/7495

_version_	1797476892317581312
author	Khaled Saleh
author_facet	Khaled Saleh
author_sort	Khaled Saleh
collection	DOAJ
description	Forecasting the trajectory of pedestrians in shared urban traffic environments from non-invasive sensor modalities is still considered one of the challenging problems facing the development of autonomous vehicles (AVs). In the literature, this problem is often tackled using recurrent neural networks (RNNs). Despite the powerful capabilities of RNNs in capturing the temporal dependency in the pedestrians’ motion trajectories, they were argued to be challenged when dealing with longer sequential data. Additionally, whilst the accommodation for contextual information (such as scene semantics and agents interactions) was shown to be effective for robust trajectory prediction, they can also impact the overall real-time performance of prediction system. Thus, in this work, we are introducing a framework based on the transformer networks that were demonstrated recently to be more efficient and outperformed RNNs in many sequential-based tasks. We relied on a fusion of sensor modalities, namely the past positional information, agent interactions information and scene physical semantics information as an input to our framework in order to not only provide a robust trajectory prediction of pedestrians, but also achieve real-time performance for multi-pedestrians’ trajectory prediction. We have evaluated our framework on three real-life datasets of pedestrians in shared urban traffic environments and it has outperformed the compared baseline approaches in both short-term and long-term prediction horizons. For the short-term prediction horizon, our approach has achieved lower scores according to the average displacement error and the root-mean squared error (ADE/RMSE) of predictions over the state-of-the art (SOTA) approach by more than 11 cm and 23 cm, respectively. While for the long-term prediction horizon, our approach has achieved lower ADE and FDE over the SOTA approach by more than 62 cm and 165 cm, respectively. Additionally, our approach has achieved superior real time performance by scoring only 0.025 s (i.e., it can provide 40 individual trajectory predictions per second).
first_indexed	2024-03-09T21:10:12Z
format	Article
id	doaj.art-827a16cbcd5e456b8d8d13b3543626dc
institution	Directory Open Access Journal
issn	1424-8220
language	English
last_indexed	2024-03-09T21:10:12Z
publishDate	2022-10-01
publisher	MDPI AG
record_format	Article
series	Sensors
spelling	doaj.art-827a16cbcd5e456b8d8d13b3543626dc2023-11-23T21:50:04ZengMDPI AGSensors1424-82202022-10-012219749510.3390/s22197495Pedestrian Trajectory Prediction for Real-Time Autonomous Systems via Context-Augmented Transformer NetworksKhaled Saleh0School of Information and Physical Sciences, The University of Newcastle, Callaghan, NSW 2308, AustraliaForecasting the trajectory of pedestrians in shared urban traffic environments from non-invasive sensor modalities is still considered one of the challenging problems facing the development of autonomous vehicles (AVs). In the literature, this problem is often tackled using recurrent neural networks (RNNs). Despite the powerful capabilities of RNNs in capturing the temporal dependency in the pedestrians’ motion trajectories, they were argued to be challenged when dealing with longer sequential data. Additionally, whilst the accommodation for contextual information (such as scene semantics and agents interactions) was shown to be effective for robust trajectory prediction, they can also impact the overall real-time performance of prediction system. Thus, in this work, we are introducing a framework based on the transformer networks that were demonstrated recently to be more efficient and outperformed RNNs in many sequential-based tasks. We relied on a fusion of sensor modalities, namely the past positional information, agent interactions information and scene physical semantics information as an input to our framework in order to not only provide a robust trajectory prediction of pedestrians, but also achieve real-time performance for multi-pedestrians’ trajectory prediction. We have evaluated our framework on three real-life datasets of pedestrians in shared urban traffic environments and it has outperformed the compared baseline approaches in both short-term and long-term prediction horizons. For the short-term prediction horizon, our approach has achieved lower scores according to the average displacement error and the root-mean squared error (ADE/RMSE) of predictions over the state-of-the art (SOTA) approach by more than 11 cm and 23 cm, respectively. While for the long-term prediction horizon, our approach has achieved lower ADE and FDE over the SOTA approach by more than 62 cm and 165 cm, respectively. Additionally, our approach has achieved superior real time performance by scoring only 0.025 s (i.e., it can provide 40 individual trajectory predictions per second).https://www.mdpi.com/1424-8220/22/19/7495pedestriantrajectoryintentautonomous vehicles
spellingShingle	Khaled Saleh Pedestrian Trajectory Prediction for Real-Time Autonomous Systems via Context-Augmented Transformer Networks Sensors pedestrian trajectory intent autonomous vehicles
title	Pedestrian Trajectory Prediction for Real-Time Autonomous Systems via Context-Augmented Transformer Networks
title_full	Pedestrian Trajectory Prediction for Real-Time Autonomous Systems via Context-Augmented Transformer Networks
title_fullStr	Pedestrian Trajectory Prediction for Real-Time Autonomous Systems via Context-Augmented Transformer Networks
title_full_unstemmed	Pedestrian Trajectory Prediction for Real-Time Autonomous Systems via Context-Augmented Transformer Networks
title_short	Pedestrian Trajectory Prediction for Real-Time Autonomous Systems via Context-Augmented Transformer Networks
title_sort	pedestrian trajectory prediction for real time autonomous systems via context augmented transformer networks
topic	pedestrian trajectory intent autonomous vehicles
url	https://www.mdpi.com/1424-8220/22/19/7495
work_keys_str_mv	AT khaledsaleh pedestriantrajectorypredictionforrealtimeautonomoussystemsviacontextaugmentedtransformernetworks

Pedestrian Trajectory Prediction for Real-Time Autonomous Systems via Context-Augmented Transformer Networks

Similar Items