Missing Traffic Data Imputation for Artificial Intelligence in Intelligent Transportation Systems: Review of Methods, Limitations, and Challenges
Missing data in Intelligent Transportation Systems (ITS) could lead to possible errors in the analyses of traffic data. Applying Artificial Intelligence (AI) in these circumstances can mitigate such problems. Past works focused only on specific data imputation methods, such as tensor factorization o...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2023-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/10091533/ |
_version_ | 1797849293348929536 |
---|---|
author | Robin Kuok Cheong Chan Joanne Mun-Yee Lim Rajendran Parthiban |
author_facet | Robin Kuok Cheong Chan Joanne Mun-Yee Lim Rajendran Parthiban |
author_sort | Robin Kuok Cheong Chan |
collection | DOAJ |
description | Missing data in Intelligent Transportation Systems (ITS) could lead to possible errors in the analyses of traffic data. Applying Artificial Intelligence (AI) in these circumstances can mitigate such problems. Past works focused only on specific data imputation methods, such as tensor factorization or a specific neural network model. While there are review papers covering singular topics regarding missing data, there are none in the field of traffic, to the best of our knowledge, that introduces the process of missing data collection and the viability of the traffic data collected while also broadly covering the popularly used models of recent years. This has led to non-uniformity of the terms used in missing data imputation, limited research in areas where datasets are not available, and a narrowed view of the methods used for data imputation. Hence, this paper aims to standardize the terms used in missing data classifications, look into the limitations of using available public or private datasets for urban traffic research, and discuss popular statistical and data-driven methods used by recent AI and ITS papers. It was found that tensor decomposition-based methods are the most popular for missing data imputation, followed by Generative Adversarial Networks and Graph Neural Networks, all of which rely on a large training dataset. Meanwhile, Probability Principle Component Analysis (PPCA) methods provide valuable insights via traffic analysis and are used for real-time traffic imputation. This paper also highlights the need for more efficient and reliable methods for traffic data collection, such as online APIs. |
first_indexed | 2024-04-09T18:41:31Z |
format | Article |
id | doaj.art-3446249473864cbb9138c0c5a906e8b5 |
institution | Directory Open Access Journal |
issn | 2169-3536 |
language | English |
last_indexed | 2024-04-09T18:41:31Z |
publishDate | 2023-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj.art-3446249473864cbb9138c0c5a906e8b52023-04-10T23:01:05ZengIEEEIEEE Access2169-35362023-01-0111340803409310.1109/ACCESS.2023.326421610091533Missing Traffic Data Imputation for Artificial Intelligence in Intelligent Transportation Systems: Review of Methods, Limitations, and ChallengesRobin Kuok Cheong Chan0https://orcid.org/0000-0002-0156-8648Joanne Mun-Yee Lim1https://orcid.org/0000-0002-1326-8634Rajendran Parthiban2https://orcid.org/0000-0003-0983-9796School of Engineering, Monash University Malaysia, Bandar Sunway, Selangor, MalaysiaSchool of Engineering, Monash University Malaysia, Bandar Sunway, Selangor, MalaysiaDepartment of Electrical and Computer Systems Engineering, Faculty of Engineering, Monash University, Clayton, VIC, AustraliaMissing data in Intelligent Transportation Systems (ITS) could lead to possible errors in the analyses of traffic data. Applying Artificial Intelligence (AI) in these circumstances can mitigate such problems. Past works focused only on specific data imputation methods, such as tensor factorization or a specific neural network model. While there are review papers covering singular topics regarding missing data, there are none in the field of traffic, to the best of our knowledge, that introduces the process of missing data collection and the viability of the traffic data collected while also broadly covering the popularly used models of recent years. This has led to non-uniformity of the terms used in missing data imputation, limited research in areas where datasets are not available, and a narrowed view of the methods used for data imputation. Hence, this paper aims to standardize the terms used in missing data classifications, look into the limitations of using available public or private datasets for urban traffic research, and discuss popular statistical and data-driven methods used by recent AI and ITS papers. It was found that tensor decomposition-based methods are the most popular for missing data imputation, followed by Generative Adversarial Networks and Graph Neural Networks, all of which rely on a large training dataset. Meanwhile, Probability Principle Component Analysis (PPCA) methods provide valuable insights via traffic analysis and are used for real-time traffic imputation. This paper also highlights the need for more efficient and reliable methods for traffic data collection, such as online APIs.https://ieeexplore.ieee.org/document/10091533/Intelligent transportation systemsartificial intelligencecommunication system operations and managementreviews |
spellingShingle | Robin Kuok Cheong Chan Joanne Mun-Yee Lim Rajendran Parthiban Missing Traffic Data Imputation for Artificial Intelligence in Intelligent Transportation Systems: Review of Methods, Limitations, and Challenges IEEE Access Intelligent transportation systems artificial intelligence communication system operations and management reviews |
title | Missing Traffic Data Imputation for Artificial Intelligence in Intelligent Transportation Systems: Review of Methods, Limitations, and Challenges |
title_full | Missing Traffic Data Imputation for Artificial Intelligence in Intelligent Transportation Systems: Review of Methods, Limitations, and Challenges |
title_fullStr | Missing Traffic Data Imputation for Artificial Intelligence in Intelligent Transportation Systems: Review of Methods, Limitations, and Challenges |
title_full_unstemmed | Missing Traffic Data Imputation for Artificial Intelligence in Intelligent Transportation Systems: Review of Methods, Limitations, and Challenges |
title_short | Missing Traffic Data Imputation for Artificial Intelligence in Intelligent Transportation Systems: Review of Methods, Limitations, and Challenges |
title_sort | missing traffic data imputation for artificial intelligence in intelligent transportation systems review of methods limitations and challenges |
topic | Intelligent transportation systems artificial intelligence communication system operations and management reviews |
url | https://ieeexplore.ieee.org/document/10091533/ |
work_keys_str_mv | AT robinkuokcheongchan missingtrafficdataimputationforartificialintelligenceinintelligenttransportationsystemsreviewofmethodslimitationsandchallenges AT joannemunyeelim missingtrafficdataimputationforartificialintelligenceinintelligenttransportationsystemsreviewofmethodslimitationsandchallenges AT rajendranparthiban missingtrafficdataimputationforartificialintelligenceinintelligenttransportationsystemsreviewofmethodslimitationsandchallenges |