Prediction of flight departure delays caused by weather conditions adopting data-driven approaches
Abstract In this study, we utilize data-driven approaches to predict flight departure delays. The growing demand for air travel is outpacing the capacity and infrastructure available to support it. In addition, abnormal weather patterns caused by climate change contribute to the frequent occurrence...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
SpringerOpen
2024-01-01
|
Series: | Journal of Big Data |
Subjects: | |
Online Access: | https://doi.org/10.1186/s40537-023-00867-5 |
_version_ | 1797355741179281408 |
---|---|
author | Seongeun Kim Eunil Park |
author_facet | Seongeun Kim Eunil Park |
author_sort | Seongeun Kim |
collection | DOAJ |
description | Abstract In this study, we utilize data-driven approaches to predict flight departure delays. The growing demand for air travel is outpacing the capacity and infrastructure available to support it. In addition, abnormal weather patterns caused by climate change contribute to the frequent occurrence of flight delays. In light of the extensive network of international flights covering vast distances across continents and oceans, the importance of forecasting flight delays over extended time periods becomes increasingly evident. Existing research has predominantly concentrated on short-term predictions, prompting our study to specifically address this aspect. We collected datasets spanning over 10 years from three different airports such as ICN airport in South Korea, JFK and MDW airport in the United States, capturing flight information at six different time intervals (2, 4, 8, 16, 24, and 48 h) prior to flight departure. The datasets comprise 1,569,879 instances for ICN, 773,347 for JFK, and 404,507 for MDW, respectively. We employed a range of machine learning and deep learning approaches, including Decision Tree, Random Forest, Support Vector Machine, K-nearest neighbors, Logistic Regression, Extreme Gradient Boosting, and Long Short-Term Memory, to predict flight delays. Our models achieved accuracy rates of 0.749 for ICN airport, 0.852 for JFK airport, and 0.785 for MDW airport in 2-h predictions. Furthermore, for 48-h predictions, our models achieved accuracy rates of 0.748 for ICN airport, 0.846 for JFK airport, and 0.772 for MDW airport based on our experimental results. Consequently, we have successfully validated the accuracy of flight delay predictions for longer time frames. The implications and future research directions derived from these findings are also discussed. |
first_indexed | 2024-03-08T14:15:24Z |
format | Article |
id | doaj.art-aed02de08a8f46ebb7ec633b0bb1dee1 |
institution | Directory Open Access Journal |
issn | 2196-1115 |
language | English |
last_indexed | 2024-03-08T14:15:24Z |
publishDate | 2024-01-01 |
publisher | SpringerOpen |
record_format | Article |
series | Journal of Big Data |
spelling | doaj.art-aed02de08a8f46ebb7ec633b0bb1dee12024-01-14T12:26:18ZengSpringerOpenJournal of Big Data2196-11152024-01-0111112510.1186/s40537-023-00867-5Prediction of flight departure delays caused by weather conditions adopting data-driven approachesSeongeun Kim0Eunil Park1Department of Semiconductor and Display Engineering, Sungkyunkwan UniversityDepartment of Interaction Science, Sungkyunkwan UniversityAbstract In this study, we utilize data-driven approaches to predict flight departure delays. The growing demand for air travel is outpacing the capacity and infrastructure available to support it. In addition, abnormal weather patterns caused by climate change contribute to the frequent occurrence of flight delays. In light of the extensive network of international flights covering vast distances across continents and oceans, the importance of forecasting flight delays over extended time periods becomes increasingly evident. Existing research has predominantly concentrated on short-term predictions, prompting our study to specifically address this aspect. We collected datasets spanning over 10 years from three different airports such as ICN airport in South Korea, JFK and MDW airport in the United States, capturing flight information at six different time intervals (2, 4, 8, 16, 24, and 48 h) prior to flight departure. The datasets comprise 1,569,879 instances for ICN, 773,347 for JFK, and 404,507 for MDW, respectively. We employed a range of machine learning and deep learning approaches, including Decision Tree, Random Forest, Support Vector Machine, K-nearest neighbors, Logistic Regression, Extreme Gradient Boosting, and Long Short-Term Memory, to predict flight delays. Our models achieved accuracy rates of 0.749 for ICN airport, 0.852 for JFK airport, and 0.785 for MDW airport in 2-h predictions. Furthermore, for 48-h predictions, our models achieved accuracy rates of 0.748 for ICN airport, 0.846 for JFK airport, and 0.772 for MDW airport based on our experimental results. Consequently, we have successfully validated the accuracy of flight delay predictions for longer time frames. The implications and future research directions derived from these findings are also discussed.https://doi.org/10.1186/s40537-023-00867-5Flight delayDelay prediction weatherMachine learningLSTM |
spellingShingle | Seongeun Kim Eunil Park Prediction of flight departure delays caused by weather conditions adopting data-driven approaches Journal of Big Data Flight delay Delay prediction weather Machine learning LSTM |
title | Prediction of flight departure delays caused by weather conditions adopting data-driven approaches |
title_full | Prediction of flight departure delays caused by weather conditions adopting data-driven approaches |
title_fullStr | Prediction of flight departure delays caused by weather conditions adopting data-driven approaches |
title_full_unstemmed | Prediction of flight departure delays caused by weather conditions adopting data-driven approaches |
title_short | Prediction of flight departure delays caused by weather conditions adopting data-driven approaches |
title_sort | prediction of flight departure delays caused by weather conditions adopting data driven approaches |
topic | Flight delay Delay prediction weather Machine learning LSTM |
url | https://doi.org/10.1186/s40537-023-00867-5 |
work_keys_str_mv | AT seongeunkim predictionofflightdeparturedelayscausedbyweatherconditionsadoptingdatadrivenapproaches AT eunilpark predictionofflightdeparturedelayscausedbyweatherconditionsadoptingdatadrivenapproaches |