An Intelligent Time Series Model Based on Hybrid Methodology for Forecasting Concentrations of Significant Air Pollutants

Rapid industrialization and urban development are the main causes of air pollution, leading to daily air quality and health problems. To find significant pollutants and forecast their concentrations, in this study, we used a hybrid methodology, including integrated variable selection, autoregressive...

Full description

Bibliographic Details
Main Authors: Ching-Hsue Cheng, Ming-Chi Tsai
Format: Article
Language:English
Published: MDPI AG 2022-07-01
Series:Atmosphere
Subjects:
Online Access:https://www.mdpi.com/2073-4433/13/7/1055
_version_ 1797440946773688320
author Ching-Hsue Cheng
Ming-Chi Tsai
author_facet Ching-Hsue Cheng
Ming-Chi Tsai
author_sort Ching-Hsue Cheng
collection DOAJ
description Rapid industrialization and urban development are the main causes of air pollution, leading to daily air quality and health problems. To find significant pollutants and forecast their concentrations, in this study, we used a hybrid methodology, including integrated variable selection, autoregressive distributed lag, and deleted multiple collinear variables to reduce variables, and then applied six intelligent time series models to forecast the concentrations of the top three pollution sources. We collected two air quality datasets from traffic and industrial monitoring stations and weather data to analyze and compare their results. The results show that a random forest based on selected key variables has better classification metrics (accuracy, AUC, recall, precision, and F1). After deleting the collinearity of the independent variables and adding the lag periods using the autoregressive distributed lag model, the intelligent time-series support vector regression was found to have better forecasting performance (RMSE and MAE). Finally, the research results could be used as a reference by all relevant stakeholders and help respond to poor air quality.
first_indexed 2024-03-09T12:15:47Z
format Article
id doaj.art-5eeaff4172d74008bac3cc9755b3d89d
institution Directory Open Access Journal
issn 2073-4433
language English
last_indexed 2024-03-09T12:15:47Z
publishDate 2022-07-01
publisher MDPI AG
record_format Article
series Atmosphere
spelling doaj.art-5eeaff4172d74008bac3cc9755b3d89d2023-11-30T22:46:42ZengMDPI AGAtmosphere2073-44332022-07-01137105510.3390/atmos13071055An Intelligent Time Series Model Based on Hybrid Methodology for Forecasting Concentrations of Significant Air PollutantsChing-Hsue Cheng0Ming-Chi Tsai1Department of Information Management, National Yunlin University of Science and Technology, Douliu 640301, TaiwanDepartment of Business Administration, I-Shou University; Kaohsiung City 840301, TaiwanRapid industrialization and urban development are the main causes of air pollution, leading to daily air quality and health problems. To find significant pollutants and forecast their concentrations, in this study, we used a hybrid methodology, including integrated variable selection, autoregressive distributed lag, and deleted multiple collinear variables to reduce variables, and then applied six intelligent time series models to forecast the concentrations of the top three pollution sources. We collected two air quality datasets from traffic and industrial monitoring stations and weather data to analyze and compare their results. The results show that a random forest based on selected key variables has better classification metrics (accuracy, AUC, recall, precision, and F1). After deleting the collinearity of the independent variables and adding the lag periods using the autoregressive distributed lag model, the intelligent time-series support vector regression was found to have better forecasting performance (RMSE and MAE). Finally, the research results could be used as a reference by all relevant stakeholders and help respond to poor air quality.https://www.mdpi.com/2073-4433/13/7/1055air pollutionvariable selectionautoregressive distributed lag modelair quality rulestime-series forecast model
spellingShingle Ching-Hsue Cheng
Ming-Chi Tsai
An Intelligent Time Series Model Based on Hybrid Methodology for Forecasting Concentrations of Significant Air Pollutants
Atmosphere
air pollution
variable selection
autoregressive distributed lag model
air quality rules
time-series forecast model
title An Intelligent Time Series Model Based on Hybrid Methodology for Forecasting Concentrations of Significant Air Pollutants
title_full An Intelligent Time Series Model Based on Hybrid Methodology for Forecasting Concentrations of Significant Air Pollutants
title_fullStr An Intelligent Time Series Model Based on Hybrid Methodology for Forecasting Concentrations of Significant Air Pollutants
title_full_unstemmed An Intelligent Time Series Model Based on Hybrid Methodology for Forecasting Concentrations of Significant Air Pollutants
title_short An Intelligent Time Series Model Based on Hybrid Methodology for Forecasting Concentrations of Significant Air Pollutants
title_sort intelligent time series model based on hybrid methodology for forecasting concentrations of significant air pollutants
topic air pollution
variable selection
autoregressive distributed lag model
air quality rules
time-series forecast model
url https://www.mdpi.com/2073-4433/13/7/1055
work_keys_str_mv AT chinghsuecheng anintelligenttimeseriesmodelbasedonhybridmethodologyforforecastingconcentrationsofsignificantairpollutants
AT mingchitsai anintelligenttimeseriesmodelbasedonhybridmethodologyforforecastingconcentrationsofsignificantairpollutants
AT chinghsuecheng intelligenttimeseriesmodelbasedonhybridmethodologyforforecastingconcentrationsofsignificantairpollutants
AT mingchitsai intelligenttimeseriesmodelbasedonhybridmethodologyforforecastingconcentrationsofsignificantairpollutants