Prediction of PM<sub>2.5</sub> Concentration Based on the LSTM-TSLightGBM Variable Weight Combination Model

PM<sub>2.5</sub> is one of the main pollutants that cause air pollution, and high concentrations of PM<sub>2.5</sub> seriously threaten human health. Therefore, an accurate prediction of PM<sub>2.5</sub> concentration has great practical significance for air quali...

Full description

Bibliographic Details
Main Authors: Xuchu Jiang, Yiwen Luo, Biao Zhang
Format: Article
Language:English
Published: MDPI AG 2021-09-01
Series:Atmosphere
Subjects:
Online Access:https://www.mdpi.com/2073-4433/12/9/1211
_version_ 1797520264219590656
author Xuchu Jiang
Yiwen Luo
Biao Zhang
author_facet Xuchu Jiang
Yiwen Luo
Biao Zhang
author_sort Xuchu Jiang
collection DOAJ
description PM<sub>2.5</sub> is one of the main pollutants that cause air pollution, and high concentrations of PM<sub>2.5</sub> seriously threaten human health. Therefore, an accurate prediction of PM<sub>2.5</sub> concentration has great practical significance for air quality detection, air pollution restoration, and human health. This paper uses the historical air quality concentration data and meteorological data of the Beijing Olympic Sports Center as the research object. This paper establishes a long short-term memory (LSTM) model with a time window size of 12, establishes a T-shape light gradient boosting machine (TSLightGBM) model that uses all information in the time window as the next period of prediction input, and establishes a LSTM-TSLightGBM model pair based on an optimal weighted combination method. PM<sub>2.5</sub> hourly concentration is predicted. The prediction results on the test set show that the mean squared error (MAE), root mean squared error (RMSE), and symmetric mean absolute percentage error (SMAPE) of the LSTM-TSLightGBM model are 11.873, 22.516, and 19.540%, respectively. Compared with LSTM, TSLightGBM, the recurrent neural network (RNN), and other models, the LSTM-TSLightGBM model has a lower MAE, RMSE, and SMAPE, and higher prediction accuracy for PM<sub>2.5</sub> and better goodness-of-fit.
first_indexed 2024-03-10T07:54:19Z
format Article
id doaj.art-876594d5629e42d0ae70f2357836bcbb
institution Directory Open Access Journal
issn 2073-4433
language English
last_indexed 2024-03-10T07:54:19Z
publishDate 2021-09-01
publisher MDPI AG
record_format Article
series Atmosphere
spelling doaj.art-876594d5629e42d0ae70f2357836bcbb2023-11-22T12:00:54ZengMDPI AGAtmosphere2073-44332021-09-01129121110.3390/atmos12091211Prediction of PM<sub>2.5</sub> Concentration Based on the LSTM-TSLightGBM Variable Weight Combination ModelXuchu Jiang0Yiwen Luo1Biao Zhang2School of Statistics and Mathematics, Zhongnan University of Economics and Law, Wuhan 430073, ChinaSchool of Statistics and Mathematics, Zhongnan University of Economics and Law, Wuhan 430073, ChinaSchool of Computer Science, Liaocheng University, Liaocheng 252059, ChinaPM<sub>2.5</sub> is one of the main pollutants that cause air pollution, and high concentrations of PM<sub>2.5</sub> seriously threaten human health. Therefore, an accurate prediction of PM<sub>2.5</sub> concentration has great practical significance for air quality detection, air pollution restoration, and human health. This paper uses the historical air quality concentration data and meteorological data of the Beijing Olympic Sports Center as the research object. This paper establishes a long short-term memory (LSTM) model with a time window size of 12, establishes a T-shape light gradient boosting machine (TSLightGBM) model that uses all information in the time window as the next period of prediction input, and establishes a LSTM-TSLightGBM model pair based on an optimal weighted combination method. PM<sub>2.5</sub> hourly concentration is predicted. The prediction results on the test set show that the mean squared error (MAE), root mean squared error (RMSE), and symmetric mean absolute percentage error (SMAPE) of the LSTM-TSLightGBM model are 11.873, 22.516, and 19.540%, respectively. Compared with LSTM, TSLightGBM, the recurrent neural network (RNN), and other models, the LSTM-TSLightGBM model has a lower MAE, RMSE, and SMAPE, and higher prediction accuracy for PM<sub>2.5</sub> and better goodness-of-fit.https://www.mdpi.com/2073-4433/12/9/1211PM<sub>2.5</sub> concentrationLSTMTSLightGBMtime windowfeature construction
spellingShingle Xuchu Jiang
Yiwen Luo
Biao Zhang
Prediction of PM<sub>2.5</sub> Concentration Based on the LSTM-TSLightGBM Variable Weight Combination Model
Atmosphere
PM<sub>2.5</sub> concentration
LSTM
TSLightGBM
time window
feature construction
title Prediction of PM<sub>2.5</sub> Concentration Based on the LSTM-TSLightGBM Variable Weight Combination Model
title_full Prediction of PM<sub>2.5</sub> Concentration Based on the LSTM-TSLightGBM Variable Weight Combination Model
title_fullStr Prediction of PM<sub>2.5</sub> Concentration Based on the LSTM-TSLightGBM Variable Weight Combination Model
title_full_unstemmed Prediction of PM<sub>2.5</sub> Concentration Based on the LSTM-TSLightGBM Variable Weight Combination Model
title_short Prediction of PM<sub>2.5</sub> Concentration Based on the LSTM-TSLightGBM Variable Weight Combination Model
title_sort prediction of pm sub 2 5 sub concentration based on the lstm tslightgbm variable weight combination model
topic PM<sub>2.5</sub> concentration
LSTM
TSLightGBM
time window
feature construction
url https://www.mdpi.com/2073-4433/12/9/1211
work_keys_str_mv AT xuchujiang predictionofpmsub25subconcentrationbasedonthelstmtslightgbmvariableweightcombinationmodel
AT yiwenluo predictionofpmsub25subconcentrationbasedonthelstmtslightgbmvariableweightcombinationmodel
AT biaozhang predictionofpmsub25subconcentrationbasedonthelstmtslightgbmvariableweightcombinationmodel