PM2.5 Concentration Prediction Based on CNN-BiLSTM and Attention Mechanism

The concentration of PM2.5 is an important index to measure the degree of air pollution. When it exceeds the standard value, it is considered to cause pollution and lower the air quality, which is harmful to human health and can cause a variety of diseases, i.e., asthma, chronic bronchitis, etc. The...

Full description

Bibliographic Details
Main Authors: Jinsong Zhang, Yongtao Peng, Bo Ren, Taoying Li
Format: Article
Language:English
Published: MDPI AG 2021-07-01
Series:Algorithms
Subjects:
Online Access:https://www.mdpi.com/1999-4893/14/7/208
_version_ 1797527778352955392
author Jinsong Zhang
Yongtao Peng
Bo Ren
Taoying Li
author_facet Jinsong Zhang
Yongtao Peng
Bo Ren
Taoying Li
author_sort Jinsong Zhang
collection DOAJ
description The concentration of PM2.5 is an important index to measure the degree of air pollution. When it exceeds the standard value, it is considered to cause pollution and lower the air quality, which is harmful to human health and can cause a variety of diseases, i.e., asthma, chronic bronchitis, etc. Therefore, the prediction of PM2.5 concentration is helpful to reduce its harm. In this paper, a hybrid model called CNN-BiLSTM-Attention is proposed to predict the PM2.5 concentration over the next two days. First, we select the PM2.5 concentration data in hours from January 2013 to February 2017 of Shunyi District, Beijing. The auxiliary data includes air quality data and meteorological data. We use the sliding window method for preprocessing and dividing the corresponding data into a training set, a validation set, and a test set. Second, CNN-BiLSTM-Attention is composed of the convolutional neural network, bidirectional long short-term memory neural network, and attention mechanism. The parameters of this network structure are determined by the minimum error in the training process, including the size of the convolution kernel, activation function, batch size, dropout rate, learning rate, etc. We determine the feature size of the input and output by evaluating the performance of the model, finding out the best output for the next 48 h. Third, in the experimental part, we use the test set to check the performance of the proposed CNN-BiLSTM-Attention on PM2.5 prediction, which is compared by other comparison models, i.e., lasso regression, ridge regression, XGBOOST, SVR, CNN-LSTM, and CNN-BiLSTM. We conduct short-term prediction (48 h) and long-term prediction (72 h, 96 h, 120 h, 144 h), respectively. The results demonstrate that even the predictions of the next 144 h with CNN-BiLSTM-Attention is better than the predictions of the next 48 h with the comparison models in terms of mean absolute error (MAE), root mean square error (RMSE), and coefficient of determination (<i>R</i><sup>2</sup>).
first_indexed 2024-03-10T09:47:39Z
format Article
id doaj.art-213acdf0f04949e09e93cc4fd3d9a701
institution Directory Open Access Journal
issn 1999-4893
language English
last_indexed 2024-03-10T09:47:39Z
publishDate 2021-07-01
publisher MDPI AG
record_format Article
series Algorithms
spelling doaj.art-213acdf0f04949e09e93cc4fd3d9a7012023-11-22T02:59:28ZengMDPI AGAlgorithms1999-48932021-07-0114720810.3390/a14070208PM2.5 Concentration Prediction Based on CNN-BiLSTM and Attention MechanismJinsong Zhang0Yongtao Peng1Bo Ren2Taoying Li3School of Maritime Economics and Management, Dalian Maritime University, Dalian 116026, ChinaSchool of Maritime Economics and Management, Dalian Maritime University, Dalian 116026, ChinaSchool of Maritime Economics and Management, Dalian Maritime University, Dalian 116026, ChinaSchool of Maritime Economics and Management, Dalian Maritime University, Dalian 116026, ChinaThe concentration of PM2.5 is an important index to measure the degree of air pollution. When it exceeds the standard value, it is considered to cause pollution and lower the air quality, which is harmful to human health and can cause a variety of diseases, i.e., asthma, chronic bronchitis, etc. Therefore, the prediction of PM2.5 concentration is helpful to reduce its harm. In this paper, a hybrid model called CNN-BiLSTM-Attention is proposed to predict the PM2.5 concentration over the next two days. First, we select the PM2.5 concentration data in hours from January 2013 to February 2017 of Shunyi District, Beijing. The auxiliary data includes air quality data and meteorological data. We use the sliding window method for preprocessing and dividing the corresponding data into a training set, a validation set, and a test set. Second, CNN-BiLSTM-Attention is composed of the convolutional neural network, bidirectional long short-term memory neural network, and attention mechanism. The parameters of this network structure are determined by the minimum error in the training process, including the size of the convolution kernel, activation function, batch size, dropout rate, learning rate, etc. We determine the feature size of the input and output by evaluating the performance of the model, finding out the best output for the next 48 h. Third, in the experimental part, we use the test set to check the performance of the proposed CNN-BiLSTM-Attention on PM2.5 prediction, which is compared by other comparison models, i.e., lasso regression, ridge regression, XGBOOST, SVR, CNN-LSTM, and CNN-BiLSTM. We conduct short-term prediction (48 h) and long-term prediction (72 h, 96 h, 120 h, 144 h), respectively. The results demonstrate that even the predictions of the next 144 h with CNN-BiLSTM-Attention is better than the predictions of the next 48 h with the comparison models in terms of mean absolute error (MAE), root mean square error (RMSE), and coefficient of determination (<i>R</i><sup>2</sup>).https://www.mdpi.com/1999-4893/14/7/208deep learningCNNBiLSTMattention mechanismPM2.5 concentration prediction
spellingShingle Jinsong Zhang
Yongtao Peng
Bo Ren
Taoying Li
PM2.5 Concentration Prediction Based on CNN-BiLSTM and Attention Mechanism
Algorithms
deep learning
CNN
BiLSTM
attention mechanism
PM2.5 concentration prediction
title PM2.5 Concentration Prediction Based on CNN-BiLSTM and Attention Mechanism
title_full PM2.5 Concentration Prediction Based on CNN-BiLSTM and Attention Mechanism
title_fullStr PM2.5 Concentration Prediction Based on CNN-BiLSTM and Attention Mechanism
title_full_unstemmed PM2.5 Concentration Prediction Based on CNN-BiLSTM and Attention Mechanism
title_short PM2.5 Concentration Prediction Based on CNN-BiLSTM and Attention Mechanism
title_sort pm2 5 concentration prediction based on cnn bilstm and attention mechanism
topic deep learning
CNN
BiLSTM
attention mechanism
PM2.5 concentration prediction
url https://www.mdpi.com/1999-4893/14/7/208
work_keys_str_mv AT jinsongzhang pm25concentrationpredictionbasedoncnnbilstmandattentionmechanism
AT yongtaopeng pm25concentrationpredictionbasedoncnnbilstmandattentionmechanism
AT boren pm25concentrationpredictionbasedoncnnbilstmandattentionmechanism
AT taoyingli pm25concentrationpredictionbasedoncnnbilstmandattentionmechanism