Calibration of Portable Particulate Matter–Monitoring Device using Web Query and Machine Learning

Background: Monitoring and control of PM2.5 are being recognized as key to address health issues attributed to PM2.5. Availability of low-cost PM2.5 sensors made it possible to introduce a number of portable PM2.5 monitors based on light scattering to the consumer market at an affordable price. Accu...

Full description

Bibliographic Details
Main Authors: Byoung Gook Loh, Gi Heung Choi
Format: Article
Language:English
Published: Elsevier 2019-12-01
Series:Safety and Health at Work
Online Access:http://www.sciencedirect.com/science/article/pii/S2093791119302811
_version_ 1797763137304264704
author Byoung Gook Loh
Gi Heung Choi
author_facet Byoung Gook Loh
Gi Heung Choi
author_sort Byoung Gook Loh
collection DOAJ
description Background: Monitoring and control of PM2.5 are being recognized as key to address health issues attributed to PM2.5. Availability of low-cost PM2.5 sensors made it possible to introduce a number of portable PM2.5 monitors based on light scattering to the consumer market at an affordable price. Accuracy of light scattering–based PM2.5 monitors significantly depends on the method of calibration. Static calibration curve is used as the most popular calibration method for low-cost PM2.5 sensors particularly because of ease of application. Drawback in this approach is, however, the lack of accuracy. Methods: This study discussed the calibration of a low-cost PM2.5-monitoring device (PMD) to improve the accuracy and reliability for practical use. The proposed method is based on construction of the PM2.5 sensor network using Message Queuing Telemetry Transport (MQTT) protocol and web query of reference measurement data available at government-authorized PM monitoring station (GAMS) in the republic of Korea. Four machine learning (ML) algorithms such as support vector machine, k-nearest neighbors, random forest, and extreme gradient boosting were used as regression models to calibrate the PMD measurements of PM2.5. Performance of each ML algorithm was evaluated using stratified K-fold cross-validation, and a linear regression model was used as a reference. Results: Based on the performance of ML algorithms used, regression of the output of the PMD to PM2.5 concentrations data available from the GAMS through web query was effective. The extreme gradient boosting algorithm showed the best performance with a mean coefficient of determination (R2) of 0.78 and standard error of 5.0 μg/m3, corresponding to 8% increase in R2 and 12% decrease in root mean square error in comparison with the linear regression model. Minimum 100 hours of calibration period was found required to calibrate the PMD to its full capacity. Calibration method proposed poses a limitation on the location of the PMD being in the vicinity of the GAMS. As the number of the PMD participating in the sensor network increases, however, calibrated PMDs can be used as reference devices to nearby PMDs that require calibration, forming a calibration chain through MQTT protocol. Conclusions: Calibration of a low-cost PMD, which is based on construction of PM2.5 sensor network using MQTT protocol and web query of reference measurement data available at a GAMS, significantly improves the accuracy and reliability of a PMD, thereby making practical use of the low-cost PMD possible. Keywords: Calibration, Machine learning, Monitoring and control, Particulate matter, Web query
first_indexed 2024-03-12T19:37:21Z
format Article
id doaj.art-4aadf9b98c5940b69ba6afb363cb9433
institution Directory Open Access Journal
issn 2093-7911
language English
last_indexed 2024-03-12T19:37:21Z
publishDate 2019-12-01
publisher Elsevier
record_format Article
series Safety and Health at Work
spelling doaj.art-4aadf9b98c5940b69ba6afb363cb94332023-08-02T04:03:18ZengElsevierSafety and Health at Work2093-79112019-12-01104452460Calibration of Portable Particulate Matter–Monitoring Device using Web Query and Machine LearningByoung Gook Loh0Gi Heung Choi1Department of Applied IT Engineering, Hansung University, Seoul, Republic of KoreaDepartment of Mechanical Systems Engineering, Hansung University, Seoul, Republic of Korea; Corresponding author. Department of Mechanical Systems Engineering, Hansung University, 116 Samseongyoro-16gil Sungbuk-gu, Seoul, 136-792, Republic of Korea.Background: Monitoring and control of PM2.5 are being recognized as key to address health issues attributed to PM2.5. Availability of low-cost PM2.5 sensors made it possible to introduce a number of portable PM2.5 monitors based on light scattering to the consumer market at an affordable price. Accuracy of light scattering–based PM2.5 monitors significantly depends on the method of calibration. Static calibration curve is used as the most popular calibration method for low-cost PM2.5 sensors particularly because of ease of application. Drawback in this approach is, however, the lack of accuracy. Methods: This study discussed the calibration of a low-cost PM2.5-monitoring device (PMD) to improve the accuracy and reliability for practical use. The proposed method is based on construction of the PM2.5 sensor network using Message Queuing Telemetry Transport (MQTT) protocol and web query of reference measurement data available at government-authorized PM monitoring station (GAMS) in the republic of Korea. Four machine learning (ML) algorithms such as support vector machine, k-nearest neighbors, random forest, and extreme gradient boosting were used as regression models to calibrate the PMD measurements of PM2.5. Performance of each ML algorithm was evaluated using stratified K-fold cross-validation, and a linear regression model was used as a reference. Results: Based on the performance of ML algorithms used, regression of the output of the PMD to PM2.5 concentrations data available from the GAMS through web query was effective. The extreme gradient boosting algorithm showed the best performance with a mean coefficient of determination (R2) of 0.78 and standard error of 5.0 μg/m3, corresponding to 8% increase in R2 and 12% decrease in root mean square error in comparison with the linear regression model. Minimum 100 hours of calibration period was found required to calibrate the PMD to its full capacity. Calibration method proposed poses a limitation on the location of the PMD being in the vicinity of the GAMS. As the number of the PMD participating in the sensor network increases, however, calibrated PMDs can be used as reference devices to nearby PMDs that require calibration, forming a calibration chain through MQTT protocol. Conclusions: Calibration of a low-cost PMD, which is based on construction of PM2.5 sensor network using MQTT protocol and web query of reference measurement data available at a GAMS, significantly improves the accuracy and reliability of a PMD, thereby making practical use of the low-cost PMD possible. Keywords: Calibration, Machine learning, Monitoring and control, Particulate matter, Web queryhttp://www.sciencedirect.com/science/article/pii/S2093791119302811
spellingShingle Byoung Gook Loh
Gi Heung Choi
Calibration of Portable Particulate Matter–Monitoring Device using Web Query and Machine Learning
Safety and Health at Work
title Calibration of Portable Particulate Matter–Monitoring Device using Web Query and Machine Learning
title_full Calibration of Portable Particulate Matter–Monitoring Device using Web Query and Machine Learning
title_fullStr Calibration of Portable Particulate Matter–Monitoring Device using Web Query and Machine Learning
title_full_unstemmed Calibration of Portable Particulate Matter–Monitoring Device using Web Query and Machine Learning
title_short Calibration of Portable Particulate Matter–Monitoring Device using Web Query and Machine Learning
title_sort calibration of portable particulate matter monitoring device using web query and machine learning
url http://www.sciencedirect.com/science/article/pii/S2093791119302811
work_keys_str_mv AT byounggookloh calibrationofportableparticulatemattermonitoringdeviceusingwebqueryandmachinelearning
AT giheungchoi calibrationofportableparticulatemattermonitoringdeviceusingwebqueryandmachinelearning