Publicly available low-cost sensor measurements for PM2.5 exposure modeling: Guidance for monitor deployment and data selection

High-resolution, high-quality exposure modeling is critical for assessing the health effects of ambient PM2.5 in epidemiological studies. Using sparse regulatory PM2.5 measurements as principal model inputs may result in two issues in exposure prediction: (1) they may affect the models’ accuracy in...

Full description

Bibliographic Details
Main Authors: Jianzhao Bi, Nancy Carmona, Magali N. Blanco, Amanda J. Gassett, Edmund Seto, Adam A. Szpiro, Timothy V. Larson, Paul D. Sampson, Joel D. Kaufman, Lianne Sheppard
Format: Article
Language:English
Published: Elsevier 2022-01-01
Series:Environment International
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S0160412021005225
_version_ 1819179206964150272
author Jianzhao Bi
Nancy Carmona
Magali N. Blanco
Amanda J. Gassett
Edmund Seto
Adam A. Szpiro
Timothy V. Larson
Paul D. Sampson
Joel D. Kaufman
Lianne Sheppard
author_facet Jianzhao Bi
Nancy Carmona
Magali N. Blanco
Amanda J. Gassett
Edmund Seto
Adam A. Szpiro
Timothy V. Larson
Paul D. Sampson
Joel D. Kaufman
Lianne Sheppard
author_sort Jianzhao Bi
collection DOAJ
description High-resolution, high-quality exposure modeling is critical for assessing the health effects of ambient PM2.5 in epidemiological studies. Using sparse regulatory PM2.5 measurements as principal model inputs may result in two issues in exposure prediction: (1) they may affect the models’ accuracy in predicting PM2.5 spatial distribution; (2) the internal validation based on these measurements may not reliably reflect the model performance at locations of interest (e.g., a cohort’s residential locations). In this study, we used the PM2.5 measurements from a publicly available commercial low-cost PM2.5 network, PurpleAir, with an external validation dataset at the residential locations of a representative sample of participants from the Adult Changes in Thought - Air Pollution (ACT-AP) study, to improve the accuracy of exposure prediction at the cohort participant locations. We also proposed a metric based on principal component analysis (PCA) - the PCA distance - to assess the similarity between monitor and cohort locations to guide monitor deployment and data selection. The analysis was based on a spatiotemporal modeling framework with 51 “gold-standard” monitors and 58 PurpleAir monitors for model development, as well as 105 home monitors at the cohort locations for model validation, in the Puget Sound region of Washington State from June 2017 to March 2019. After including calibrated PurpleAir measurements as part of the dependent variable, the external spatiotemporal validation R2 and root-mean-square error, RMSE, for two-week concentration averages improved from 0.84 and 2.22 μg/m3 to 0.92 and 1.63 μg/m3, respectively. The external spatial validation R2 and RMSE for long-term averages over the modeling period improved from 0.72 and 1.01 μg/m3 to 0.79 and 0.88 μg/m3, respectively. The exposure predictions incorporating PurpleAir measurements demonstrated sharper urban-suburban concentration gradients. The PurpleAir monitors with shorter PCA distances improved the model’s prediction accuracy more substantially than the monitors with longer PCA distances, supporting the use of this similarity metric.
first_indexed 2024-12-22T21:54:46Z
format Article
id doaj.art-8558f7ef7604410b804d3e2568a79b59
institution Directory Open Access Journal
issn 0160-4120
language English
last_indexed 2024-12-22T21:54:46Z
publishDate 2022-01-01
publisher Elsevier
record_format Article
series Environment International
spelling doaj.art-8558f7ef7604410b804d3e2568a79b592022-12-21T18:11:17ZengElsevierEnvironment International0160-41202022-01-01158106897Publicly available low-cost sensor measurements for PM2.5 exposure modeling: Guidance for monitor deployment and data selectionJianzhao Bi0Nancy Carmona1Magali N. Blanco2Amanda J. Gassett3Edmund Seto4Adam A. Szpiro5Timothy V. Larson6Paul D. Sampson7Joel D. Kaufman8Lianne Sheppard9Department of Environmental & Occupational Health Sciences, University of Washington, Seattle, WA, USA; Corresponding author at: Department of Environmental & Occupational Health Sciences, University of Washington, 4225 Roosevelt Way NE, Seattle, WA 98105, USA.Department of Environmental & Occupational Health Sciences, University of Washington, Seattle, WA, USADepartment of Environmental & Occupational Health Sciences, University of Washington, Seattle, WA, USADepartment of Environmental & Occupational Health Sciences, University of Washington, Seattle, WA, USADepartment of Environmental & Occupational Health Sciences, University of Washington, Seattle, WA, USADepartment of Biostatistics, University of Washington, Seattle, WA, USADepartment of Civil & Environmental Engineering, University of Washington, Seattle, WA, USADepartment of Statistics, University of Washington, Seattle, WA, USADepartment of Environmental & Occupational Health Sciences, University of Washington, Seattle, WA, USA; Department of Medicine, University of Washington, Seattle, WA, USA; Department of Epidemiology, University of Washington, Seattle, WA, USADepartment of Environmental & Occupational Health Sciences, University of Washington, Seattle, WA, USA; Department of Biostatistics, University of Washington, Seattle, WA, USAHigh-resolution, high-quality exposure modeling is critical for assessing the health effects of ambient PM2.5 in epidemiological studies. Using sparse regulatory PM2.5 measurements as principal model inputs may result in two issues in exposure prediction: (1) they may affect the models’ accuracy in predicting PM2.5 spatial distribution; (2) the internal validation based on these measurements may not reliably reflect the model performance at locations of interest (e.g., a cohort’s residential locations). In this study, we used the PM2.5 measurements from a publicly available commercial low-cost PM2.5 network, PurpleAir, with an external validation dataset at the residential locations of a representative sample of participants from the Adult Changes in Thought - Air Pollution (ACT-AP) study, to improve the accuracy of exposure prediction at the cohort participant locations. We also proposed a metric based on principal component analysis (PCA) - the PCA distance - to assess the similarity between monitor and cohort locations to guide monitor deployment and data selection. The analysis was based on a spatiotemporal modeling framework with 51 “gold-standard” monitors and 58 PurpleAir monitors for model development, as well as 105 home monitors at the cohort locations for model validation, in the Puget Sound region of Washington State from June 2017 to March 2019. After including calibrated PurpleAir measurements as part of the dependent variable, the external spatiotemporal validation R2 and root-mean-square error, RMSE, for two-week concentration averages improved from 0.84 and 2.22 μg/m3 to 0.92 and 1.63 μg/m3, respectively. The external spatial validation R2 and RMSE for long-term averages over the modeling period improved from 0.72 and 1.01 μg/m3 to 0.79 and 0.88 μg/m3, respectively. The exposure predictions incorporating PurpleAir measurements demonstrated sharper urban-suburban concentration gradients. The PurpleAir monitors with shorter PCA distances improved the model’s prediction accuracy more substantially than the monitors with longer PCA distances, supporting the use of this similarity metric.http://www.sciencedirect.com/science/article/pii/S0160412021005225PurpleAirHigh-resolutionExposure assessmentFine particulate matterModel validation
spellingShingle Jianzhao Bi
Nancy Carmona
Magali N. Blanco
Amanda J. Gassett
Edmund Seto
Adam A. Szpiro
Timothy V. Larson
Paul D. Sampson
Joel D. Kaufman
Lianne Sheppard
Publicly available low-cost sensor measurements for PM2.5 exposure modeling: Guidance for monitor deployment and data selection
Environment International
PurpleAir
High-resolution
Exposure assessment
Fine particulate matter
Model validation
title Publicly available low-cost sensor measurements for PM2.5 exposure modeling: Guidance for monitor deployment and data selection
title_full Publicly available low-cost sensor measurements for PM2.5 exposure modeling: Guidance for monitor deployment and data selection
title_fullStr Publicly available low-cost sensor measurements for PM2.5 exposure modeling: Guidance for monitor deployment and data selection
title_full_unstemmed Publicly available low-cost sensor measurements for PM2.5 exposure modeling: Guidance for monitor deployment and data selection
title_short Publicly available low-cost sensor measurements for PM2.5 exposure modeling: Guidance for monitor deployment and data selection
title_sort publicly available low cost sensor measurements for pm2 5 exposure modeling guidance for monitor deployment and data selection
topic PurpleAir
High-resolution
Exposure assessment
Fine particulate matter
Model validation
url http://www.sciencedirect.com/science/article/pii/S0160412021005225
work_keys_str_mv AT jianzhaobi publiclyavailablelowcostsensormeasurementsforpm25exposuremodelingguidanceformonitordeploymentanddataselection
AT nancycarmona publiclyavailablelowcostsensormeasurementsforpm25exposuremodelingguidanceformonitordeploymentanddataselection
AT magalinblanco publiclyavailablelowcostsensormeasurementsforpm25exposuremodelingguidanceformonitordeploymentanddataselection
AT amandajgassett publiclyavailablelowcostsensormeasurementsforpm25exposuremodelingguidanceformonitordeploymentanddataselection
AT edmundseto publiclyavailablelowcostsensormeasurementsforpm25exposuremodelingguidanceformonitordeploymentanddataselection
AT adamaszpiro publiclyavailablelowcostsensormeasurementsforpm25exposuremodelingguidanceformonitordeploymentanddataselection
AT timothyvlarson publiclyavailablelowcostsensormeasurementsforpm25exposuremodelingguidanceformonitordeploymentanddataselection
AT pauldsampson publiclyavailablelowcostsensormeasurementsforpm25exposuremodelingguidanceformonitordeploymentanddataselection
AT joeldkaufman publiclyavailablelowcostsensormeasurementsforpm25exposuremodelingguidanceformonitordeploymentanddataselection
AT liannesheppard publiclyavailablelowcostsensormeasurementsforpm25exposuremodelingguidanceformonitordeploymentanddataselection