Ozone response modeling to NOx and VOC emissions: Examining machine learning models

Current machine learning (ML) applications in atmospheric science focus on forecasting and bias correction for numerical modeling estimations, but few studies examined the nonlinear response of their predictions to precursor emissions. This study uses ground-level maximum daily 8-hour ozone average...

Full description

Bibliographic Details
Main Authors: Cheng-Pin Kuo, Joshua S. Fu
Format: Article
Language:English
Published: Elsevier 2023-06-01
Series:Environment International
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S0160412023002428
_version_ 1797812069298339840
author Cheng-Pin Kuo
Joshua S. Fu
author_facet Cheng-Pin Kuo
Joshua S. Fu
author_sort Cheng-Pin Kuo
collection DOAJ
description Current machine learning (ML) applications in atmospheric science focus on forecasting and bias correction for numerical modeling estimations, but few studies examined the nonlinear response of their predictions to precursor emissions. This study uses ground-level maximum daily 8-hour ozone average (MDA8 O3) as an example to examine O3 responses to local anthropogenic NOx and VOC emissions in Taiwan by Response Surface Modeling (RSM). Three different datasets for RSM were examined, including the Community Multiscale Air Quality (CMAQ) model data, ML-measurement-model fusion (ML-MMF) data, and ML data, which respectively represent direct numerical model predictions, numerical predictions adjusted by observations and other auxiliary data, and ML predictions based on observations and other auxiliary data.The results show that both ML-MMF (r = 0.93–0.94) and ML predictions (r = 0.89–0.94) present significantly improved performance in the benchmark case compared with CMAQ predictions (r = 0.41–0.80). While ML-MMF isopleths exhibit O3 nonlinearity close to actual responses due to their numerical base and observation-based correction, ML isopleths present biased predictions concerning their different controlled ranges of O3 and distorted O3 responses to NOx and VOC emission ratios compared with ML-MMF isopleths, which implies that using data without support from CMAQ modeling to predict the air quality could mislead the controlled targets and future trends. Meanwhile, the observation-corrected ML-MMF isopleths also emphasize the impact of transboundary pollution from mainland China on the regional O3 sensitivity to local NOx and VOC emissions, which transboundary NOx would make all air quality regions in April more sensitive to local VOC emissions and limit the potential effort by reducing local emissions.Future ML applications in atmospheric science like forecasting or bias correction should provide interpretability and explainability, except for meeting statistical performance and providing variable importance. Assessment with interpretable physical and chemical mechanisms and constructing a statistically robust ML model should be equally important.
first_indexed 2024-03-13T07:33:08Z
format Article
id doaj.art-00fcc1ec86b24be08febf1933ddeb7d8
institution Directory Open Access Journal
issn 0160-4120
language English
last_indexed 2024-03-13T07:33:08Z
publishDate 2023-06-01
publisher Elsevier
record_format Article
series Environment International
spelling doaj.art-00fcc1ec86b24be08febf1933ddeb7d82023-06-04T04:23:05ZengElsevierEnvironment International0160-41202023-06-01176107969Ozone response modeling to NOx and VOC emissions: Examining machine learning modelsCheng-Pin Kuo0Joshua S. Fu1Department of Civil and Environmental Engineering, University of Tennessee, Knoxville, TN, USADepartment of Civil and Environmental Engineering, University of Tennessee, Knoxville, TN, USA; Department of Atmospheric Sciences, National Central University, Taoyuan, Taiwan; Corresponding author at: Department of Civil and Environmental Engineering, The University of Tennessee, 851 Neyland Drive, Knoxville, TN 37996, USA.Current machine learning (ML) applications in atmospheric science focus on forecasting and bias correction for numerical modeling estimations, but few studies examined the nonlinear response of their predictions to precursor emissions. This study uses ground-level maximum daily 8-hour ozone average (MDA8 O3) as an example to examine O3 responses to local anthropogenic NOx and VOC emissions in Taiwan by Response Surface Modeling (RSM). Three different datasets for RSM were examined, including the Community Multiscale Air Quality (CMAQ) model data, ML-measurement-model fusion (ML-MMF) data, and ML data, which respectively represent direct numerical model predictions, numerical predictions adjusted by observations and other auxiliary data, and ML predictions based on observations and other auxiliary data.The results show that both ML-MMF (r = 0.93–0.94) and ML predictions (r = 0.89–0.94) present significantly improved performance in the benchmark case compared with CMAQ predictions (r = 0.41–0.80). While ML-MMF isopleths exhibit O3 nonlinearity close to actual responses due to their numerical base and observation-based correction, ML isopleths present biased predictions concerning their different controlled ranges of O3 and distorted O3 responses to NOx and VOC emission ratios compared with ML-MMF isopleths, which implies that using data without support from CMAQ modeling to predict the air quality could mislead the controlled targets and future trends. Meanwhile, the observation-corrected ML-MMF isopleths also emphasize the impact of transboundary pollution from mainland China on the regional O3 sensitivity to local NOx and VOC emissions, which transboundary NOx would make all air quality regions in April more sensitive to local VOC emissions and limit the potential effort by reducing local emissions.Future ML applications in atmospheric science like forecasting or bias correction should provide interpretability and explainability, except for meeting statistical performance and providing variable importance. Assessment with interpretable physical and chemical mechanisms and constructing a statistically robust ML model should be equally important.http://www.sciencedirect.com/science/article/pii/S0160412023002428OzoneEmission controlForecastingMachine learningMeasurement-model fusion
spellingShingle Cheng-Pin Kuo
Joshua S. Fu
Ozone response modeling to NOx and VOC emissions: Examining machine learning models
Environment International
Ozone
Emission control
Forecasting
Machine learning
Measurement-model fusion
title Ozone response modeling to NOx and VOC emissions: Examining machine learning models
title_full Ozone response modeling to NOx and VOC emissions: Examining machine learning models
title_fullStr Ozone response modeling to NOx and VOC emissions: Examining machine learning models
title_full_unstemmed Ozone response modeling to NOx and VOC emissions: Examining machine learning models
title_short Ozone response modeling to NOx and VOC emissions: Examining machine learning models
title_sort ozone response modeling to nox and voc emissions examining machine learning models
topic Ozone
Emission control
Forecasting
Machine learning
Measurement-model fusion
url http://www.sciencedirect.com/science/article/pii/S0160412023002428
work_keys_str_mv AT chengpinkuo ozoneresponsemodelingtonoxandvocemissionsexaminingmachinelearningmodels
AT joshuasfu ozoneresponsemodelingtonoxandvocemissionsexaminingmachinelearningmodels