Ozone response modeling to NOx and VOC emissions: Examining machine learning models

Current machine learning (ML) applications in atmospheric science focus on forecasting and bias correction for numerical modeling estimations, but few studies examined the nonlinear response of their predictions to precursor emissions. This study uses ground-level maximum daily 8-hour ozone average...

Full description

Bibliographic Details
Main Authors:	Cheng-Pin Kuo, Joshua S. Fu
Format:	Article
Language:	English
Published:	Elsevier 2023-06-01
Series:	Environment International
Subjects:	Ozone Emission control Forecasting Machine learning Measurement-model fusion
Online Access:	http://www.sciencedirect.com/science/article/pii/S0160412023002428

_version_	1797812069298339840
author	Cheng-Pin Kuo Joshua S. Fu
author_facet	Cheng-Pin Kuo Joshua S. Fu
author_sort	Cheng-Pin Kuo
collection	DOAJ
description	Current machine learning (ML) applications in atmospheric science focus on forecasting and bias correction for numerical modeling estimations, but few studies examined the nonlinear response of their predictions to precursor emissions. This study uses ground-level maximum daily 8-hour ozone average (MDA8 O3) as an example to examine O3 responses to local anthropogenic NOx and VOC emissions in Taiwan by Response Surface Modeling (RSM). Three different datasets for RSM were examined, including the Community Multiscale Air Quality (CMAQ) model data, ML-measurement-model fusion (ML-MMF) data, and ML data, which respectively represent direct numerical model predictions, numerical predictions adjusted by observations and other auxiliary data, and ML predictions based on observations and other auxiliary data.The results show that both ML-MMF (r = 0.93–0.94) and ML predictions (r = 0.89–0.94) present significantly improved performance in the benchmark case compared with CMAQ predictions (r = 0.41–0.80). While ML-MMF isopleths exhibit O3 nonlinearity close to actual responses due to their numerical base and observation-based correction, ML isopleths present biased predictions concerning their different controlled ranges of O3 and distorted O3 responses to NOx and VOC emission ratios compared with ML-MMF isopleths, which implies that using data without support from CMAQ modeling to predict the air quality could mislead the controlled targets and future trends. Meanwhile, the observation-corrected ML-MMF isopleths also emphasize the impact of transboundary pollution from mainland China on the regional O3 sensitivity to local NOx and VOC emissions, which transboundary NOx would make all air quality regions in April more sensitive to local VOC emissions and limit the potential effort by reducing local emissions.Future ML applications in atmospheric science like forecasting or bias correction should provide interpretability and explainability, except for meeting statistical performance and providing variable importance. Assessment with interpretable physical and chemical mechanisms and constructing a statistically robust ML model should be equally important.
first_indexed	2024-03-13T07:33:08Z
format	Article
id	doaj.art-00fcc1ec86b24be08febf1933ddeb7d8
institution	Directory Open Access Journal
issn	0160-4120
language	English
last_indexed	2024-03-13T07:33:08Z
publishDate	2023-06-01
publisher	Elsevier
record_format	Article
series	Environment International
spelling	doaj.art-00fcc1ec86b24be08febf1933ddeb7d82023-06-04T04:23:05ZengElsevierEnvironment International0160-41202023-06-01176107969Ozone response modeling to NOx and VOC emissions: Examining machine learning modelsCheng-Pin Kuo0Joshua S. Fu1Department of Civil and Environmental Engineering, University of Tennessee, Knoxville, TN, USADepartment of Civil and Environmental Engineering, University of Tennessee, Knoxville, TN, USA; Department of Atmospheric Sciences, National Central University, Taoyuan, Taiwan; Corresponding author at: Department of Civil and Environmental Engineering, The University of Tennessee, 851 Neyland Drive, Knoxville, TN 37996, USA.Current machine learning (ML) applications in atmospheric science focus on forecasting and bias correction for numerical modeling estimations, but few studies examined the nonlinear response of their predictions to precursor emissions. This study uses ground-level maximum daily 8-hour ozone average (MDA8 O3) as an example to examine O3 responses to local anthropogenic NOx and VOC emissions in Taiwan by Response Surface Modeling (RSM). Three different datasets for RSM were examined, including the Community Multiscale Air Quality (CMAQ) model data, ML-measurement-model fusion (ML-MMF) data, and ML data, which respectively represent direct numerical model predictions, numerical predictions adjusted by observations and other auxiliary data, and ML predictions based on observations and other auxiliary data.The results show that both ML-MMF (r = 0.93–0.94) and ML predictions (r = 0.89–0.94) present significantly improved performance in the benchmark case compared with CMAQ predictions (r = 0.41–0.80). While ML-MMF isopleths exhibit O3 nonlinearity close to actual responses due to their numerical base and observation-based correction, ML isopleths present biased predictions concerning their different controlled ranges of O3 and distorted O3 responses to NOx and VOC emission ratios compared with ML-MMF isopleths, which implies that using data without support from CMAQ modeling to predict the air quality could mislead the controlled targets and future trends. Meanwhile, the observation-corrected ML-MMF isopleths also emphasize the impact of transboundary pollution from mainland China on the regional O3 sensitivity to local NOx and VOC emissions, which transboundary NOx would make all air quality regions in April more sensitive to local VOC emissions and limit the potential effort by reducing local emissions.Future ML applications in atmospheric science like forecasting or bias correction should provide interpretability and explainability, except for meeting statistical performance and providing variable importance. Assessment with interpretable physical and chemical mechanisms and constructing a statistically robust ML model should be equally important.http://www.sciencedirect.com/science/article/pii/S0160412023002428OzoneEmission controlForecastingMachine learningMeasurement-model fusion
spellingShingle	Cheng-Pin Kuo Joshua S. Fu Ozone response modeling to NOx and VOC emissions: Examining machine learning models Environment International Ozone Emission control Forecasting Machine learning Measurement-model fusion
title	Ozone response modeling to NOx and VOC emissions: Examining machine learning models
title_full	Ozone response modeling to NOx and VOC emissions: Examining machine learning models
title_fullStr	Ozone response modeling to NOx and VOC emissions: Examining machine learning models
title_full_unstemmed	Ozone response modeling to NOx and VOC emissions: Examining machine learning models
title_short	Ozone response modeling to NOx and VOC emissions: Examining machine learning models
title_sort	ozone response modeling to nox and voc emissions examining machine learning models
topic	Ozone Emission control Forecasting Machine learning Measurement-model fusion
url	http://www.sciencedirect.com/science/article/pii/S0160412023002428
work_keys_str_mv	AT chengpinkuo ozoneresponsemodelingtonoxandvocemissionsexaminingmachinelearningmodels AT joshuasfu ozoneresponsemodelingtonoxandvocemissionsexaminingmachinelearningmodels

Ozone response modeling to NOx and VOC emissions: Examining machine learning models

Similar Items