Ensemble machine learning techniques for particulate emissions estimation from a highly boosted GDI engine fuelled by different gasoline blends

Light-duty vehicle emissions regulations worldwide impose stringent limits on particulate matter (PM) emissions, necessitating accurate modelling and prediction of particulate emissions across a range of sizes (as low as 10 nm). It has been shown that the decision tree-based ensemble machine learnin...

全面介紹

書目詳細資料
Main Authors: Stangierska, M, Bajwa, A, Lewis, A, Akehurst, S, Turner, J, Leach, FCP
格式: Conference item
語言:English
出版: SAE International 2024
_version_ 1826314875607973888
author Stangierska, M
Bajwa, A
Lewis, A
Akehurst, S
Turner, J
Leach, FCP
author_facet Stangierska, M
Bajwa, A
Lewis, A
Akehurst, S
Turner, J
Leach, FCP
author_sort Stangierska, M
collection OXFORD
description Light-duty vehicle emissions regulations worldwide impose stringent limits on particulate matter (PM) emissions, necessitating accurate modelling and prediction of particulate emissions across a range of sizes (as low as 10 nm). It has been shown that the decision tree-based ensemble machine learning technique known as Random Forest can accurately predict particle size, concentration, and accumulation mode geometric standard deviation (GSD) for 23 nm and greater diameter particulate emissions from a highly boosted gasoline direct injection (GDI) engine operating on a single fuel, while also offering insights into the underlying factors of emissions production because of the interpretable nature of decision trees. This work builds on this research as its basis and further investigates the relative performance of five decision tree-based machine learning techniques in predicting these particulate emission parameters and extends the work to 10 nm particles. In addition to Random Forest, the selected techniques consist of four gradient boosting models: GBM, XGBoost, LightGBM, and CatBoost. Moreover, the influences of fuel chemistry are assessed by using data from 13 gasoline fuel blends, including blends with ethanol and methanol – common bio- and e-fuels. The results show that the CatBoost model achieves the highest prediction accuracy (R2 between 0.77 and 0.932), even when the feature set is reduced to improve computational efficiency. Random Forest and LightGBM are also shown to be suitable for PM emissions estimation. Permutation feature importance was used to highlight the dependence of PM emissions on both fuel and engine operating parameters – offering new insights into the effect of fuel properties on particulate emissions and their formation in highly boosted engines.
first_indexed 2024-09-25T04:35:54Z
format Conference item
id oxford-uuid:f515aefc-1ed5-4dfa-bc1b-4fa2deaa738b
institution University of Oxford
language English
last_indexed 2024-12-09T03:12:48Z
publishDate 2024
publisher SAE International
record_format dspace
spelling oxford-uuid:f515aefc-1ed5-4dfa-bc1b-4fa2deaa738b2024-10-14T14:36:21ZEnsemble machine learning techniques for particulate emissions estimation from a highly boosted GDI engine fuelled by different gasoline blendsConference itemhttp://purl.org/coar/resource_type/c_5794uuid:f515aefc-1ed5-4dfa-bc1b-4fa2deaa738bEnglishSymplectic ElementsSAE International2024Stangierska, MBajwa, ALewis, AAkehurst, STurner, JLeach, FCPLight-duty vehicle emissions regulations worldwide impose stringent limits on particulate matter (PM) emissions, necessitating accurate modelling and prediction of particulate emissions across a range of sizes (as low as 10 nm). It has been shown that the decision tree-based ensemble machine learning technique known as Random Forest can accurately predict particle size, concentration, and accumulation mode geometric standard deviation (GSD) for 23 nm and greater diameter particulate emissions from a highly boosted gasoline direct injection (GDI) engine operating on a single fuel, while also offering insights into the underlying factors of emissions production because of the interpretable nature of decision trees. This work builds on this research as its basis and further investigates the relative performance of five decision tree-based machine learning techniques in predicting these particulate emission parameters and extends the work to 10 nm particles. In addition to Random Forest, the selected techniques consist of four gradient boosting models: GBM, XGBoost, LightGBM, and CatBoost. Moreover, the influences of fuel chemistry are assessed by using data from 13 gasoline fuel blends, including blends with ethanol and methanol – common bio- and e-fuels. The results show that the CatBoost model achieves the highest prediction accuracy (R2 between 0.77 and 0.932), even when the feature set is reduced to improve computational efficiency. Random Forest and LightGBM are also shown to be suitable for PM emissions estimation. Permutation feature importance was used to highlight the dependence of PM emissions on both fuel and engine operating parameters – offering new insights into the effect of fuel properties on particulate emissions and their formation in highly boosted engines.
spellingShingle Stangierska, M
Bajwa, A
Lewis, A
Akehurst, S
Turner, J
Leach, FCP
Ensemble machine learning techniques for particulate emissions estimation from a highly boosted GDI engine fuelled by different gasoline blends
title Ensemble machine learning techniques for particulate emissions estimation from a highly boosted GDI engine fuelled by different gasoline blends
title_full Ensemble machine learning techniques for particulate emissions estimation from a highly boosted GDI engine fuelled by different gasoline blends
title_fullStr Ensemble machine learning techniques for particulate emissions estimation from a highly boosted GDI engine fuelled by different gasoline blends
title_full_unstemmed Ensemble machine learning techniques for particulate emissions estimation from a highly boosted GDI engine fuelled by different gasoline blends
title_short Ensemble machine learning techniques for particulate emissions estimation from a highly boosted GDI engine fuelled by different gasoline blends
title_sort ensemble machine learning techniques for particulate emissions estimation from a highly boosted gdi engine fuelled by different gasoline blends
work_keys_str_mv AT stangierskam ensemblemachinelearningtechniquesforparticulateemissionsestimationfromahighlyboostedgdienginefuelledbydifferentgasolineblends
AT bajwaa ensemblemachinelearningtechniquesforparticulateemissionsestimationfromahighlyboostedgdienginefuelledbydifferentgasolineblends
AT lewisa ensemblemachinelearningtechniquesforparticulateemissionsestimationfromahighlyboostedgdienginefuelledbydifferentgasolineblends
AT akehursts ensemblemachinelearningtechniquesforparticulateemissionsestimationfromahighlyboostedgdienginefuelledbydifferentgasolineblends
AT turnerj ensemblemachinelearningtechniquesforparticulateemissionsestimationfromahighlyboostedgdienginefuelledbydifferentgasolineblends
AT leachfcp ensemblemachinelearningtechniquesforparticulateemissionsestimationfromahighlyboostedgdienginefuelledbydifferentgasolineblends