Ensemble machine learning techniques for particulate emissions estimation from a highly boosted GDI engine fuelled by different gasoline blends
Light-duty vehicle emissions regulations worldwide impose stringent limits on particulate matter (PM) emissions, necessitating accurate modelling and prediction of particulate emissions across a range of sizes (as low as 10 nm). It has been shown that the decision tree-based ensemble machine learnin...
Autores principales: | , , , , , |
---|---|
Formato: | Conference item |
Lenguaje: | English |
Publicado: |
SAE International
2024
|
_version_ | 1826314875607973888 |
---|---|
author | Stangierska, M Bajwa, A Lewis, A Akehurst, S Turner, J Leach, FCP |
author_facet | Stangierska, M Bajwa, A Lewis, A Akehurst, S Turner, J Leach, FCP |
author_sort | Stangierska, M |
collection | OXFORD |
description | Light-duty vehicle emissions regulations worldwide impose stringent limits on particulate matter (PM) emissions, necessitating accurate modelling and prediction of particulate emissions across a range of sizes (as low as 10 nm). It has been shown that the decision tree-based ensemble machine learning technique known as Random Forest can accurately predict particle size, concentration, and accumulation mode geometric standard deviation (GSD) for 23 nm and greater diameter particulate emissions from a highly boosted gasoline direct injection (GDI) engine operating on a single fuel, while also offering insights into the underlying factors of emissions production because of the interpretable nature of decision trees. This work builds on this research as its basis and further investigates the relative performance of five decision tree-based machine learning techniques in predicting these particulate emission parameters and extends the work to 10 nm particles. In addition to Random Forest, the selected techniques consist of four gradient boosting models: GBM, XGBoost, LightGBM, and CatBoost. Moreover, the influences of fuel chemistry are assessed by using data from 13 gasoline fuel blends, including blends with ethanol and methanol – common bio- and e-fuels. The results show that the CatBoost model achieves the highest prediction accuracy (R2 between 0.77 and 0.932), even when the feature set is reduced to improve computational efficiency. Random Forest and LightGBM are also shown to be suitable for PM emissions estimation. Permutation feature importance was used to highlight the dependence of PM emissions on both fuel and engine operating parameters – offering new insights into the effect of fuel properties on particulate emissions and their formation in highly boosted engines. |
first_indexed | 2024-09-25T04:35:54Z |
format | Conference item |
id | oxford-uuid:f515aefc-1ed5-4dfa-bc1b-4fa2deaa738b |
institution | University of Oxford |
language | English |
last_indexed | 2024-12-09T03:12:48Z |
publishDate | 2024 |
publisher | SAE International |
record_format | dspace |
spelling | oxford-uuid:f515aefc-1ed5-4dfa-bc1b-4fa2deaa738b2024-10-14T14:36:21ZEnsemble machine learning techniques for particulate emissions estimation from a highly boosted GDI engine fuelled by different gasoline blendsConference itemhttp://purl.org/coar/resource_type/c_5794uuid:f515aefc-1ed5-4dfa-bc1b-4fa2deaa738bEnglishSymplectic ElementsSAE International2024Stangierska, MBajwa, ALewis, AAkehurst, STurner, JLeach, FCPLight-duty vehicle emissions regulations worldwide impose stringent limits on particulate matter (PM) emissions, necessitating accurate modelling and prediction of particulate emissions across a range of sizes (as low as 10 nm). It has been shown that the decision tree-based ensemble machine learning technique known as Random Forest can accurately predict particle size, concentration, and accumulation mode geometric standard deviation (GSD) for 23 nm and greater diameter particulate emissions from a highly boosted gasoline direct injection (GDI) engine operating on a single fuel, while also offering insights into the underlying factors of emissions production because of the interpretable nature of decision trees. This work builds on this research as its basis and further investigates the relative performance of five decision tree-based machine learning techniques in predicting these particulate emission parameters and extends the work to 10 nm particles. In addition to Random Forest, the selected techniques consist of four gradient boosting models: GBM, XGBoost, LightGBM, and CatBoost. Moreover, the influences of fuel chemistry are assessed by using data from 13 gasoline fuel blends, including blends with ethanol and methanol – common bio- and e-fuels. The results show that the CatBoost model achieves the highest prediction accuracy (R2 between 0.77 and 0.932), even when the feature set is reduced to improve computational efficiency. Random Forest and LightGBM are also shown to be suitable for PM emissions estimation. Permutation feature importance was used to highlight the dependence of PM emissions on both fuel and engine operating parameters – offering new insights into the effect of fuel properties on particulate emissions and their formation in highly boosted engines. |
spellingShingle | Stangierska, M Bajwa, A Lewis, A Akehurst, S Turner, J Leach, FCP Ensemble machine learning techniques for particulate emissions estimation from a highly boosted GDI engine fuelled by different gasoline blends |
title | Ensemble machine learning techniques for particulate emissions estimation from a highly boosted GDI engine fuelled by different gasoline blends |
title_full | Ensemble machine learning techniques for particulate emissions estimation from a highly boosted GDI engine fuelled by different gasoline blends |
title_fullStr | Ensemble machine learning techniques for particulate emissions estimation from a highly boosted GDI engine fuelled by different gasoline blends |
title_full_unstemmed | Ensemble machine learning techniques for particulate emissions estimation from a highly boosted GDI engine fuelled by different gasoline blends |
title_short | Ensemble machine learning techniques for particulate emissions estimation from a highly boosted GDI engine fuelled by different gasoline blends |
title_sort | ensemble machine learning techniques for particulate emissions estimation from a highly boosted gdi engine fuelled by different gasoline blends |
work_keys_str_mv | AT stangierskam ensemblemachinelearningtechniquesforparticulateemissionsestimationfromahighlyboostedgdienginefuelledbydifferentgasolineblends AT bajwaa ensemblemachinelearningtechniquesforparticulateemissionsestimationfromahighlyboostedgdienginefuelledbydifferentgasolineblends AT lewisa ensemblemachinelearningtechniquesforparticulateemissionsestimationfromahighlyboostedgdienginefuelledbydifferentgasolineblends AT akehursts ensemblemachinelearningtechniquesforparticulateemissionsestimationfromahighlyboostedgdienginefuelledbydifferentgasolineblends AT turnerj ensemblemachinelearningtechniquesforparticulateemissionsestimationfromahighlyboostedgdienginefuelledbydifferentgasolineblends AT leachfcp ensemblemachinelearningtechniquesforparticulateemissionsestimationfromahighlyboostedgdienginefuelledbydifferentgasolineblends |