The effect of feature normalization methods in radiomics

Abstract Objectives In radiomics, different feature normalization methods, such as z-Score or Min–Max, are currently utilized, but their specific impact on the model is unclear. We aimed to measure their effect on the predictive performance and the feature selection. Methods We employed fifteen publ...

Full description

Bibliographic Details
Main Author:	Aydin Demircioğlu
Format:	Article
Language:	English
Published:	SpringerOpen 2024-01-01
Series:	Insights into Imaging
Subjects:	Feature normalization Feature scaling Feature selection Radiomics High-dimensional datasets
Online Access:	https://doi.org/10.1186/s13244-023-01575-7

_version_	1797355746201960448
author	Aydin Demircioğlu
author_facet	Aydin Demircioğlu
author_sort	Aydin Demircioğlu
collection	DOAJ
description	Abstract Objectives In radiomics, different feature normalization methods, such as z-Score or Min–Max, are currently utilized, but their specific impact on the model is unclear. We aimed to measure their effect on the predictive performance and the feature selection. Methods We employed fifteen publicly available radiomics datasets to compare seven normalization methods. Using four feature selection and classifier methods, we used cross-validation to measure the area under the curve (AUC) of the resulting models, the agreement of selected features, and the model calibration. In addition, we assessed whether normalization before cross-validation introduces bias. Results On average, the difference between the normalization methods was relatively small, with a gain of at most + 0.012 in AUC when comparing the z-Score (mean AUC: 0.707 ± 0.102) to no normalization (mean AUC: 0.719 ± 0.107). However, on some datasets, the difference reached + 0.051. The z-Score performed best, while the tanh transformation showed the worst performance and even decreased the overall predictive performance. While quantile transformation performed, on average, slightly worse than the z-Score, it outperformed all other methods on one out of three datasets. The agreement between the features selected by different normalization methods was only mild, reaching at most 62%. Applying the normalization before cross-validation did not introduce significant bias. Conclusion The choice of the feature normalization method influenced the predictive performance but depended strongly on the dataset. It strongly impacted the set of selected features. Critical relevance statement Feature normalization plays a crucial role in the preprocessing and influences the predictive performance and the selected features, complicating feature interpretation. Key points • The impact of feature normalization methods on radiomic models was measured. • Normalization methods performed similarly on average, but differed more strongly on some datasets. • Different methods led to different sets of selected features, impeding feature interpretation. • Model calibration was not largely affected by the normalization method. Graphical Abstract
first_indexed	2024-03-08T14:15:28Z
format	Article
id	doaj.art-bb06a148051f402480525730fabe7a7d
institution	Directory Open Access Journal
issn	1869-4101
language	English
last_indexed	2024-03-08T14:15:28Z
publishDate	2024-01-01
publisher	SpringerOpen
record_format	Article
series	Insights into Imaging
spelling	doaj.art-bb06a148051f402480525730fabe7a7d2024-01-14T12:25:47ZengSpringerOpenInsights into Imaging1869-41012024-01-0115111110.1186/s13244-023-01575-7The effect of feature normalization methods in radiomicsAydin Demircioğlu0Institute of Diagnostic and Interventional Radiology and Neuroradiology, University Hospital EssenAbstract Objectives In radiomics, different feature normalization methods, such as z-Score or Min–Max, are currently utilized, but their specific impact on the model is unclear. We aimed to measure their effect on the predictive performance and the feature selection. Methods We employed fifteen publicly available radiomics datasets to compare seven normalization methods. Using four feature selection and classifier methods, we used cross-validation to measure the area under the curve (AUC) of the resulting models, the agreement of selected features, and the model calibration. In addition, we assessed whether normalization before cross-validation introduces bias. Results On average, the difference between the normalization methods was relatively small, with a gain of at most + 0.012 in AUC when comparing the z-Score (mean AUC: 0.707 ± 0.102) to no normalization (mean AUC: 0.719 ± 0.107). However, on some datasets, the difference reached + 0.051. The z-Score performed best, while the tanh transformation showed the worst performance and even decreased the overall predictive performance. While quantile transformation performed, on average, slightly worse than the z-Score, it outperformed all other methods on one out of three datasets. The agreement between the features selected by different normalization methods was only mild, reaching at most 62%. Applying the normalization before cross-validation did not introduce significant bias. Conclusion The choice of the feature normalization method influenced the predictive performance but depended strongly on the dataset. It strongly impacted the set of selected features. Critical relevance statement Feature normalization plays a crucial role in the preprocessing and influences the predictive performance and the selected features, complicating feature interpretation. Key points • The impact of feature normalization methods on radiomic models was measured. • Normalization methods performed similarly on average, but differed more strongly on some datasets. • Different methods led to different sets of selected features, impeding feature interpretation. • Model calibration was not largely affected by the normalization method. Graphical Abstracthttps://doi.org/10.1186/s13244-023-01575-7Feature normalizationFeature scalingFeature selectionRadiomicsHigh-dimensional datasets
spellingShingle	Aydin Demircioğlu The effect of feature normalization methods in radiomics Insights into Imaging Feature normalization Feature scaling Feature selection Radiomics High-dimensional datasets
title	The effect of feature normalization methods in radiomics
title_full	The effect of feature normalization methods in radiomics
title_fullStr	The effect of feature normalization methods in radiomics
title_full_unstemmed	The effect of feature normalization methods in radiomics
title_short	The effect of feature normalization methods in radiomics
title_sort	effect of feature normalization methods in radiomics
topic	Feature normalization Feature scaling Feature selection Radiomics High-dimensional datasets
url	https://doi.org/10.1186/s13244-023-01575-7
work_keys_str_mv	AT aydindemircioglu theeffectoffeaturenormalizationmethodsinradiomics AT aydindemircioglu effectoffeaturenormalizationmethodsinradiomics

The effect of feature normalization methods in radiomics

Similar Items