A Maximum-Entropy Fuzzy Clustering Approach for Cancer Detection When Data Are Uncertain

(1) Background: Cancer is a leading cause of death worldwide and each year, approximately 400,000 children develop cancer. Early detection of cancer greatly increases the chances for successful treatment, while screening aims to identify individuals with findings suggestive of specific cancer or pre...

Full description

Bibliographic Details
Main Authors: Mario Fordellone, Ilaria De Benedictis, Dario Bruzzese, Paolo Chiodini
Format: Article
Language:English
Published: MDPI AG 2023-02-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/13/4/2191
_version_ 1827758832431923200
author Mario Fordellone
Ilaria De Benedictis
Dario Bruzzese
Paolo Chiodini
author_facet Mario Fordellone
Ilaria De Benedictis
Dario Bruzzese
Paolo Chiodini
author_sort Mario Fordellone
collection DOAJ
description (1) Background: Cancer is a leading cause of death worldwide and each year, approximately 400,000 children develop cancer. Early detection of cancer greatly increases the chances for successful treatment, while screening aims to identify individuals with findings suggestive of specific cancer or pre-cancer before they have developed symptoms. Precise detection, however, often mainly relies on human experience and this could suffer from human error and error with a visual inspection. (2) Methods: The research of statistical approaches to analyze the complex structure of data is increasing. In this work, an entropy-based fuzzy clustering technique for interval-valued data (EFC-ID) for cancer detection is suggested. (3) Results: The application on the Breast dataset shows that EFC-ID performs better than the conventional FKM in terms of AUC value (EFC-ID = 0.96, FKM = 0.88), sensitivity (EFC-ID = 0.90, FKM = 0.64), and specificity (EFC-ID = 0.93, FKM = 0.92). Furthermore, the application on the Multiple Myeloma data shows that EFC-ID performs better than the conventional FKM in terms of Chi-squared (EFC-ID = 91.64, FKM = 88.26), Accuracy rate (EFC-ID = 0.71, FKM = 0.60), and Adjusted Rand Index (EFC-ID = 0.33, FKM = 0.21). (4) Conclusions: In all cases, the proposed approach has shown good performance in identifying the natural partition and the advantages of the use of EFC-ID have been detailed illustrated.
first_indexed 2024-03-11T09:13:05Z
format Article
id doaj.art-75373a35325144d99d4b34bfe7af1a55
institution Directory Open Access Journal
issn 2076-3417
language English
last_indexed 2024-03-11T09:13:05Z
publishDate 2023-02-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-75373a35325144d99d4b34bfe7af1a552023-11-16T18:52:09ZengMDPI AGApplied Sciences2076-34172023-02-01134219110.3390/app13042191A Maximum-Entropy Fuzzy Clustering Approach for Cancer Detection When Data Are UncertainMario Fordellone0Ilaria De Benedictis1Dario Bruzzese2Paolo Chiodini3Medical Statistics Unit, Universitiy of Campania “Luigi Vanvitelli”, 81100 Naples, ItalyUniversitiy of Campania “Luigi Vanvitelli”, 81100 Naples, ItalyDepartment of Public Health, University of Naples Federico II, 80131 Naples, ItalyMedical Statistics Unit, Universitiy of Campania “Luigi Vanvitelli”, 81100 Naples, Italy(1) Background: Cancer is a leading cause of death worldwide and each year, approximately 400,000 children develop cancer. Early detection of cancer greatly increases the chances for successful treatment, while screening aims to identify individuals with findings suggestive of specific cancer or pre-cancer before they have developed symptoms. Precise detection, however, often mainly relies on human experience and this could suffer from human error and error with a visual inspection. (2) Methods: The research of statistical approaches to analyze the complex structure of data is increasing. In this work, an entropy-based fuzzy clustering technique for interval-valued data (EFC-ID) for cancer detection is suggested. (3) Results: The application on the Breast dataset shows that EFC-ID performs better than the conventional FKM in terms of AUC value (EFC-ID = 0.96, FKM = 0.88), sensitivity (EFC-ID = 0.90, FKM = 0.64), and specificity (EFC-ID = 0.93, FKM = 0.92). Furthermore, the application on the Multiple Myeloma data shows that EFC-ID performs better than the conventional FKM in terms of Chi-squared (EFC-ID = 91.64, FKM = 88.26), Accuracy rate (EFC-ID = 0.71, FKM = 0.60), and Adjusted Rand Index (EFC-ID = 0.33, FKM = 0.21). (4) Conclusions: In all cases, the proposed approach has shown good performance in identifying the natural partition and the advantages of the use of EFC-ID have been detailed illustrated.https://www.mdpi.com/2076-3417/13/4/2191cancer detectioncancer classificationunsupervised classificationentropy regularization procedurepenalized classification modelinterval-valued data
spellingShingle Mario Fordellone
Ilaria De Benedictis
Dario Bruzzese
Paolo Chiodini
A Maximum-Entropy Fuzzy Clustering Approach for Cancer Detection When Data Are Uncertain
Applied Sciences
cancer detection
cancer classification
unsupervised classification
entropy regularization procedure
penalized classification model
interval-valued data
title A Maximum-Entropy Fuzzy Clustering Approach for Cancer Detection When Data Are Uncertain
title_full A Maximum-Entropy Fuzzy Clustering Approach for Cancer Detection When Data Are Uncertain
title_fullStr A Maximum-Entropy Fuzzy Clustering Approach for Cancer Detection When Data Are Uncertain
title_full_unstemmed A Maximum-Entropy Fuzzy Clustering Approach for Cancer Detection When Data Are Uncertain
title_short A Maximum-Entropy Fuzzy Clustering Approach for Cancer Detection When Data Are Uncertain
title_sort maximum entropy fuzzy clustering approach for cancer detection when data are uncertain
topic cancer detection
cancer classification
unsupervised classification
entropy regularization procedure
penalized classification model
interval-valued data
url https://www.mdpi.com/2076-3417/13/4/2191
work_keys_str_mv AT mariofordellone amaximumentropyfuzzyclusteringapproachforcancerdetectionwhendataareuncertain
AT ilariadebenedictis amaximumentropyfuzzyclusteringapproachforcancerdetectionwhendataareuncertain
AT dariobruzzese amaximumentropyfuzzyclusteringapproachforcancerdetectionwhendataareuncertain
AT paolochiodini amaximumentropyfuzzyclusteringapproachforcancerdetectionwhendataareuncertain
AT mariofordellone maximumentropyfuzzyclusteringapproachforcancerdetectionwhendataareuncertain
AT ilariadebenedictis maximumentropyfuzzyclusteringapproachforcancerdetectionwhendataareuncertain
AT dariobruzzese maximumentropyfuzzyclusteringapproachforcancerdetectionwhendataareuncertain
AT paolochiodini maximumentropyfuzzyclusteringapproachforcancerdetectionwhendataareuncertain