A Maximum-Entropy Fuzzy Clustering Approach for Cancer Detection When Data Are Uncertain
(1) Background: Cancer is a leading cause of death worldwide and each year, approximately 400,000 children develop cancer. Early detection of cancer greatly increases the chances for successful treatment, while screening aims to identify individuals with findings suggestive of specific cancer or pre...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2023-02-01
|
Series: | Applied Sciences |
Subjects: | |
Online Access: | https://www.mdpi.com/2076-3417/13/4/2191 |
_version_ | 1827758832431923200 |
---|---|
author | Mario Fordellone Ilaria De Benedictis Dario Bruzzese Paolo Chiodini |
author_facet | Mario Fordellone Ilaria De Benedictis Dario Bruzzese Paolo Chiodini |
author_sort | Mario Fordellone |
collection | DOAJ |
description | (1) Background: Cancer is a leading cause of death worldwide and each year, approximately 400,000 children develop cancer. Early detection of cancer greatly increases the chances for successful treatment, while screening aims to identify individuals with findings suggestive of specific cancer or pre-cancer before they have developed symptoms. Precise detection, however, often mainly relies on human experience and this could suffer from human error and error with a visual inspection. (2) Methods: The research of statistical approaches to analyze the complex structure of data is increasing. In this work, an entropy-based fuzzy clustering technique for interval-valued data (EFC-ID) for cancer detection is suggested. (3) Results: The application on the Breast dataset shows that EFC-ID performs better than the conventional FKM in terms of AUC value (EFC-ID = 0.96, FKM = 0.88), sensitivity (EFC-ID = 0.90, FKM = 0.64), and specificity (EFC-ID = 0.93, FKM = 0.92). Furthermore, the application on the Multiple Myeloma data shows that EFC-ID performs better than the conventional FKM in terms of Chi-squared (EFC-ID = 91.64, FKM = 88.26), Accuracy rate (EFC-ID = 0.71, FKM = 0.60), and Adjusted Rand Index (EFC-ID = 0.33, FKM = 0.21). (4) Conclusions: In all cases, the proposed approach has shown good performance in identifying the natural partition and the advantages of the use of EFC-ID have been detailed illustrated. |
first_indexed | 2024-03-11T09:13:05Z |
format | Article |
id | doaj.art-75373a35325144d99d4b34bfe7af1a55 |
institution | Directory Open Access Journal |
issn | 2076-3417 |
language | English |
last_indexed | 2024-03-11T09:13:05Z |
publishDate | 2023-02-01 |
publisher | MDPI AG |
record_format | Article |
series | Applied Sciences |
spelling | doaj.art-75373a35325144d99d4b34bfe7af1a552023-11-16T18:52:09ZengMDPI AGApplied Sciences2076-34172023-02-01134219110.3390/app13042191A Maximum-Entropy Fuzzy Clustering Approach for Cancer Detection When Data Are UncertainMario Fordellone0Ilaria De Benedictis1Dario Bruzzese2Paolo Chiodini3Medical Statistics Unit, Universitiy of Campania “Luigi Vanvitelli”, 81100 Naples, ItalyUniversitiy of Campania “Luigi Vanvitelli”, 81100 Naples, ItalyDepartment of Public Health, University of Naples Federico II, 80131 Naples, ItalyMedical Statistics Unit, Universitiy of Campania “Luigi Vanvitelli”, 81100 Naples, Italy(1) Background: Cancer is a leading cause of death worldwide and each year, approximately 400,000 children develop cancer. Early detection of cancer greatly increases the chances for successful treatment, while screening aims to identify individuals with findings suggestive of specific cancer or pre-cancer before they have developed symptoms. Precise detection, however, often mainly relies on human experience and this could suffer from human error and error with a visual inspection. (2) Methods: The research of statistical approaches to analyze the complex structure of data is increasing. In this work, an entropy-based fuzzy clustering technique for interval-valued data (EFC-ID) for cancer detection is suggested. (3) Results: The application on the Breast dataset shows that EFC-ID performs better than the conventional FKM in terms of AUC value (EFC-ID = 0.96, FKM = 0.88), sensitivity (EFC-ID = 0.90, FKM = 0.64), and specificity (EFC-ID = 0.93, FKM = 0.92). Furthermore, the application on the Multiple Myeloma data shows that EFC-ID performs better than the conventional FKM in terms of Chi-squared (EFC-ID = 91.64, FKM = 88.26), Accuracy rate (EFC-ID = 0.71, FKM = 0.60), and Adjusted Rand Index (EFC-ID = 0.33, FKM = 0.21). (4) Conclusions: In all cases, the proposed approach has shown good performance in identifying the natural partition and the advantages of the use of EFC-ID have been detailed illustrated.https://www.mdpi.com/2076-3417/13/4/2191cancer detectioncancer classificationunsupervised classificationentropy regularization procedurepenalized classification modelinterval-valued data |
spellingShingle | Mario Fordellone Ilaria De Benedictis Dario Bruzzese Paolo Chiodini A Maximum-Entropy Fuzzy Clustering Approach for Cancer Detection When Data Are Uncertain Applied Sciences cancer detection cancer classification unsupervised classification entropy regularization procedure penalized classification model interval-valued data |
title | A Maximum-Entropy Fuzzy Clustering Approach for Cancer Detection When Data Are Uncertain |
title_full | A Maximum-Entropy Fuzzy Clustering Approach for Cancer Detection When Data Are Uncertain |
title_fullStr | A Maximum-Entropy Fuzzy Clustering Approach for Cancer Detection When Data Are Uncertain |
title_full_unstemmed | A Maximum-Entropy Fuzzy Clustering Approach for Cancer Detection When Data Are Uncertain |
title_short | A Maximum-Entropy Fuzzy Clustering Approach for Cancer Detection When Data Are Uncertain |
title_sort | maximum entropy fuzzy clustering approach for cancer detection when data are uncertain |
topic | cancer detection cancer classification unsupervised classification entropy regularization procedure penalized classification model interval-valued data |
url | https://www.mdpi.com/2076-3417/13/4/2191 |
work_keys_str_mv | AT mariofordellone amaximumentropyfuzzyclusteringapproachforcancerdetectionwhendataareuncertain AT ilariadebenedictis amaximumentropyfuzzyclusteringapproachforcancerdetectionwhendataareuncertain AT dariobruzzese amaximumentropyfuzzyclusteringapproachforcancerdetectionwhendataareuncertain AT paolochiodini amaximumentropyfuzzyclusteringapproachforcancerdetectionwhendataareuncertain AT mariofordellone maximumentropyfuzzyclusteringapproachforcancerdetectionwhendataareuncertain AT ilariadebenedictis maximumentropyfuzzyclusteringapproachforcancerdetectionwhendataareuncertain AT dariobruzzese maximumentropyfuzzyclusteringapproachforcancerdetectionwhendataareuncertain AT paolochiodini maximumentropyfuzzyclusteringapproachforcancerdetectionwhendataareuncertain |