SSC: The novel self-stack ensemble model for thyroid disease prediction.

Thyroid disease presents a significant health risk, lowering the quality of life and increasing treatment costs. The diagnosis of thyroid disease can be challenging, especially for inexperienced practitioners. Machine learning has been established as one of the methods for disease diagnosis based on...

Full description

Bibliographic Details
Main Author: Shengjun Ji
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2024-01-01
Series:PLoS ONE
Online Access:https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0295501&type=printable
_version_ 1797361195151261696
author Shengjun Ji
author_facet Shengjun Ji
author_sort Shengjun Ji
collection DOAJ
description Thyroid disease presents a significant health risk, lowering the quality of life and increasing treatment costs. The diagnosis of thyroid disease can be challenging, especially for inexperienced practitioners. Machine learning has been established as one of the methods for disease diagnosis based on previous studies. This research introduces a novel and more effective technique for predicting thyroid disease by utilizing machine learning methodologies, surpassing the performance of previous studies in this field. This study utilizes the UCI thyroid disease dataset, which consists of 9172 samples and 30 features, and exhibits a highly imbalanced target class distribution. However, machine learning algorithms trained on imbalanced thyroid disease data face challenges in reliably detecting minority data and disease. To address this issue, re-sampling is employed, which modifies the ratio between target classes to balance the data. In this study, the down-sampling approach is utilized to achieve a balanced distribution of target classes. A novel RF-based self-stacking classifier is presented in this research for efficient thyroid disease detection. The proposed approach demonstrates the ability to diagnose primary hypothyroidism, increased binding protein, compensated hypothyroidism, and concurrent non-thyroidal illness with an accuracy of 99.5%. The recommended model exhibits state-of-the-art performance, achieving 100% macro precision, 100% macro recall, and 100% macro F1-score. A thorough comparative assessment is conducted to demonstrate the viability of the proposed approach, including several machine learning classifiers, deep neural networks, and ensemble voting classifiers. The results of K-fold cross-validation provide further support for the efficacy of the proposed self-stacking classifier.
first_indexed 2024-03-08T15:50:27Z
format Article
id doaj.art-2847a20471464742b47cb2ffc45bb843
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-03-08T15:50:27Z
publishDate 2024-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-2847a20471464742b47cb2ffc45bb8432024-01-09T05:31:08ZengPublic Library of Science (PLoS)PLoS ONE1932-62032024-01-01191e029550110.1371/journal.pone.0295501SSC: The novel self-stack ensemble model for thyroid disease prediction.Shengjun JiThyroid disease presents a significant health risk, lowering the quality of life and increasing treatment costs. The diagnosis of thyroid disease can be challenging, especially for inexperienced practitioners. Machine learning has been established as one of the methods for disease diagnosis based on previous studies. This research introduces a novel and more effective technique for predicting thyroid disease by utilizing machine learning methodologies, surpassing the performance of previous studies in this field. This study utilizes the UCI thyroid disease dataset, which consists of 9172 samples and 30 features, and exhibits a highly imbalanced target class distribution. However, machine learning algorithms trained on imbalanced thyroid disease data face challenges in reliably detecting minority data and disease. To address this issue, re-sampling is employed, which modifies the ratio between target classes to balance the data. In this study, the down-sampling approach is utilized to achieve a balanced distribution of target classes. A novel RF-based self-stacking classifier is presented in this research for efficient thyroid disease detection. The proposed approach demonstrates the ability to diagnose primary hypothyroidism, increased binding protein, compensated hypothyroidism, and concurrent non-thyroidal illness with an accuracy of 99.5%. The recommended model exhibits state-of-the-art performance, achieving 100% macro precision, 100% macro recall, and 100% macro F1-score. A thorough comparative assessment is conducted to demonstrate the viability of the proposed approach, including several machine learning classifiers, deep neural networks, and ensemble voting classifiers. The results of K-fold cross-validation provide further support for the efficacy of the proposed self-stacking classifier.https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0295501&type=printable
spellingShingle Shengjun Ji
SSC: The novel self-stack ensemble model for thyroid disease prediction.
PLoS ONE
title SSC: The novel self-stack ensemble model for thyroid disease prediction.
title_full SSC: The novel self-stack ensemble model for thyroid disease prediction.
title_fullStr SSC: The novel self-stack ensemble model for thyroid disease prediction.
title_full_unstemmed SSC: The novel self-stack ensemble model for thyroid disease prediction.
title_short SSC: The novel self-stack ensemble model for thyroid disease prediction.
title_sort ssc the novel self stack ensemble model for thyroid disease prediction
url https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0295501&type=printable
work_keys_str_mv AT shengjunji sscthenovelselfstackensemblemodelforthyroiddiseaseprediction