Evaluating the risk of endometriosis based on patients’ self-assessment questionnaires
Abstract Background Endometriosis is a condition that significantly affects the quality of life of about 10 % of reproductive-aged women. It is characterized by the presence of tissue similar to the uterine lining (endometrium) outside the uterus, which can lead lead scarring, adhesions, pain, and f...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2023-10-01
|
Series: | Reproductive Biology and Endocrinology |
Subjects: | |
Online Access: | https://doi.org/10.1186/s12958-023-01156-9 |
_version_ | 1797647132052684800 |
---|---|
author | Krystian Zieliński Dajana Drabczyk Michał Kunicki Damian Drzyzga Anna Kloska Jacek Rumiński |
author_facet | Krystian Zieliński Dajana Drabczyk Michał Kunicki Damian Drzyzga Anna Kloska Jacek Rumiński |
author_sort | Krystian Zieliński |
collection | DOAJ |
description | Abstract Background Endometriosis is a condition that significantly affects the quality of life of about 10 % of reproductive-aged women. It is characterized by the presence of tissue similar to the uterine lining (endometrium) outside the uterus, which can lead lead scarring, adhesions, pain, and fertility issues. While numerous factors associated with endometriosis are documented, a wide range of symptoms may still be undiscovered. Methods In this study, we employed machine learning algorithms to predict endometriosis based on the patient symptoms extracted from 13,933 questionnaires. We compared the results of feature selection obtained from various algorithms (i.e., Boruta algorithm, Recursive Feature Selection) with experts’ decisions. As a benchmark model architecture, we utilized a LightGBM algorithm, along with Multivariate Imputation by Chained Equations (MICE) and k-nearest neighbors (KNN), for missing data imputation. Our primary objective was to assess the model’s performance and feature importance compared to existing studies. Results We identified the top 20 predictors of endometriosis, uncovering previously overlooked features such as Cesarean section, ovarian cysts, and hernia. Notably, the model’s performance metrics were maximized when utilizing a combination of multiple feature selection methods. Specifically, the final model achieved an area under the receiver operator characteristic curve (AUC) of 0.85 on the training dataset and an AUC of 0.82 on the testing dataset. Conclusions The application of machine learning in diagnosing endometriosis has the potential to significantly impact clinical practice, streamlining the diagnostic process and enhancing efficiency. Our questionnaire-based prediction approach empowers individuals with endometriosis to proactively identify potential symptoms, facilitating informed discussions with healthcare professionals about diagnosis and treatment options. |
first_indexed | 2024-03-11T15:12:56Z |
format | Article |
id | doaj.art-932ba7b75a0b4628b97e077fffbab038 |
institution | Directory Open Access Journal |
issn | 1477-7827 |
language | English |
last_indexed | 2024-03-11T15:12:56Z |
publishDate | 2023-10-01 |
publisher | BMC |
record_format | Article |
series | Reproductive Biology and Endocrinology |
spelling | doaj.art-932ba7b75a0b4628b97e077fffbab0382023-10-29T12:40:31ZengBMCReproductive Biology and Endocrinology1477-78272023-10-0121111310.1186/s12958-023-01156-9Evaluating the risk of endometriosis based on patients’ self-assessment questionnairesKrystian Zieliński0Dajana Drabczyk1Michał Kunicki2Damian Drzyzga3Anna Kloska4Jacek Rumiński5INVICTA, Research and Development CenterINVICTA, Research and Development CenterINVICTA, Research and Development CenterINVICTA, Research and Development CenterINVICTA, Research and Development CenterDepartment of Biomedical Engineering, Faculty of Electronics, Telecommunications and Informatics, Gdańsk University of TechnologyAbstract Background Endometriosis is a condition that significantly affects the quality of life of about 10 % of reproductive-aged women. It is characterized by the presence of tissue similar to the uterine lining (endometrium) outside the uterus, which can lead lead scarring, adhesions, pain, and fertility issues. While numerous factors associated with endometriosis are documented, a wide range of symptoms may still be undiscovered. Methods In this study, we employed machine learning algorithms to predict endometriosis based on the patient symptoms extracted from 13,933 questionnaires. We compared the results of feature selection obtained from various algorithms (i.e., Boruta algorithm, Recursive Feature Selection) with experts’ decisions. As a benchmark model architecture, we utilized a LightGBM algorithm, along with Multivariate Imputation by Chained Equations (MICE) and k-nearest neighbors (KNN), for missing data imputation. Our primary objective was to assess the model’s performance and feature importance compared to existing studies. Results We identified the top 20 predictors of endometriosis, uncovering previously overlooked features such as Cesarean section, ovarian cysts, and hernia. Notably, the model’s performance metrics were maximized when utilizing a combination of multiple feature selection methods. Specifically, the final model achieved an area under the receiver operator characteristic curve (AUC) of 0.85 on the training dataset and an AUC of 0.82 on the testing dataset. Conclusions The application of machine learning in diagnosing endometriosis has the potential to significantly impact clinical practice, streamlining the diagnostic process and enhancing efficiency. Our questionnaire-based prediction approach empowers individuals with endometriosis to proactively identify potential symptoms, facilitating informed discussions with healthcare professionals about diagnosis and treatment options.https://doi.org/10.1186/s12958-023-01156-9EndometriosisQuestionnaireMachine learningSymptom-based predictionFertility |
spellingShingle | Krystian Zieliński Dajana Drabczyk Michał Kunicki Damian Drzyzga Anna Kloska Jacek Rumiński Evaluating the risk of endometriosis based on patients’ self-assessment questionnaires Reproductive Biology and Endocrinology Endometriosis Questionnaire Machine learning Symptom-based prediction Fertility |
title | Evaluating the risk of endometriosis based on patients’ self-assessment questionnaires |
title_full | Evaluating the risk of endometriosis based on patients’ self-assessment questionnaires |
title_fullStr | Evaluating the risk of endometriosis based on patients’ self-assessment questionnaires |
title_full_unstemmed | Evaluating the risk of endometriosis based on patients’ self-assessment questionnaires |
title_short | Evaluating the risk of endometriosis based on patients’ self-assessment questionnaires |
title_sort | evaluating the risk of endometriosis based on patients self assessment questionnaires |
topic | Endometriosis Questionnaire Machine learning Symptom-based prediction Fertility |
url | https://doi.org/10.1186/s12958-023-01156-9 |
work_keys_str_mv | AT krystianzielinski evaluatingtheriskofendometriosisbasedonpatientsselfassessmentquestionnaires AT dajanadrabczyk evaluatingtheriskofendometriosisbasedonpatientsselfassessmentquestionnaires AT michałkunicki evaluatingtheriskofendometriosisbasedonpatientsselfassessmentquestionnaires AT damiandrzyzga evaluatingtheriskofendometriosisbasedonpatientsselfassessmentquestionnaires AT annakloska evaluatingtheriskofendometriosisbasedonpatientsselfassessmentquestionnaires AT jacekruminski evaluatingtheriskofendometriosisbasedonpatientsselfassessmentquestionnaires |