ORIENTATE: automated machine learning classifiers for oral health prediction and research
Abstract Background The application of data-driven methods is expected to play an increasingly important role in healthcare. However, a lack of personnel with the necessary skills to develop these models and interpret its output is preventing a wider adoption of these methods. To address this gap, w...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2023-06-01
|
Series: | BMC Oral Health |
Subjects: | |
Online Access: | https://doi.org/10.1186/s12903-023-03112-w |
_version_ | 1797795487951093760 |
---|---|
author | Inmaculada Gomez-Rios Esteban Egea-Lopez Antonio José Ortiz Ruiz |
author_facet | Inmaculada Gomez-Rios Esteban Egea-Lopez Antonio José Ortiz Ruiz |
author_sort | Inmaculada Gomez-Rios |
collection | DOAJ |
description | Abstract Background The application of data-driven methods is expected to play an increasingly important role in healthcare. However, a lack of personnel with the necessary skills to develop these models and interpret its output is preventing a wider adoption of these methods. To address this gap, we introduce and describe ORIENTATE, a software for automated application of machine learning classification algorithms by clinical practitioners lacking specific technical skills. ORIENTATE allows the selection of features and the target variable, then automatically generates a number of classification models and cross-validates them, finding the best model and evaluating it. It also implements a custom feature selection algorithm for systematic searches of the best combination of predictors for a given target variable. Finally, it outputs a comprehensive report with graphs that facilitates the explanation of the classification model results, using global interpretation methods, and an interface for the prediction of new input samples. Feature relevance and interaction plots provided by ORIENTATE allow to use it for statistical inference, which can replace and/or complement classical statistical studies. Results Its application to a dataset with healthy and special health care needs (SHCN) children, treated under deep sedation, was discussed as case study. On the example dataset, despite its small size, the feature selection algorithm found a set of features able to predict the need for a second sedation with a f1 score of 0.83 and a ROC (AUC) of 0.92. Eight predictive factors for both populations were found and ordered by the relevance assigned to them by the model. A discussion of how to derive inferences from the relevance and interaction plots and a comparison with a classical study is also provided. Conclusions ORIENTATE automatically finds suitable features and generates accurate classifiers which can be used in preventive tasks. In addition, researchers without specific skills on data methods can use it for the application of machine learning classification and as a complement to classical studies for inferential analysis of features. In the case study, a high prediction accuracy for a second sedation in SHCN children was achieved. The analysis of the relevance of the features showed that the number of teeth with pulpar treatments at the first sedation is a predictive factor for a second sedation. |
first_indexed | 2024-03-13T03:18:42Z |
format | Article |
id | doaj.art-fe62c30b675847f580489a4502573be8 |
institution | Directory Open Access Journal |
issn | 1472-6831 |
language | English |
last_indexed | 2024-03-13T03:18:42Z |
publishDate | 2023-06-01 |
publisher | BMC |
record_format | Article |
series | BMC Oral Health |
spelling | doaj.art-fe62c30b675847f580489a4502573be82023-06-25T11:30:57ZengBMCBMC Oral Health1472-68312023-06-0123111910.1186/s12903-023-03112-wORIENTATE: automated machine learning classifiers for oral health prediction and researchInmaculada Gomez-Rios0Esteban Egea-Lopez1Antonio José Ortiz Ruiz2Department of Dermatology, Stomatology, Radiology and Physical Medicine, Universidad de MurciaDept. Information Technologies and Communications, Universidad Politecnica de Cartagena (UPCT)Department of Dermatology, Stomatology, Radiology and Physical Medicine, Universidad de MurciaAbstract Background The application of data-driven methods is expected to play an increasingly important role in healthcare. However, a lack of personnel with the necessary skills to develop these models and interpret its output is preventing a wider adoption of these methods. To address this gap, we introduce and describe ORIENTATE, a software for automated application of machine learning classification algorithms by clinical practitioners lacking specific technical skills. ORIENTATE allows the selection of features and the target variable, then automatically generates a number of classification models and cross-validates them, finding the best model and evaluating it. It also implements a custom feature selection algorithm for systematic searches of the best combination of predictors for a given target variable. Finally, it outputs a comprehensive report with graphs that facilitates the explanation of the classification model results, using global interpretation methods, and an interface for the prediction of new input samples. Feature relevance and interaction plots provided by ORIENTATE allow to use it for statistical inference, which can replace and/or complement classical statistical studies. Results Its application to a dataset with healthy and special health care needs (SHCN) children, treated under deep sedation, was discussed as case study. On the example dataset, despite its small size, the feature selection algorithm found a set of features able to predict the need for a second sedation with a f1 score of 0.83 and a ROC (AUC) of 0.92. Eight predictive factors for both populations were found and ordered by the relevance assigned to them by the model. A discussion of how to derive inferences from the relevance and interaction plots and a comparison with a classical study is also provided. Conclusions ORIENTATE automatically finds suitable features and generates accurate classifiers which can be used in preventive tasks. In addition, researchers without specific skills on data methods can use it for the application of machine learning classification and as a complement to classical studies for inferential analysis of features. In the case study, a high prediction accuracy for a second sedation in SHCN children was achieved. The analysis of the relevance of the features showed that the number of teeth with pulpar treatments at the first sedation is a predictive factor for a second sedation.https://doi.org/10.1186/s12903-023-03112-wMachine learningClassificationSpecial health care needsDeep sedationPredictive dentistrySecond sedation risk |
spellingShingle | Inmaculada Gomez-Rios Esteban Egea-Lopez Antonio José Ortiz Ruiz ORIENTATE: automated machine learning classifiers for oral health prediction and research BMC Oral Health Machine learning Classification Special health care needs Deep sedation Predictive dentistry Second sedation risk |
title | ORIENTATE: automated machine learning classifiers for oral health prediction and research |
title_full | ORIENTATE: automated machine learning classifiers for oral health prediction and research |
title_fullStr | ORIENTATE: automated machine learning classifiers for oral health prediction and research |
title_full_unstemmed | ORIENTATE: automated machine learning classifiers for oral health prediction and research |
title_short | ORIENTATE: automated machine learning classifiers for oral health prediction and research |
title_sort | orientate automated machine learning classifiers for oral health prediction and research |
topic | Machine learning Classification Special health care needs Deep sedation Predictive dentistry Second sedation risk |
url | https://doi.org/10.1186/s12903-023-03112-w |
work_keys_str_mv | AT inmaculadagomezrios orientateautomatedmachinelearningclassifiersfororalhealthpredictionandresearch AT estebanegealopez orientateautomatedmachinelearningclassifiersfororalhealthpredictionandresearch AT antoniojoseortizruiz orientateautomatedmachinelearningclassifiersfororalhealthpredictionandresearch |