Machine Learning-Based Feature Selection and Classification for the Experimental Diagnosis of <i>Trypanosoma cruzi</i>

Chagas disease, caused by the <i>Trypanosoma cruzi</i> (<i>T. cruzi</i>) parasite, is the third most common parasitosis worldwide. Most of the infected subjects can remain asymptomatic without an opportune and early detection or an objective diagnostic is not conducted. Frequ...

Full description

Bibliographic Details
Main Authors: Nidiyare Hevia-Montiel, Jorge Perez-Gonzalez, Antonio Neme, Paulina Haro
Format: Article
Language:English
Published: MDPI AG 2022-03-01
Series:Electronics
Subjects:
Online Access:https://www.mdpi.com/2079-9292/11/5/785
_version_ 1797475368838365184
author Nidiyare Hevia-Montiel
Jorge Perez-Gonzalez
Antonio Neme
Paulina Haro
author_facet Nidiyare Hevia-Montiel
Jorge Perez-Gonzalez
Antonio Neme
Paulina Haro
author_sort Nidiyare Hevia-Montiel
collection DOAJ
description Chagas disease, caused by the <i>Trypanosoma cruzi</i> (<i>T. cruzi</i>) parasite, is the third most common parasitosis worldwide. Most of the infected subjects can remain asymptomatic without an opportune and early detection or an objective diagnostic is not conducted. Frequently, the disease manifests itself after a long time, accompanied by severe heart disease or by sudden death. Thus, the diagnosis is a complex and challenging process where several factors must be considered. In this paper, a novel pipeline is presented integrating temporal data from four modalities (electrocardiography signals, echocardiography images, Doppler spectrum, and ELISA antibody titers), multiple features selection analyses by a univariate analysis and a machine learning-based selection. The method includes an automatic dichotomous classification of animal status (control vs. infected) based on Random Forest, Extremely Randomized Trees, Decision Trees, and Support Vector Machine. The most relevant multimodal attributes found were ELISA (IgGT, IgG1, IgG2a), electrocardiography (SR mean, QT and ST intervals), ascending aorta Doppler signals, and echocardiography (left ventricle diameter during diastole). Concerning automatic classification from selected features, the best accuracy of control vs. acute infection groups was 93.3 ± 13.3% for cross-validation and 100% in the final test; for control vs. chronic infection groups, it was 100% and 100%, respectively. We conclude that the proposed machine learning-based approach can be of help to obtain a robust and objective diagnosis in early <i>T. cruzi</i> infection stages.
first_indexed 2024-03-09T20:43:10Z
format Article
id doaj.art-d1c4cae1ba6b4455bfde741aa844eeec
institution Directory Open Access Journal
issn 2079-9292
language English
last_indexed 2024-03-09T20:43:10Z
publishDate 2022-03-01
publisher MDPI AG
record_format Article
series Electronics
spelling doaj.art-d1c4cae1ba6b4455bfde741aa844eeec2023-11-23T22:53:56ZengMDPI AGElectronics2079-92922022-03-0111578510.3390/electronics11050785Machine Learning-Based Feature Selection and Classification for the Experimental Diagnosis of <i>Trypanosoma cruzi</i>Nidiyare Hevia-Montiel0Jorge Perez-Gonzalez1Antonio Neme2Paulina Haro3Unidad Académica del Instituto de Investigaciones en Matemáticas Aplicadas y en Sistemas del Estado de Yucatán, Universidad Nacional Autónoma de México, Mérida 97302, Yucatán, MexicoUnidad Académica del Instituto de Investigaciones en Matemáticas Aplicadas y en Sistemas del Estado de Yucatán, Universidad Nacional Autónoma de México, Mérida 97302, Yucatán, MexicoUnidad Académica del Instituto de Investigaciones en Matemáticas Aplicadas y en Sistemas del Estado de Yucatán, Universidad Nacional Autónoma de México, Mérida 97302, Yucatán, MexicoInstituto de Investigaciones en Ciencias Veterinarias, Universidad Autónoma de Baja California, Mexicali 21386, Baja California, MexicoChagas disease, caused by the <i>Trypanosoma cruzi</i> (<i>T. cruzi</i>) parasite, is the third most common parasitosis worldwide. Most of the infected subjects can remain asymptomatic without an opportune and early detection or an objective diagnostic is not conducted. Frequently, the disease manifests itself after a long time, accompanied by severe heart disease or by sudden death. Thus, the diagnosis is a complex and challenging process where several factors must be considered. In this paper, a novel pipeline is presented integrating temporal data from four modalities (electrocardiography signals, echocardiography images, Doppler spectrum, and ELISA antibody titers), multiple features selection analyses by a univariate analysis and a machine learning-based selection. The method includes an automatic dichotomous classification of animal status (control vs. infected) based on Random Forest, Extremely Randomized Trees, Decision Trees, and Support Vector Machine. The most relevant multimodal attributes found were ELISA (IgGT, IgG1, IgG2a), electrocardiography (SR mean, QT and ST intervals), ascending aorta Doppler signals, and echocardiography (left ventricle diameter during diastole). Concerning automatic classification from selected features, the best accuracy of control vs. acute infection groups was 93.3 ± 13.3% for cross-validation and 100% in the final test; for control vs. chronic infection groups, it was 100% and 100%, respectively. We conclude that the proposed machine learning-based approach can be of help to obtain a robust and objective diagnosis in early <i>T. cruzi</i> infection stages.https://www.mdpi.com/2079-9292/11/5/785machine learningfeature selectionmultivariate analysisclassificationChagas disease<i>Trypanosoma cruzi</i>
spellingShingle Nidiyare Hevia-Montiel
Jorge Perez-Gonzalez
Antonio Neme
Paulina Haro
Machine Learning-Based Feature Selection and Classification for the Experimental Diagnosis of <i>Trypanosoma cruzi</i>
Electronics
machine learning
feature selection
multivariate analysis
classification
Chagas disease
<i>Trypanosoma cruzi</i>
title Machine Learning-Based Feature Selection and Classification for the Experimental Diagnosis of <i>Trypanosoma cruzi</i>
title_full Machine Learning-Based Feature Selection and Classification for the Experimental Diagnosis of <i>Trypanosoma cruzi</i>
title_fullStr Machine Learning-Based Feature Selection and Classification for the Experimental Diagnosis of <i>Trypanosoma cruzi</i>
title_full_unstemmed Machine Learning-Based Feature Selection and Classification for the Experimental Diagnosis of <i>Trypanosoma cruzi</i>
title_short Machine Learning-Based Feature Selection and Classification for the Experimental Diagnosis of <i>Trypanosoma cruzi</i>
title_sort machine learning based feature selection and classification for the experimental diagnosis of i trypanosoma cruzi i
topic machine learning
feature selection
multivariate analysis
classification
Chagas disease
<i>Trypanosoma cruzi</i>
url https://www.mdpi.com/2079-9292/11/5/785
work_keys_str_mv AT nidiyareheviamontiel machinelearningbasedfeatureselectionandclassificationfortheexperimentaldiagnosisofitrypanosomacruzii
AT jorgeperezgonzalez machinelearningbasedfeatureselectionandclassificationfortheexperimentaldiagnosisofitrypanosomacruzii
AT antonioneme machinelearningbasedfeatureselectionandclassificationfortheexperimentaldiagnosisofitrypanosomacruzii
AT paulinaharo machinelearningbasedfeatureselectionandclassificationfortheexperimentaldiagnosisofitrypanosomacruzii