Applied Machine Learning Techniques to Diagnose Voice-Affecting Conditions and Disorders: Systematic Literature Review

Background: Normal voice production depends on the synchronized cooperation of multiple physiological systems, which makes the voice sensitive to changes. Any systematic, neurological, and aerodigestive distortion is prone to affect voice production through reduced cognitive, p...


Bibliographic Details
Main Authors: Alper Idrisoglu, Ana Luiza Dallora, Peter Anderberg, Johan Sanmartin Berglund
Format: Article
Language: English
Published: JMIR Publications 2023-07-01
Series: Journal of Medical Internet Research
Online Access: https://www.jmir.org/2023/1/e46105
_version_ 1797733994542923776
author Alper Idrisoglu
Ana Luiza Dallora
Peter Anderberg
Johan Sanmartin Berglund
collection DOAJ
description Background: Normal voice production depends on the synchronized cooperation of multiple physiological systems, which makes the voice sensitive to changes. Any systematic, neurological, and aerodigestive distortion is prone to affect voice production through reduced cognitive, pulmonary, and muscular functionality. This sensitivity inspired the use of voice as a biomarker to examine disorders that affect the voice. Technological improvements and emerging machine learning (ML) technologies have made it possible to extract digital vocal features from the voice for automated diagnosis and monitoring systems.
Objective: This study aims to provide a comprehensive overview of research on voice-affecting disorders that uses ML techniques for diagnosis and monitoring through voice samples, with systematic conditions, nonlaryngeal aerodigestive disorders, and neurological disorders specifically of interest.
Methods: This systematic literature review (SLR) investigated the state of the art of voice-based diagnostic and monitoring systems with ML technologies, targeting voice-affecting disorders without direct relation to the voice box from the point of view of applied health technology. Through a comprehensive search string, studies published from 2012 to 2022 in the databases Scopus, PubMed, and Web of Science were scanned and collected for assessment. To minimize bias, relevant references cited in other studies in the field were also retrieved, and 2 authors assessed the collected studies. Low-quality studies were removed through a quality assessment, and relevant data were extracted through summary tables for analysis. During screening, articles were checked for overlap between author groups to prevent cumulative redundancy bias, and only 1 article was included per author group.
Results: In the analysis of the 145 included studies, support vector machines were the most utilized ML technique (51/145, 35.2%), and the most studied disease was Parkinson disease (PD; reported in 87/145, 60%, of the studies). After 2017, 16 additional voice-affecting disorders were examined, in contrast to the 3 investigated previously. Furthermore, an upsurge in the use of artificial neural network–based architectures was observed after 2017. Almost half of the included studies were published in the last 2 years (2021 and 2022). A broad interest from many countries was observed. Notably, about one-half (n=75) of the studies relied on 10 distinct data sets, and 11/145 (7.6%) used demographic data as an input for ML models.
Conclusions: This SLR revealed considerable interest across multiple countries in using ML techniques for diagnosing and monitoring voice-affecting disorders, with PD being the most studied disorder. However, the review identified several gaps, including limited and unbalanced data set usage in studies and a focus on diagnostic tests rather than disorder-specific monitoring. Despite the limitation of covering only peer-reviewed publications written in English, the SLR provides valuable insights into the current state of research on ML-based diagnosis and monitoring of voice-affecting disorders and highlights areas to address in future research.
first_indexed 2024-03-12T12:37:48Z
format Article
id doaj.art-dd15b2a5beae4ae38306d15f24f10879
institution Directory Open Access Journal
issn 1438-8871
language English
last_indexed 2024-03-12T12:37:48Z
publishDate 2023-07-01
publisher JMIR Publications
record_format Article
series Journal of Medical Internet Research
spelling doaj.art-dd15b2a5beae4ae38306d15f24f10879
2023-08-29T00:05:04Z
eng
JMIR Publications
Journal of Medical Internet Research
1438-8871
2023-07-01
25
e46105
10.2196/46105
Applied Machine Learning Techniques to Diagnose Voice-Affecting Conditions and Disorders: Systematic Literature Review
Alper Idrisoglu https://orcid.org/0000-0003-1558-2309
Ana Luiza Dallora https://orcid.org/0000-0002-6752-017X
Peter Anderberg https://orcid.org/0000-0001-9870-8477
Johan Sanmartin Berglund https://orcid.org/0000-0003-4312-2246
https://www.jmir.org/2023/1/e46105
title Applied Machine Learning Techniques to Diagnose Voice-Affecting Conditions and Disorders: Systematic Literature Review
url https://www.jmir.org/2023/1/e46105