Applied Machine Learning Techniques to Diagnose Voice-Affecting Conditions and Disorders: Systematic Literature Review
BackgroundNormal voice production depends on the synchronized cooperation of multiple physiological systems, which makes the voice sensitive to changes. Any systematic, neurological, and aerodigestive distortion is prone to affect voice production through reduced cognitive, p...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
JMIR Publications
2023-07-01
|
Series: | Journal of Medical Internet Research |
Online Access: | https://www.jmir.org/2023/1/e46105 |
_version_ | 1797733994542923776 |
---|---|
author | Alper Idrisoglu Ana Luiza Dallora Peter Anderberg Johan Sanmartin Berglund |
author_facet | Alper Idrisoglu Ana Luiza Dallora Peter Anderberg Johan Sanmartin Berglund |
author_sort | Alper Idrisoglu |
collection | DOAJ |
description |
BackgroundNormal voice production depends on the synchronized cooperation of multiple physiological systems, which makes the voice sensitive to changes. Any systematic, neurological, and aerodigestive distortion is prone to affect voice production through reduced cognitive, pulmonary, and muscular functionality. This sensitivity inspired using voice as a biomarker to examine disorders that affect the voice. Technological improvements and emerging machine learning (ML) technologies have enabled possibilities of extracting digital vocal features from the voice for automated diagnosis and monitoring systems.
ObjectiveThis study aims to summarize a comprehensive view of research on voice-affecting disorders that uses ML techniques for diagnosis and monitoring through voice samples where systematic conditions, nonlaryngeal aerodigestive disorders, and neurological disorders are specifically of interest.
MethodsThis systematic literature review (SLR) investigated the state of the art of voice-based diagnostic and monitoring systems with ML technologies, targeting voice-affecting disorders without direct relation to the voice box from the point of view of applied health technology. Through a comprehensive search string, studies published from 2012 to 2022 from the databases Scopus, PubMed, and Web of Science were scanned and collected for assessment. To minimize bias, retrieval of the relevant references in other studies in the field was ensured, and 2 authors assessed the collected studies. Low-quality studies were removed through a quality assessment and relevant data were extracted through summary tables for analysis. The articles were checked for similarities between author groups to prevent cumulative redundancy bias during the screening process, where only 1 article was included from the same author group.
ResultsIn the analysis of the 145 included studies, support vector machines were the most utilized ML technique (51/145, 35.2%), with the most studied disease being Parkinson disease (PD; reported in 87/145, 60%, studies). After 2017, 16 additional voice-affecting disorders were examined, in contrast to the 3 investigated previously. Furthermore, an upsurge in the use of artificial neural network–based architectures was observed after 2017. Almost half of the included studies were published in last 2 years (2021 and 2022). A broad interest from many countries was observed. Notably, nearly one-half (n=75) of the studies relied on 10 distinct data sets, and 11/145 (7.6%) used demographic data as an input for ML models.
ConclusionsThis SLR revealed considerable interest across multiple countries in using ML techniques for diagnosing and monitoring voice-affecting disorders, with PD being the most studied disorder. However, the review identified several gaps, including limited and unbalanced data set usage in studies, and a focus on diagnostic test rather than disorder-specific monitoring. Despite the limitations of being constrained by only peer-reviewed publications written in English, the SLR provides valuable insights into the current state of research on ML-based voice-affecting disorder diagnosis and monitoring and highlighting areas to address in future research. |
first_indexed | 2024-03-12T12:37:48Z |
format | Article |
id | doaj.art-dd15b2a5beae4ae38306d15f24f10879 |
institution | Directory Open Access Journal |
issn | 1438-8871 |
language | English |
last_indexed | 2024-03-12T12:37:48Z |
publishDate | 2023-07-01 |
publisher | JMIR Publications |
record_format | Article |
series | Journal of Medical Internet Research |
spelling | doaj.art-dd15b2a5beae4ae38306d15f24f108792023-08-29T00:05:04ZengJMIR PublicationsJournal of Medical Internet Research1438-88712023-07-0125e4610510.2196/46105Applied Machine Learning Techniques to Diagnose Voice-Affecting Conditions and Disorders: Systematic Literature ReviewAlper Idrisogluhttps://orcid.org/0000-0003-1558-2309Ana Luiza Dallorahttps://orcid.org/0000-0002-6752-017XPeter Anderberghttps://orcid.org/0000-0001-9870-8477Johan Sanmartin Berglundhttps://orcid.org/0000-0003-4312-2246 BackgroundNormal voice production depends on the synchronized cooperation of multiple physiological systems, which makes the voice sensitive to changes. Any systematic, neurological, and aerodigestive distortion is prone to affect voice production through reduced cognitive, pulmonary, and muscular functionality. This sensitivity inspired using voice as a biomarker to examine disorders that affect the voice. Technological improvements and emerging machine learning (ML) technologies have enabled possibilities of extracting digital vocal features from the voice for automated diagnosis and monitoring systems. ObjectiveThis study aims to summarize a comprehensive view of research on voice-affecting disorders that uses ML techniques for diagnosis and monitoring through voice samples where systematic conditions, nonlaryngeal aerodigestive disorders, and neurological disorders are specifically of interest. MethodsThis systematic literature review (SLR) investigated the state of the art of voice-based diagnostic and monitoring systems with ML technologies, targeting voice-affecting disorders without direct relation to the voice box from the point of view of applied health technology. Through a comprehensive search string, studies published from 2012 to 2022 from the databases Scopus, PubMed, and Web of Science were scanned and collected for assessment. To minimize bias, retrieval of the relevant references in other studies in the field was ensured, and 2 authors assessed the collected studies. Low-quality studies were removed through a quality assessment and relevant data were extracted through summary tables for analysis. The articles were checked for similarities between author groups to prevent cumulative redundancy bias during the screening process, where only 1 article was included from the same author group. ResultsIn the analysis of the 145 included studies, support vector machines were the most utilized ML technique (51/145, 35.2%), with the most studied disease being Parkinson disease (PD; reported in 87/145, 60%, studies). After 2017, 16 additional voice-affecting disorders were examined, in contrast to the 3 investigated previously. Furthermore, an upsurge in the use of artificial neural network–based architectures was observed after 2017. Almost half of the included studies were published in last 2 years (2021 and 2022). A broad interest from many countries was observed. Notably, nearly one-half (n=75) of the studies relied on 10 distinct data sets, and 11/145 (7.6%) used demographic data as an input for ML models. ConclusionsThis SLR revealed considerable interest across multiple countries in using ML techniques for diagnosing and monitoring voice-affecting disorders, with PD being the most studied disorder. However, the review identified several gaps, including limited and unbalanced data set usage in studies, and a focus on diagnostic test rather than disorder-specific monitoring. Despite the limitations of being constrained by only peer-reviewed publications written in English, the SLR provides valuable insights into the current state of research on ML-based voice-affecting disorder diagnosis and monitoring and highlighting areas to address in future research.https://www.jmir.org/2023/1/e46105 |
spellingShingle | Alper Idrisoglu Ana Luiza Dallora Peter Anderberg Johan Sanmartin Berglund Applied Machine Learning Techniques to Diagnose Voice-Affecting Conditions and Disorders: Systematic Literature Review Journal of Medical Internet Research |
title | Applied Machine Learning Techniques to Diagnose Voice-Affecting Conditions and Disorders: Systematic Literature Review |
title_full | Applied Machine Learning Techniques to Diagnose Voice-Affecting Conditions and Disorders: Systematic Literature Review |
title_fullStr | Applied Machine Learning Techniques to Diagnose Voice-Affecting Conditions and Disorders: Systematic Literature Review |
title_full_unstemmed | Applied Machine Learning Techniques to Diagnose Voice-Affecting Conditions and Disorders: Systematic Literature Review |
title_short | Applied Machine Learning Techniques to Diagnose Voice-Affecting Conditions and Disorders: Systematic Literature Review |
title_sort | applied machine learning techniques to diagnose voice affecting conditions and disorders systematic literature review |
url | https://www.jmir.org/2023/1/e46105 |
work_keys_str_mv | AT alperidrisoglu appliedmachinelearningtechniquestodiagnosevoiceaffectingconditionsanddisorderssystematicliteraturereview AT analuizadallora appliedmachinelearningtechniquestodiagnosevoiceaffectingconditionsanddisorderssystematicliteraturereview AT peteranderberg appliedmachinelearningtechniquestodiagnosevoiceaffectingconditionsanddisorderssystematicliteraturereview AT johansanmartinberglund appliedmachinelearningtechniquestodiagnosevoiceaffectingconditionsanddisorderssystematicliteraturereview |