Acoustic Analysis and Prediction of Type 2 Diabetes Mellitus Using Smartphone-Recorded Voice Segments
Objective: To investigate the potential of voice analysis as a prescreening or monitoring tool for type 2 diabetes mellitus (T2DM) by examining the differences in voice recordings between nondiabetic and T2DM individuals. Patients and Methods: Total 267 participants diagnosed as nondiabetic (79 wome...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2023-12-01
|
Series: | Mayo Clinic Proceedings: Digital Health |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2949761223000731 |
_version_ | 1827304177985912832 |
---|---|
author | Jaycee M. Kaufman, MSc Anirudh Thommandram, MASc Yan Fossat, MSc |
author_facet | Jaycee M. Kaufman, MSc Anirudh Thommandram, MASc Yan Fossat, MSc |
author_sort | Jaycee M. Kaufman, MSc |
collection | DOAJ |
description | Objective: To investigate the potential of voice analysis as a prescreening or monitoring tool for type 2 diabetes mellitus (T2DM) by examining the differences in voice recordings between nondiabetic and T2DM individuals. Patients and Methods: Total 267 participants diagnosed as nondiabetic (79 women and 113 men) or T2DM (18 women and 57 men) on the basis of American Diabetes Association guidelines were recruited in India between August 30, 2021 and June 30, 2022. Using a smartphone application, participants recorded a fixed phrase up to 6 times daily for 2 weeks, resulting in 18,465 recordings. Fourteen acoustic features were extracted from each recording to analyze differences between nondiabetic and T2DM individuals and create a prediction methodology for T2DM status. Results: Significant differences were found between voice recordings of nondiabetic and T2DM men and women, both in the entire dataset and in an age-matched and body mass index (BMI [calculated as the weight in kilograms divided by the height in meters squared])-matched sample. The highest predictive accuracy was achieved by pitch (P<.0001), pitch SD (P<.0001), and relative average pertubation jitter (P=.02) for women, and intensity (P<.0001) and 11-point amplitude perturbation quotient shimmer (apq11, P<0.0001) for men. Incorporating these features with age and BMI, the optimal prediction models achieved accuracies of 0.75±0.22 for women and 0.70±0.10 for men through 5-fold cross-validation in the age-matched and BMI-matched sample. Conclusion: Overall, vocal changes occur in individuals with T2DM compared with those without T2DM. Voice analysis shows potential as a prescreening or monitoring tool for T2DM, particularly when combined with other risk factors associated with the condition. Trial Registration: clinicaltrials.gov Identifier: CTRI/2021/08/035957 |
first_indexed | 2024-04-24T17:25:28Z |
format | Article |
id | doaj.art-0541415560ab44a49dc90609968c1fbc |
institution | Directory Open Access Journal |
issn | 2949-7612 |
language | English |
last_indexed | 2024-04-24T17:25:28Z |
publishDate | 2023-12-01 |
publisher | Elsevier |
record_format | Article |
series | Mayo Clinic Proceedings: Digital Health |
spelling | doaj.art-0541415560ab44a49dc90609968c1fbc2024-03-28T06:40:52ZengElsevierMayo Clinic Proceedings: Digital Health2949-76122023-12-0114534544Acoustic Analysis and Prediction of Type 2 Diabetes Mellitus Using Smartphone-Recorded Voice SegmentsJaycee M. Kaufman, MSc0Anirudh Thommandram, MASc1Yan Fossat, MSc2Klick Applied Sciences, Klick Inc, Toronto, Canada; Correspondence: Jaycee M. Kaufman, MSc, 175 Bloor St E Suite 300, Toronto, ON, Canada M4W 3R8.Klick Applied Sciences, Klick Inc, Toronto, CanadaKlick Applied Sciences, Klick Inc, Toronto, Canada; Faculty of Science, Ontario Tech University, Oshawa, CanadaObjective: To investigate the potential of voice analysis as a prescreening or monitoring tool for type 2 diabetes mellitus (T2DM) by examining the differences in voice recordings between nondiabetic and T2DM individuals. Patients and Methods: Total 267 participants diagnosed as nondiabetic (79 women and 113 men) or T2DM (18 women and 57 men) on the basis of American Diabetes Association guidelines were recruited in India between August 30, 2021 and June 30, 2022. Using a smartphone application, participants recorded a fixed phrase up to 6 times daily for 2 weeks, resulting in 18,465 recordings. Fourteen acoustic features were extracted from each recording to analyze differences between nondiabetic and T2DM individuals and create a prediction methodology for T2DM status. Results: Significant differences were found between voice recordings of nondiabetic and T2DM men and women, both in the entire dataset and in an age-matched and body mass index (BMI [calculated as the weight in kilograms divided by the height in meters squared])-matched sample. The highest predictive accuracy was achieved by pitch (P<.0001), pitch SD (P<.0001), and relative average pertubation jitter (P=.02) for women, and intensity (P<.0001) and 11-point amplitude perturbation quotient shimmer (apq11, P<0.0001) for men. Incorporating these features with age and BMI, the optimal prediction models achieved accuracies of 0.75±0.22 for women and 0.70±0.10 for men through 5-fold cross-validation in the age-matched and BMI-matched sample. Conclusion: Overall, vocal changes occur in individuals with T2DM compared with those without T2DM. Voice analysis shows potential as a prescreening or monitoring tool for T2DM, particularly when combined with other risk factors associated with the condition. Trial Registration: clinicaltrials.gov Identifier: CTRI/2021/08/035957http://www.sciencedirect.com/science/article/pii/S2949761223000731 |
spellingShingle | Jaycee M. Kaufman, MSc Anirudh Thommandram, MASc Yan Fossat, MSc Acoustic Analysis and Prediction of Type 2 Diabetes Mellitus Using Smartphone-Recorded Voice Segments Mayo Clinic Proceedings: Digital Health |
title | Acoustic Analysis and Prediction of Type 2 Diabetes Mellitus Using Smartphone-Recorded Voice Segments |
title_full | Acoustic Analysis and Prediction of Type 2 Diabetes Mellitus Using Smartphone-Recorded Voice Segments |
title_fullStr | Acoustic Analysis and Prediction of Type 2 Diabetes Mellitus Using Smartphone-Recorded Voice Segments |
title_full_unstemmed | Acoustic Analysis and Prediction of Type 2 Diabetes Mellitus Using Smartphone-Recorded Voice Segments |
title_short | Acoustic Analysis and Prediction of Type 2 Diabetes Mellitus Using Smartphone-Recorded Voice Segments |
title_sort | acoustic analysis and prediction of type 2 diabetes mellitus using smartphone recorded voice segments |
url | http://www.sciencedirect.com/science/article/pii/S2949761223000731 |
work_keys_str_mv | AT jayceemkaufmanmsc acousticanalysisandpredictionoftype2diabetesmellitususingsmartphonerecordedvoicesegments AT anirudhthommandrammasc acousticanalysisandpredictionoftype2diabetesmellitususingsmartphonerecordedvoicesegments AT yanfossatmsc acousticanalysisandpredictionoftype2diabetesmellitususingsmartphonerecordedvoicesegments |