A New Regression Model for Depression Severity Prediction Based on Correlation among Audio Features Using a Graph Convolutional Neural Network

Recent studies have revealed mutually correlated audio features in the voices of depressed patients. Thus, the voices of these patients can be characterized based on the combinatorial relationships among the audio features. To date, many deep learning–based methods have been proposed to predict the...

Full description

Bibliographic Details
Main Authors: Momoko Ishimaru, Yoshifumi Okada, Ryunosuke Uchiyama, Ryo Horiguchi, Itsuki Toyoshima
Format: Article
Language:English
Published: MDPI AG 2023-02-01
Series:Diagnostics
Subjects:
Online Access:https://www.mdpi.com/2075-4418/13/4/727
_version_ 1827757824405405696
author Momoko Ishimaru
Yoshifumi Okada
Ryunosuke Uchiyama
Ryo Horiguchi
Itsuki Toyoshima
author_facet Momoko Ishimaru
Yoshifumi Okada
Ryunosuke Uchiyama
Ryo Horiguchi
Itsuki Toyoshima
author_sort Momoko Ishimaru
collection DOAJ
description Recent studies have revealed mutually correlated audio features in the voices of depressed patients. Thus, the voices of these patients can be characterized based on the combinatorial relationships among the audio features. To date, many deep learning–based methods have been proposed to predict the depression severity using audio data. However, existing methods have assumed that the individual audio features are independent. Hence, in this paper, we propose a new deep learning–based regression model that allows for the prediction of depression severity on the basis of the correlation among audio features. The proposed model was developed using a graph convolutional neural network. This model trains the voice characteristics using graph-structured data generated to express the correlation among audio features. We conducted prediction experiments on depression severity using the DAIC-WOZ dataset employed in several previous studies. The experimental results showed that the proposed model achieved a root mean square error (RMSE) of 2.15, a mean absolute error (MAE) of 1.25, and a symmetric mean absolute percentage error of 50.96%. Notably, RMSE and MAE significantly outperformed the existing state-of-the-art prediction methods. From these results, we conclude that the proposed model can be a promising tool for depression diagnosis.
first_indexed 2024-03-11T08:56:57Z
format Article
id doaj.art-67ff5917e0c3475ca10a07f3d10e61d5
institution Directory Open Access Journal
issn 2075-4418
language English
last_indexed 2024-03-11T08:56:57Z
publishDate 2023-02-01
publisher MDPI AG
record_format Article
series Diagnostics
spelling doaj.art-67ff5917e0c3475ca10a07f3d10e61d52023-11-16T20:02:03ZengMDPI AGDiagnostics2075-44182023-02-0113472710.3390/diagnostics13040727A New Regression Model for Depression Severity Prediction Based on Correlation among Audio Features Using a Graph Convolutional Neural NetworkMomoko Ishimaru0Yoshifumi Okada1Ryunosuke Uchiyama2Ryo Horiguchi3Itsuki Toyoshima4Division of Information and Electronic Engineering, Muroran Institute of Technology, 27-1, Mizumoto-cho, Muroran 050-8585, JapanCollege of Information and Systems, Muroran Institute of Technology, 27-1, Mizumoto-cho, Muroran 050-8585, JapanDivision of Information and Electronic Engineering, Muroran Institute of Technology, 27-1, Mizumoto-cho, Muroran 050-8585, JapanDivision of Information and Electronic Engineering, Muroran Institute of Technology, 27-1, Mizumoto-cho, Muroran 050-8585, JapanDivision of Information and Electronic Engineering, Muroran Institute of Technology, 27-1, Mizumoto-cho, Muroran 050-8585, JapanRecent studies have revealed mutually correlated audio features in the voices of depressed patients. Thus, the voices of these patients can be characterized based on the combinatorial relationships among the audio features. To date, many deep learning–based methods have been proposed to predict the depression severity using audio data. However, existing methods have assumed that the individual audio features are independent. Hence, in this paper, we propose a new deep learning–based regression model that allows for the prediction of depression severity on the basis of the correlation among audio features. The proposed model was developed using a graph convolutional neural network. This model trains the voice characteristics using graph-structured data generated to express the correlation among audio features. We conducted prediction experiments on depression severity using the DAIC-WOZ dataset employed in several previous studies. The experimental results showed that the proposed model achieved a root mean square error (RMSE) of 2.15, a mean absolute error (MAE) of 1.25, and a symmetric mean absolute percentage error of 50.96%. Notably, RMSE and MAE significantly outperformed the existing state-of-the-art prediction methods. From these results, we conclude that the proposed model can be a promising tool for depression diagnosis.https://www.mdpi.com/2075-4418/13/4/727audio featuredepressionregression modelcorrelationgraph convolutional neural network
spellingShingle Momoko Ishimaru
Yoshifumi Okada
Ryunosuke Uchiyama
Ryo Horiguchi
Itsuki Toyoshima
A New Regression Model for Depression Severity Prediction Based on Correlation among Audio Features Using a Graph Convolutional Neural Network
Diagnostics
audio feature
depression
regression model
correlation
graph convolutional neural network
title A New Regression Model for Depression Severity Prediction Based on Correlation among Audio Features Using a Graph Convolutional Neural Network
title_full A New Regression Model for Depression Severity Prediction Based on Correlation among Audio Features Using a Graph Convolutional Neural Network
title_fullStr A New Regression Model for Depression Severity Prediction Based on Correlation among Audio Features Using a Graph Convolutional Neural Network
title_full_unstemmed A New Regression Model for Depression Severity Prediction Based on Correlation among Audio Features Using a Graph Convolutional Neural Network
title_short A New Regression Model for Depression Severity Prediction Based on Correlation among Audio Features Using a Graph Convolutional Neural Network
title_sort new regression model for depression severity prediction based on correlation among audio features using a graph convolutional neural network
topic audio feature
depression
regression model
correlation
graph convolutional neural network
url https://www.mdpi.com/2075-4418/13/4/727
work_keys_str_mv AT momokoishimaru anewregressionmodelfordepressionseveritypredictionbasedoncorrelationamongaudiofeaturesusingagraphconvolutionalneuralnetwork
AT yoshifumiokada anewregressionmodelfordepressionseveritypredictionbasedoncorrelationamongaudiofeaturesusingagraphconvolutionalneuralnetwork
AT ryunosukeuchiyama anewregressionmodelfordepressionseveritypredictionbasedoncorrelationamongaudiofeaturesusingagraphconvolutionalneuralnetwork
AT ryohoriguchi anewregressionmodelfordepressionseveritypredictionbasedoncorrelationamongaudiofeaturesusingagraphconvolutionalneuralnetwork
AT itsukitoyoshima anewregressionmodelfordepressionseveritypredictionbasedoncorrelationamongaudiofeaturesusingagraphconvolutionalneuralnetwork
AT momokoishimaru newregressionmodelfordepressionseveritypredictionbasedoncorrelationamongaudiofeaturesusingagraphconvolutionalneuralnetwork
AT yoshifumiokada newregressionmodelfordepressionseveritypredictionbasedoncorrelationamongaudiofeaturesusingagraphconvolutionalneuralnetwork
AT ryunosukeuchiyama newregressionmodelfordepressionseveritypredictionbasedoncorrelationamongaudiofeaturesusingagraphconvolutionalneuralnetwork
AT ryohoriguchi newregressionmodelfordepressionseveritypredictionbasedoncorrelationamongaudiofeaturesusingagraphconvolutionalneuralnetwork
AT itsukitoyoshima newregressionmodelfordepressionseveritypredictionbasedoncorrelationamongaudiofeaturesusingagraphconvolutionalneuralnetwork