End-to-End Model-Based Detection of Infants with Autism Spectrum Disorder Using a Pretrained Model

In this paper, we propose an end-to-end (E2E) neural network model to detect autism spectrum disorder (ASD) from children’s voices without explicitly extracting the deterministic features. In order to obtain the decisions for discriminating between the voices of children with ASD and those with typi...

Full description

Bibliographic Details
Main Authors:	Jung Hyuk Lee, Geon Woo Lee, Guiyoung Bong, Hee Jeong Yoo, Hong Kook Kim
Format:	Article
Language:	English
Published:	MDPI AG 2022-12-01
Series:	Sensors
Subjects:	autism spectrum disorder end-to-end neural network pretrained model joint optimization autoencoder bidirectional long short-term memory (BLSTM)
Online Access:	https://www.mdpi.com/1424-8220/23/1/202

_version_	1797439709684695040
author	Jung Hyuk Lee Geon Woo Lee Guiyoung Bong Hee Jeong Yoo Hong Kook Kim
author_facet	Jung Hyuk Lee Geon Woo Lee Guiyoung Bong Hee Jeong Yoo Hong Kook Kim
author_sort	Jung Hyuk Lee
collection	DOAJ
description	In this paper, we propose an end-to-end (E2E) neural network model to detect autism spectrum disorder (ASD) from children’s voices without explicitly extracting the deterministic features. In order to obtain the decisions for discriminating between the voices of children with ASD and those with typical development (TD), we combined two different feature-extraction models and a bidirectional long short-term memory (BLSTM)-based classifier to obtain the ASD/TD classification in the form of probability. We realized one of the feature extractors as the bottleneck feature from an autoencoder using the extended version of the Geneva minimalistic acoustic parameter set (eGeMAPS) input. The other feature extractor is the context vector from a pretrained wav2vec2.0-based model directly applied to the waveform input. In addition, we optimized the E2E models in two different ways: (1) fine-tuning and (2) joint optimization. To evaluate the performance of the proposed E2E models, we prepared two datasets from video recordings of ASD diagnoses collected between 2016 and 2018 at Seoul National University Bundang Hospital (SNUBH), and between 2019 and 2021 at a Living Lab. According to the experimental results, the proposed wav2vec2.0-based E2E model with joint optimization achieved significant improvements in the accuracy and unweighted average recall, from 64.74% to 71.66% and from 65.04% to 70.81%, respectively, compared with a conventional model using autoencoder-based BLSTM and the deterministic features of the eGeMAPS.
first_indexed	2024-03-09T11:57:01Z
format	Article
id	doaj.art-e24dd8ce33574483861e7f9e4396355e
institution	Directory Open Access Journal
issn	1424-8220
language	English
last_indexed	2024-03-09T11:57:01Z
publishDate	2022-12-01
publisher	MDPI AG
record_format	Article
series	Sensors
spelling	doaj.art-e24dd8ce33574483861e7f9e4396355e2023-11-30T23:08:07ZengMDPI AGSensors1424-82202022-12-0123120210.3390/s23010202End-to-End Model-Based Detection of Infants with Autism Spectrum Disorder Using a Pretrained ModelJung Hyuk Lee0Geon Woo Lee1Guiyoung Bong2Hee Jeong Yoo3Hong Kook Kim4School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology, Gwangju 61005, Republic of KoreaAI Graduate School, Gwangju Institute of Science and Technology, Gwangju 61005, Republic of KoreaDepartment of Psychiatry, Seoul National University Bundang Hospital, Seongnam 13620, Republic of KoreaDepartment of Psychiatry, Seoul National University Bundang Hospital, Seongnam 13620, Republic of KoreaSchool of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology, Gwangju 61005, Republic of KoreaIn this paper, we propose an end-to-end (E2E) neural network model to detect autism spectrum disorder (ASD) from children’s voices without explicitly extracting the deterministic features. In order to obtain the decisions for discriminating between the voices of children with ASD and those with typical development (TD), we combined two different feature-extraction models and a bidirectional long short-term memory (BLSTM)-based classifier to obtain the ASD/TD classification in the form of probability. We realized one of the feature extractors as the bottleneck feature from an autoencoder using the extended version of the Geneva minimalistic acoustic parameter set (eGeMAPS) input. The other feature extractor is the context vector from a pretrained wav2vec2.0-based model directly applied to the waveform input. In addition, we optimized the E2E models in two different ways: (1) fine-tuning and (2) joint optimization. To evaluate the performance of the proposed E2E models, we prepared two datasets from video recordings of ASD diagnoses collected between 2016 and 2018 at Seoul National University Bundang Hospital (SNUBH), and between 2019 and 2021 at a Living Lab. According to the experimental results, the proposed wav2vec2.0-based E2E model with joint optimization achieved significant improvements in the accuracy and unweighted average recall, from 64.74% to 71.66% and from 65.04% to 70.81%, respectively, compared with a conventional model using autoencoder-based BLSTM and the deterministic features of the eGeMAPS.https://www.mdpi.com/1424-8220/23/1/202autism spectrum disorderend-to-end neural networkpretrained modeljoint optimizationautoencoderbidirectional long short-term memory (BLSTM)
spellingShingle	Jung Hyuk Lee Geon Woo Lee Guiyoung Bong Hee Jeong Yoo Hong Kook Kim End-to-End Model-Based Detection of Infants with Autism Spectrum Disorder Using a Pretrained Model Sensors autism spectrum disorder end-to-end neural network pretrained model joint optimization autoencoder bidirectional long short-term memory (BLSTM)
title	End-to-End Model-Based Detection of Infants with Autism Spectrum Disorder Using a Pretrained Model
title_full	End-to-End Model-Based Detection of Infants with Autism Spectrum Disorder Using a Pretrained Model
title_fullStr	End-to-End Model-Based Detection of Infants with Autism Spectrum Disorder Using a Pretrained Model
title_full_unstemmed	End-to-End Model-Based Detection of Infants with Autism Spectrum Disorder Using a Pretrained Model
title_short	End-to-End Model-Based Detection of Infants with Autism Spectrum Disorder Using a Pretrained Model
title_sort	end to end model based detection of infants with autism spectrum disorder using a pretrained model
topic	autism spectrum disorder end-to-end neural network pretrained model joint optimization autoencoder bidirectional long short-term memory (BLSTM)
url	https://www.mdpi.com/1424-8220/23/1/202
work_keys_str_mv	AT junghyuklee endtoendmodelbaseddetectionofinfantswithautismspectrumdisorderusingapretrainedmodel AT geonwoolee endtoendmodelbaseddetectionofinfantswithautismspectrumdisorderusingapretrainedmodel AT guiyoungbong endtoendmodelbaseddetectionofinfantswithautismspectrumdisorderusingapretrainedmodel AT heejeongyoo endtoendmodelbaseddetectionofinfantswithautismspectrumdisorderusingapretrainedmodel AT hongkookkim endtoendmodelbaseddetectionofinfantswithautismspectrumdisorderusingapretrainedmodel

End-to-End Model-Based Detection of Infants with Autism Spectrum Disorder Using a Pretrained Model

Similar Items