Research on Speech Under Stress Based on Glottal Source Using a Physical Speech Production Model

Speech recognition accuracy is severely reduced by the variability caused by stress. Considering the fact that speech under stress is caused by the physiological changes of the vocal folds in the physiological system whose vibration behavior is reflected by glottal flow, this paper presents a method...

Full description

Bibliographic Details
Main Authors: Xiao Yao, Ning Xu, Xiaofeng Liu, Aimin Jiang, Xuewu Zhang
Format: Article
Language:English
Published: IEEE 2018-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8421212/
_version_ 1818877143165173760
author Xiao Yao
Ning Xu
Xiaofeng Liu
Aimin Jiang
Xuewu Zhang
author_facet Xiao Yao
Ning Xu
Xiaofeng Liu
Aimin Jiang
Xuewu Zhang
author_sort Xiao Yao
collection DOAJ
description Speech recognition accuracy is severely reduced by the variability caused by stress. Considering the fact that speech under stress is caused by the physiological changes of the vocal folds in the physiological system whose vibration behavior is reflected by glottal flow, this paper presents a method of study on speech under stress based on a physical speech production model and characteristics of glottal flow. The physical model is used to model glottal aerodynamics in the vocal system to represent speech production. The relationship between physical parameters and glottal flow parameters is explored based on the physical model, and the glottal source and physical model are linked. Through studying on the glottal and physical parameters for the neutral and for the speech under stress, features for speech under stress characterizing the vocal folds, vortex-flow interaction, and shape of glottal flow are compared with those of neutral speech. The relations between the proposed parameters and stress-speech production mechanism are discussed. Experiments show that physical parameters representing the stiffness and viscosity of vocal folds, subglottal pressure, and laryngeal ventricle strongly influence the glottal flow. The relations for physical parameters, glottal parameters, and stress production are revealed, and theoretical and experimental bases are provided for stress detection and classification in speech recognition system.
first_indexed 2024-12-19T13:53:36Z
format Article
id doaj.art-b2425d6b255d4aae84c7f63929c313ff
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-19T13:53:36Z
publishDate 2018-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-b2425d6b255d4aae84c7f63929c313ff2022-12-21T20:18:38ZengIEEEIEEE Access2169-35362018-01-016444734448210.1109/ACCESS.2018.28601308421212Research on Speech Under Stress Based on Glottal Source Using a Physical Speech Production ModelXiao Yao0https://orcid.org/0000-0003-4014-1708Ning Xu1Xiaofeng Liu2Aimin Jiang3Xuewu Zhang4College of Internet of Things Engineering, Hohai University, Changzhou, ChinaCollege of Internet of Things Engineering, Hohai University, Changzhou, ChinaCollege of Internet of Things Engineering, Hohai University, Changzhou, ChinaCollege of Internet of Things Engineering, Hohai University, Changzhou, ChinaCollege of Internet of Things Engineering, Hohai University, Changzhou, ChinaSpeech recognition accuracy is severely reduced by the variability caused by stress. Considering the fact that speech under stress is caused by the physiological changes of the vocal folds in the physiological system whose vibration behavior is reflected by glottal flow, this paper presents a method of study on speech under stress based on a physical speech production model and characteristics of glottal flow. The physical model is used to model glottal aerodynamics in the vocal system to represent speech production. The relationship between physical parameters and glottal flow parameters is explored based on the physical model, and the glottal source and physical model are linked. Through studying on the glottal and physical parameters for the neutral and for the speech under stress, features for speech under stress characterizing the vocal folds, vortex-flow interaction, and shape of glottal flow are compared with those of neutral speech. The relations between the proposed parameters and stress-speech production mechanism are discussed. Experiments show that physical parameters representing the stiffness and viscosity of vocal folds, subglottal pressure, and laryngeal ventricle strongly influence the glottal flow. The relations for physical parameters, glottal parameters, and stress production are revealed, and theoretical and experimental bases are provided for stress detection and classification in speech recognition system.https://ieeexplore.ieee.org/document/8421212/Speech under stressphysical characteristicsglottal flowthe vocal foldsphysical model
spellingShingle Xiao Yao
Ning Xu
Xiaofeng Liu
Aimin Jiang
Xuewu Zhang
Research on Speech Under Stress Based on Glottal Source Using a Physical Speech Production Model
IEEE Access
Speech under stress
physical characteristics
glottal flow
the vocal folds
physical model
title Research on Speech Under Stress Based on Glottal Source Using a Physical Speech Production Model
title_full Research on Speech Under Stress Based on Glottal Source Using a Physical Speech Production Model
title_fullStr Research on Speech Under Stress Based on Glottal Source Using a Physical Speech Production Model
title_full_unstemmed Research on Speech Under Stress Based on Glottal Source Using a Physical Speech Production Model
title_short Research on Speech Under Stress Based on Glottal Source Using a Physical Speech Production Model
title_sort research on speech under stress based on glottal source using a physical speech production model
topic Speech under stress
physical characteristics
glottal flow
the vocal folds
physical model
url https://ieeexplore.ieee.org/document/8421212/
work_keys_str_mv AT xiaoyao researchonspeechunderstressbasedonglottalsourceusingaphysicalspeechproductionmodel
AT ningxu researchonspeechunderstressbasedonglottalsourceusingaphysicalspeechproductionmodel
AT xiaofengliu researchonspeechunderstressbasedonglottalsourceusingaphysicalspeechproductionmodel
AT aiminjiang researchonspeechunderstressbasedonglottalsourceusingaphysicalspeechproductionmodel
AT xuewuzhang researchonspeechunderstressbasedonglottalsourceusingaphysicalspeechproductionmodel