Research on Speech Under Stress Based on Glottal Source Using a Physical Speech Production Model
Speech recognition accuracy is severely reduced by the variability caused by stress. Considering the fact that speech under stress is caused by the physiological changes of the vocal folds in the physiological system whose vibration behavior is reflected by glottal flow, this paper presents a method...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2018-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8421212/ |
_version_ | 1818877143165173760 |
---|---|
author | Xiao Yao Ning Xu Xiaofeng Liu Aimin Jiang Xuewu Zhang |
author_facet | Xiao Yao Ning Xu Xiaofeng Liu Aimin Jiang Xuewu Zhang |
author_sort | Xiao Yao |
collection | DOAJ |
description | Speech recognition accuracy is severely reduced by the variability caused by stress. Considering the fact that speech under stress is caused by the physiological changes of the vocal folds in the physiological system whose vibration behavior is reflected by glottal flow, this paper presents a method of study on speech under stress based on a physical speech production model and characteristics of glottal flow. The physical model is used to model glottal aerodynamics in the vocal system to represent speech production. The relationship between physical parameters and glottal flow parameters is explored based on the physical model, and the glottal source and physical model are linked. Through studying on the glottal and physical parameters for the neutral and for the speech under stress, features for speech under stress characterizing the vocal folds, vortex-flow interaction, and shape of glottal flow are compared with those of neutral speech. The relations between the proposed parameters and stress-speech production mechanism are discussed. Experiments show that physical parameters representing the stiffness and viscosity of vocal folds, subglottal pressure, and laryngeal ventricle strongly influence the glottal flow. The relations for physical parameters, glottal parameters, and stress production are revealed, and theoretical and experimental bases are provided for stress detection and classification in speech recognition system. |
first_indexed | 2024-12-19T13:53:36Z |
format | Article |
id | doaj.art-b2425d6b255d4aae84c7f63929c313ff |
institution | Directory Open Access Journal |
issn | 2169-3536 |
language | English |
last_indexed | 2024-12-19T13:53:36Z |
publishDate | 2018-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj.art-b2425d6b255d4aae84c7f63929c313ff2022-12-21T20:18:38ZengIEEEIEEE Access2169-35362018-01-016444734448210.1109/ACCESS.2018.28601308421212Research on Speech Under Stress Based on Glottal Source Using a Physical Speech Production ModelXiao Yao0https://orcid.org/0000-0003-4014-1708Ning Xu1Xiaofeng Liu2Aimin Jiang3Xuewu Zhang4College of Internet of Things Engineering, Hohai University, Changzhou, ChinaCollege of Internet of Things Engineering, Hohai University, Changzhou, ChinaCollege of Internet of Things Engineering, Hohai University, Changzhou, ChinaCollege of Internet of Things Engineering, Hohai University, Changzhou, ChinaCollege of Internet of Things Engineering, Hohai University, Changzhou, ChinaSpeech recognition accuracy is severely reduced by the variability caused by stress. Considering the fact that speech under stress is caused by the physiological changes of the vocal folds in the physiological system whose vibration behavior is reflected by glottal flow, this paper presents a method of study on speech under stress based on a physical speech production model and characteristics of glottal flow. The physical model is used to model glottal aerodynamics in the vocal system to represent speech production. The relationship between physical parameters and glottal flow parameters is explored based on the physical model, and the glottal source and physical model are linked. Through studying on the glottal and physical parameters for the neutral and for the speech under stress, features for speech under stress characterizing the vocal folds, vortex-flow interaction, and shape of glottal flow are compared with those of neutral speech. The relations between the proposed parameters and stress-speech production mechanism are discussed. Experiments show that physical parameters representing the stiffness and viscosity of vocal folds, subglottal pressure, and laryngeal ventricle strongly influence the glottal flow. The relations for physical parameters, glottal parameters, and stress production are revealed, and theoretical and experimental bases are provided for stress detection and classification in speech recognition system.https://ieeexplore.ieee.org/document/8421212/Speech under stressphysical characteristicsglottal flowthe vocal foldsphysical model |
spellingShingle | Xiao Yao Ning Xu Xiaofeng Liu Aimin Jiang Xuewu Zhang Research on Speech Under Stress Based on Glottal Source Using a Physical Speech Production Model IEEE Access Speech under stress physical characteristics glottal flow the vocal folds physical model |
title | Research on Speech Under Stress Based on Glottal Source Using a Physical Speech Production Model |
title_full | Research on Speech Under Stress Based on Glottal Source Using a Physical Speech Production Model |
title_fullStr | Research on Speech Under Stress Based on Glottal Source Using a Physical Speech Production Model |
title_full_unstemmed | Research on Speech Under Stress Based on Glottal Source Using a Physical Speech Production Model |
title_short | Research on Speech Under Stress Based on Glottal Source Using a Physical Speech Production Model |
title_sort | research on speech under stress based on glottal source using a physical speech production model |
topic | Speech under stress physical characteristics glottal flow the vocal folds physical model |
url | https://ieeexplore.ieee.org/document/8421212/ |
work_keys_str_mv | AT xiaoyao researchonspeechunderstressbasedonglottalsourceusingaphysicalspeechproductionmodel AT ningxu researchonspeechunderstressbasedonglottalsourceusingaphysicalspeechproductionmodel AT xiaofengliu researchonspeechunderstressbasedonglottalsourceusingaphysicalspeechproductionmodel AT aiminjiang researchonspeechunderstressbasedonglottalsourceusingaphysicalspeechproductionmodel AT xuewuzhang researchonspeechunderstressbasedonglottalsourceusingaphysicalspeechproductionmodel |