Classification of vocal fold vibration as regular or irregular in normal, voiced speech

Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2006.

Bibliographic Details
Main Author: Surana, Kushan Krishna
Other Authors: Janet Slifka.
Format: Thesis
Language:eng
Published: Massachusetts Institute of Technology 2007
Subjects:
Online Access:http://hdl.handle.net/1721.1/37104
_version_ 1811074377310011392
author Surana, Kushan Krishna
author2 Janet Slifka.
author_facet Janet Slifka.
Surana, Kushan Krishna
author_sort Surana, Kushan Krishna
collection MIT
description Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2006.
first_indexed 2024-09-23T09:48:06Z
format Thesis
id mit-1721.1/37104
institution Massachusetts Institute of Technology
language eng
last_indexed 2024-09-23T09:48:06Z
publishDate 2007
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/371042019-04-10T09:05:35Z Classification of vocal fold vibration as regular or irregular in normal, voiced speech Surana, Kushan Krishna Janet Slifka. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Electrical Engineering and Computer Science. Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2006. Includes bibliographical references (p. 91-97). Irregular phonation serves an important communicative function in human speech and occurs allophonically in American English. This thesis uses cues from both the temporal and frequency domains - such as fundamental frequency, normalized RMS amplitude, smoothed-energy-difference amplitude (a measure of abruptness in energy variations) and shift-difference amplitude (a measures of periodicity) -to classify segments of regular and irregular phonation in normal, continuous speech. Support Vector Machines (SVMs) are used to classify the tokens as examples of either regular or irregular phonation. The tokens are extracted from the TIMIT database, and are extracted from 151 different speakers. Both genders are well represented, and the tokens occur in various contexts within the utterance. The train-set uses 114 different speakers, while the test-set uses another 37 speakers. A total of 292 of 320 irregular tokens (recognition rate of 91.25% with a false alarm rate of 4.98%), and 4105 of 4320 regular tokens (recognition rate of 95.02% with a false alarm rate of 8.75%) are correctly identified. (cont.) The high recognition rates are an indicator that the set of acoustic cues are robust in accurately identifying a token as regular or irregular, even in cases where one or two acoustic cues show unexpected values. Also, analysis of irregular tokens in the training set (1331 irregular tokens) shows that 78% occur at word boundaries and 5% occur at syllable boundaries. Of the irregular tokens at syllable boundaries, 72% are either at the junction of a compound-word (e.g "outcast;") or at the junction of a base word and a suffix. Of the irregular tokens which do not occur at word or syllable boundaries, 70% occur adjacent to voiceless consonants mostly in utterance-final location. These observations support irregular phonation as a cue for syntactic boundaries in connected speech, and combined with the robust classification results to separate regular phonation from irregular phonation, could be used to improve speech recognition and lexical access models. by Kushan Krishna Surana. M.Eng. 2007-04-03T17:11:57Z 2007-04-03T17:11:57Z 2006 2006 Thesis http://hdl.handle.net/1721.1/37104 84908823 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 97 p. application/pdf Massachusetts Institute of Technology
spellingShingle Electrical Engineering and Computer Science.
Surana, Kushan Krishna
Classification of vocal fold vibration as regular or irregular in normal, voiced speech
title Classification of vocal fold vibration as regular or irregular in normal, voiced speech
title_full Classification of vocal fold vibration as regular or irregular in normal, voiced speech
title_fullStr Classification of vocal fold vibration as regular or irregular in normal, voiced speech
title_full_unstemmed Classification of vocal fold vibration as regular or irregular in normal, voiced speech
title_short Classification of vocal fold vibration as regular or irregular in normal, voiced speech
title_sort classification of vocal fold vibration as regular or irregular in normal voiced speech
topic Electrical Engineering and Computer Science.
url http://hdl.handle.net/1721.1/37104
work_keys_str_mv AT suranakushankrishna classificationofvocalfoldvibrationasregularorirregularinnormalvoicedspeech