Characterizing phonetic transformations and fine-grained acoustic differences across dialects

Thesis (Ph. D.)--Harvard-MIT Division of Health Sciences and Technology, 2011.

Bibliographic Details
Main Author: Chen, Nancy Fang-Yih
Other Authors: Joseph P. Campbell.
Format: Thesis
Language:eng
Published: Massachusetts Institute of Technology 2011
Subjects:
Online Access:http://hdl.handle.net/1721.1/65514
_version_ 1826210587144617984
author Chen, Nancy Fang-Yih
author2 Joseph P. Campbell.
author_facet Joseph P. Campbell.
Chen, Nancy Fang-Yih
author_sort Chen, Nancy Fang-Yih
collection MIT
description Thesis (Ph. D.)--Harvard-MIT Division of Health Sciences and Technology, 2011.
first_indexed 2024-09-23T14:52:01Z
format Thesis
id mit-1721.1/65514
institution Massachusetts Institute of Technology
language eng
last_indexed 2024-09-23T14:52:01Z
publishDate 2011
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/655142019-04-14T07:23:52Z Characterizing phonetic transformations and fine-grained acoustic differences across dialects Chen, Nancy Fang-Yih Joseph P. Campbell. Harvard University--MIT Division of Health Sciences and Technology. Harvard University--MIT Division of Health Sciences and Technology. Harvard University--MIT Division of Health Sciences and Technology. Thesis (Ph. D.)--Harvard-MIT Division of Health Sciences and Technology, 2011. Cataloged from PDF version of thesis. Includes bibliographical references (p. 169-175). This thesis is motivated by the gaps between speech science and technology in analyzing dialects. In speech science, investigating phonetic rules is usually manually laborious and time consuming, limiting the amount of data analyzed. Without sufficient data, the analysis could potentially overlook or over-specify certain phonetic rules. On the other hand, in speech technology such as automatic dialect recognition, phonetic rules are rarely modeled explicitly. While many applications do not require such knowledge to obtain good performance, it is beneficial to specifically model pronunciation patterns in certain applications. For example, users of language learning software can benefit from explicit and intuitive feedback from the computer to alter their pronunciation; in forensic phonetics, it is important that results of automated systems are justifiable on phonetic grounds. In this work, we propose a mathematical framework to analyze dialects in terms of (1) phonetic transformations and (2) acoustic differences. The proposed Phonetic based Pronunciation Model (PPM) uses a hidden Markov model to characterize when and how often substitutions, insertions, and deletions occur. In particular, clustering methods are compared to better model deletion transformations. In addition, an acoustic counterpart of PPM, Acoustic-based Pronunciation Model (APM), is proposed to characterize and locate fine-grained acoustic differences such as formant transitions and nasalization across dialects. We used three data sets to empirically compare the proposed models in Arabic and English dialects. Results in automatic dialect recognition demonstrate that the proposed models complement standard baseline systems. Results in pronunciation generation and rule retrieval experiments indicate that the proposed models learn underlying phonetic rules across dialects. Our proposed system postulates pronunciation rules to a phonetician who interprets and refines them to discover new rules or quantify known rules. This can be done on large corpora to develop rules of greater statistical significance than has previously been possible. Potential applications of this work include speaker characterization and recognition, automatic dialect recognition, automatic speech recognition and synthesis, forensic phonetics, language learning or accent training education, and assistive diagnosis tools for speech and voice disorders. by Nancy Fang-Yih Chen. Ph.D. 2011-08-30T15:44:42Z 2011-08-30T15:44:42Z 2011 2011 Thesis http://hdl.handle.net/1721.1/65514 746791110 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 175 p. application/pdf Massachusetts Institute of Technology
spellingShingle Harvard University--MIT Division of Health Sciences and Technology.
Chen, Nancy Fang-Yih
Characterizing phonetic transformations and fine-grained acoustic differences across dialects
title Characterizing phonetic transformations and fine-grained acoustic differences across dialects
title_full Characterizing phonetic transformations and fine-grained acoustic differences across dialects
title_fullStr Characterizing phonetic transformations and fine-grained acoustic differences across dialects
title_full_unstemmed Characterizing phonetic transformations and fine-grained acoustic differences across dialects
title_short Characterizing phonetic transformations and fine-grained acoustic differences across dialects
title_sort characterizing phonetic transformations and fine grained acoustic differences across dialects
topic Harvard University--MIT Division of Health Sciences and Technology.
url http://hdl.handle.net/1721.1/65514
work_keys_str_mv AT chennancyfangyih characterizingphonetictransformationsandfinegrainedacousticdifferencesacrossdialects