Contextual Predictability and Phonetic Reduction

Phonetic reduction is a process which alters the acoustic quality of a sound, often a vowel or word, to a perceived weaker or shorter state. Previous research suggests that the degree of reduction of a word is influenced by the contextual predictability of words in the context. However, the nature o...

Full description

Bibliographic Details
Main Author: Martin, Kinan R.
Other Authors: Levy, Roger
Format: Thesis
Published: Massachusetts Institute of Technology 2024
Online Access:https://hdl.handle.net/1721.1/156991
_version_ 1824458246410207232
author Martin, Kinan R.
author2 Levy, Roger
author_facet Levy, Roger
Martin, Kinan R.
author_sort Martin, Kinan R.
collection MIT
description Phonetic reduction is a process which alters the acoustic quality of a sound, often a vowel or word, to a perceived weaker or shorter state. Previous research suggests that the degree of reduction of a word is influenced by the contextual predictability of words in the context. However, the nature of how the context direction and size governs phonetic reduction has not been thoroughly explored. The advancement of self-supervised language models provides a means to assign meaningful estimates of word predictability conditioned on different contexts. This paper explores the effect of contextual predictability on phonetic reduction making use of such models. We train instances of GPT-2 on different context directions (past, future, and bidirectional) and context sizes (bigram vs. sentence) to provide measures of conditional word predictability, then use linear regression to quantify their correlation with a measure of phonetic reduction (word duration). Our results provide evidence suggesting that the contextual probability of a word given the following context correlates with word duration more strongly than the past context and the bidirectional contexts for both context sizes, suggesting that phonetic reduction may be a reliable indicator of reduced cognitive load in a speaker’s planning of the rest of an utterance.
first_indexed 2025-02-19T04:22:50Z
format Thesis
id mit-1721.1/156991
institution Massachusetts Institute of Technology
last_indexed 2025-02-19T04:22:50Z
publishDate 2024
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/1569912024-09-25T03:29:36Z Contextual Predictability and Phonetic Reduction Martin, Kinan R. Levy, Roger Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences Phonetic reduction is a process which alters the acoustic quality of a sound, often a vowel or word, to a perceived weaker or shorter state. Previous research suggests that the degree of reduction of a word is influenced by the contextual predictability of words in the context. However, the nature of how the context direction and size governs phonetic reduction has not been thoroughly explored. The advancement of self-supervised language models provides a means to assign meaningful estimates of word predictability conditioned on different contexts. This paper explores the effect of contextual predictability on phonetic reduction making use of such models. We train instances of GPT-2 on different context directions (past, future, and bidirectional) and context sizes (bigram vs. sentence) to provide measures of conditional word predictability, then use linear regression to quantify their correlation with a measure of phonetic reduction (word duration). Our results provide evidence suggesting that the contextual probability of a word given the following context correlates with word duration more strongly than the past context and the bidirectional contexts for both context sizes, suggesting that phonetic reduction may be a reliable indicator of reduced cognitive load in a speaker’s planning of the rest of an utterance. M.Eng. 2024-09-24T18:25:24Z 2024-09-24T18:25:24Z 2024-05 2024-07-11T15:31:02.913Z Thesis https://hdl.handle.net/1721.1/156991 Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) Copyright retained by author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/ application/pdf Massachusetts Institute of Technology
spellingShingle Martin, Kinan R.
Contextual Predictability and Phonetic Reduction
title Contextual Predictability and Phonetic Reduction
title_full Contextual Predictability and Phonetic Reduction
title_fullStr Contextual Predictability and Phonetic Reduction
title_full_unstemmed Contextual Predictability and Phonetic Reduction
title_short Contextual Predictability and Phonetic Reduction
title_sort contextual predictability and phonetic reduction
url https://hdl.handle.net/1721.1/156991
work_keys_str_mv AT martinkinanr contextualpredictabilityandphoneticreduction