Cognitive and communicative pressures in natural language

Thesis: Ph. D., Massachusetts Institute of Technology, Department of Brain and Cognitive Sciences, 2016.

Bibliographic Details
Main Author: Mahowald, Kyle
Other Authors: Edward Gibson.
Format: Thesis
Language:eng
Published: Massachusetts Institute of Technology 2017
Subjects:
Online Access:http://hdl.handle.net/1721.1/106435
_version_ 1811080158409392128
author Mahowald, Kyle
author2 Edward Gibson.
author_facet Edward Gibson.
Mahowald, Kyle
author_sort Mahowald, Kyle
collection MIT
description Thesis: Ph. D., Massachusetts Institute of Technology, Department of Brain and Cognitive Sciences, 2016.
first_indexed 2024-09-23T11:26:47Z
format Thesis
id mit-1721.1/106435
institution Massachusetts Institute of Technology
language eng
last_indexed 2024-09-23T11:26:47Z
publishDate 2017
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/1064352019-04-10T18:19:33Z Cognitive and communicative pressures in natural language Mahowald, Kyle Edward Gibson. Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences. Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences. Brain and Cognitive Sciences. Thesis: Ph. D., Massachusetts Institute of Technology, Department of Brain and Cognitive Sciences, 2016. Cataloged from PDF version of thesis. Includes bibliographical references (pages 189-204). Why do languages have the words they do instead of some other set of words? In the first part of this thesis, I argue that cognitive and communicative demands strongly influence the structure of the lexicons of natural languages. It is known that words in natural language are distributed such that shorter words are more frequent and occur after more predictive contexts. I provide evidence that, at least in part, this pattern is driven by word shortenings (i.e., chimp -+ chimpanzee) and that word shortenings can be predicted by principles of efficient communication. I also show that, using nonce words with no pre-existing semantic meaning, a Zipfian correlation between length and frequency emerges in freely produced text and that this correlation is driven by participants' tendency to reuse short words more readily than longer words. In addition to word length, I investigate phonetic probability in a corpus of 97 languages. Across a wide variety of languages and language families, phonetic forms are optimized for efficient communication. And, using baseline phonetic models, I show that the words in the lexicons of four languages (English, Dutch, German, and French) are more tightly clustered in phonetic space than would be suggested by chance alone. This thesis depends on standard methods in language research. How reliable is the data that we work with as a field? In the second part of this thesis, I tackle that question by examining two dominant methods in modern language research: behavioral experiments (specifically syntactic priming) and linguistic acceptability judgments. I present data, based on large-scale surveys, showing that many of the standard syntactic and semantic judgments in a mainstream linguistic journal are flawed. Using this data, I construct a Bayesian prior over judgments and give recommendations for performing small sample-size experiments in linguistics that will not overly burden researchers. Finally, I present a large-scale meta-analysis of syntactic priming (the largest meta-analysis of a psycholinguistic phenomenon) and find that, while many priming studies are severely underpowered, there is no evidence of intense p-hacking. by Kyle Mahowald. Ph. D. 2017-01-12T18:33:23Z 2017-01-12T18:33:23Z 2016 2016 Thesis http://hdl.handle.net/1721.1/106435 967340760 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 204 pages application/pdf Massachusetts Institute of Technology
spellingShingle Brain and Cognitive Sciences.
Mahowald, Kyle
Cognitive and communicative pressures in natural language
title Cognitive and communicative pressures in natural language
title_full Cognitive and communicative pressures in natural language
title_fullStr Cognitive and communicative pressures in natural language
title_full_unstemmed Cognitive and communicative pressures in natural language
title_short Cognitive and communicative pressures in natural language
title_sort cognitive and communicative pressures in natural language
topic Brain and Cognitive Sciences.
url http://hdl.handle.net/1721.1/106435
work_keys_str_mv AT mahowaldkyle cognitiveandcommunicativepressuresinnaturallanguage