Language learning at scale: Data‐driven and model‐motivated analyses of lexical and morphological development
Studying how children learn language gives complementary insights into both children and language. Given that languages have the properties that they do, what does children’s ability to learn them tell us about cognitive development? And given that children have the cognitive capacities that they do...
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis |
Published: |
Massachusetts Institute of Technology
2022
|
Online Access: | https://hdl.handle.net/1721.1/143365 |
_version_ | 1826196127497584640 |
---|---|
author | Braginsky, Mika |
author2 | Gibson, Edward |
author_facet | Gibson, Edward Braginsky, Mika |
author_sort | Braginsky, Mika |
collection | MIT |
description | Studying how children learn language gives complementary insights into both children and language. Given that languages have the properties that they do, what does children’s ability to learn them tell us about cognitive development? And given that children have the cognitive capacities that they do, what does the learnability of language tell us about its properties?
I present a series of large‐scale investigations of language learning, using dense datasets and computational models to support generalization across children, over development, among languages, and while distinguishing among theories. In the initial investigation (Chapter 2), we ask: what makes words harder or easier to learn, and what does that reveal about word learning mechanisms? We examined the factors that contribute to individual words’ learning trajectories, and the consistency of those factors across languages. This work establishes a framework for conducting analyses of large‐scale language learning data in a way that brings together disparate data sources, generalizes across languages, and tracks change over development.
For subsequent investigations (Chapter 3‐4), I focus on the study of morphology learning, from various directions. Learning morphology is a particularly interesting problem because it straight‐ forwardly involves both memorizing specific forms and generalizing beyond direct experience. It is thus a fruitful case study for the interplay between mechanisms of memory and inference, which is fundamental both in language and in cognition more generally.
Chapter 3 applies the datasets and approaches developed in Chapter 1 to the domain of morphology, investigating how morphology learning relates to age, vocabulary development, and phono‐ logical structure. Chapter 4 delves further into morphology learning by applying the abstraction and rigor of computational modeling. Finally, I propose a series of studies to use a novel data col‐ lection method to create a dense dataset on morphological development and evaluate theories of morphological development.
Taken together, these studies synthesize empirical and computational methods to investigate multiple domains of language development at scale. Focusing on lexical and morphological development, they clarify and enhance the empirical and theoretical landscape of language learning. |
first_indexed | 2024-09-23T10:21:45Z |
format | Thesis |
id | mit-1721.1/143365 |
institution | Massachusetts Institute of Technology |
last_indexed | 2024-09-23T10:21:45Z |
publishDate | 2022 |
publisher | Massachusetts Institute of Technology |
record_format | dspace |
spelling | mit-1721.1/1433652022-06-16T03:42:45Z Language learning at scale: Data‐driven and model‐motivated analyses of lexical and morphological development Braginsky, Mika Gibson, Edward Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences Studying how children learn language gives complementary insights into both children and language. Given that languages have the properties that they do, what does children’s ability to learn them tell us about cognitive development? And given that children have the cognitive capacities that they do, what does the learnability of language tell us about its properties? I present a series of large‐scale investigations of language learning, using dense datasets and computational models to support generalization across children, over development, among languages, and while distinguishing among theories. In the initial investigation (Chapter 2), we ask: what makes words harder or easier to learn, and what does that reveal about word learning mechanisms? We examined the factors that contribute to individual words’ learning trajectories, and the consistency of those factors across languages. This work establishes a framework for conducting analyses of large‐scale language learning data in a way that brings together disparate data sources, generalizes across languages, and tracks change over development. For subsequent investigations (Chapter 3‐4), I focus on the study of morphology learning, from various directions. Learning morphology is a particularly interesting problem because it straight‐ forwardly involves both memorizing specific forms and generalizing beyond direct experience. It is thus a fruitful case study for the interplay between mechanisms of memory and inference, which is fundamental both in language and in cognition more generally. Chapter 3 applies the datasets and approaches developed in Chapter 1 to the domain of morphology, investigating how morphology learning relates to age, vocabulary development, and phono‐ logical structure. Chapter 4 delves further into morphology learning by applying the abstraction and rigor of computational modeling. Finally, I propose a series of studies to use a novel data col‐ lection method to create a dense dataset on morphological development and evaluate theories of morphological development. Taken together, these studies synthesize empirical and computational methods to investigate multiple domains of language development at scale. Focusing on lexical and morphological development, they clarify and enhance the empirical and theoretical landscape of language learning. Ph.D. 2022-06-15T13:15:32Z 2022-06-15T13:15:32Z 2022-02 2022-03-17T13:27:34.790Z Thesis https://hdl.handle.net/1721.1/143365 In Copyright - Educational Use Permitted Copyright retained by author(s) https://rightsstatements.org/page/InC-EDU/1.0/ application/pdf Massachusetts Institute of Technology |
spellingShingle | Braginsky, Mika Language learning at scale: Data‐driven and model‐motivated analyses of lexical and morphological development |
title | Language learning at scale: Data‐driven and model‐motivated analyses of lexical and morphological development |
title_full | Language learning at scale: Data‐driven and model‐motivated analyses of lexical and morphological development |
title_fullStr | Language learning at scale: Data‐driven and model‐motivated analyses of lexical and morphological development |
title_full_unstemmed | Language learning at scale: Data‐driven and model‐motivated analyses of lexical and morphological development |
title_short | Language learning at scale: Data‐driven and model‐motivated analyses of lexical and morphological development |
title_sort | language learning at scale data driven and model motivated analyses of lexical and morphological development |
url | https://hdl.handle.net/1721.1/143365 |
work_keys_str_mv | AT braginskymika languagelearningatscaledatadrivenandmodelmotivatedanalysesoflexicalandmorphologicaldevelopment |