Why Does Surprisal From Larger Transformer-Based Language Models Provide a Poorer Fit to Human Reading Times?

Abstract: This work presents a linguistic analysis into why larger Transformer-based pre-trained language models with more parameters and lower perplexity nonetheless yield surprisal estimates that are less predictive of human reading times. First, regression analyses show a strictly m...
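As background for the analyses summarized above, the following minimal sketch (illustrative only, not the authors' code; the "gpt2" checkpoint, the example sentence, and the nats-to-bits conversion are assumptions) shows one common way to obtain per-token surprisal estimates from a pre-trained Transformer language model using the Hugging Face transformers library.

```python
# Minimal sketch (illustrative, not the authors' code): per-token surprisal
# from a pre-trained Transformer LM via the Hugging Face transformers library.
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")   # assumed model choice
model = GPT2LMHeadModel.from_pretrained("gpt2").eval()

text = "The horse raced past the barn fell."            # assumed example sentence
ids = tokenizer(text, return_tensors="pt").input_ids    # shape [1, seq_len]

with torch.no_grad():
    logits = model(ids).logits                          # shape [1, seq_len, vocab]

# Surprisal of token t is -log2 P(w_t | w_<t); the first token has no left context.
log_probs = torch.log_softmax(logits, dim=-1)
targets = ids[0, 1:]                                    # tokens being predicted
surprisal_bits = (
    -log_probs[0, :-1].gather(1, targets.unsqueeze(1)).squeeze(1)
    / torch.log(torch.tensor(2.0))
)

for tok, s in zip(tokenizer.convert_ids_to_tokens(targets.tolist()), surprisal_bits.tolist()):
    print(f"{tok:>12s}  {s:6.2f} bits")
```

Token-level surprisals of this kind are the predictors that regression analyses of reading times typically use; a larger model variant would be substituted simply by changing the checkpoint name.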


Bibliographic Details
Main Authors: Byung-Doh Oh, William Schuler
Format: Article
Language: English
Published: The MIT Press 2023-01-01
Series: Transactions of the Association for Computational Linguistics
Online Access: https://direct.mit.edu/tacl/article/doi/10.1162/tacl_a_00548/115371/Why-Does-Surprisal-From-Larger-Transformer-Based

Similar Items