Hong Kong Corpus of Chinese Sentence and Passage Reading

Abstract Recent years have witnessed a mushrooming of reading corpora that have been built by means of eye tracking. This article showcases the Hong Kong Corpus of Chinese Sentence and Passage Reading (HKC for brevity), featured by a natural reading of logographic scripts and unspaced words. It rele...

Bibliographic Details
Main Authors: Yushu Wu, Chunyu Kit
Format: Article
Language: English
Published: Nature Portfolio 2023-12-01
Series: Scientific Data
Online Access: https://doi.org/10.1038/s41597-023-02813-9
author Yushu Wu
Chunyu Kit
collection DOAJ
description Abstract: Recent years have witnessed a mushrooming of reading corpora built by means of eye tracking. This article presents the Hong Kong Corpus of Chinese Sentence and Passage Reading (HKC for brevity), which features natural reading of a logographic script with unspaced words. It releases 28 eye-movement measures from 98 native speakers reading simplified Chinese in two scenarios: 300 one-line single sentences and 7 multiline passages, of 5,250 and 4,967 word tokens, respectively. To verify its validity and reusability, we carried out (generalised) linear mixed-effects modelling of how well visual complexity, word frequency, and reading scenario predict eye-movement measures. The results show significant effects of these typical (sub)lexical factors on eye movements, replicating previous findings and yielding novel ones. The HKC provides a valuable resource for exploring eye movement control; the study contrasts the single-sentence and passage reading scenarios in the hope of shedding new light on both the universal nature of reading and the unique characteristics of Chinese reading.
first_indexed 2024-03-08T22:42:13Z
format Article
id doaj.art-4b7f80c19a6a478083d69d18ba4abe9b
institution Directory Open Access Journal
issn 2052-4463
language English
last_indexed 2024-03-08T22:42:13Z
publishDate 2023-12-01
publisher Nature Portfolio
record_format Article
series Scientific Data
spelling doaj.art-4b7f80c19a6a478083d69d18ba4abe9b; 2023-12-17T12:06:29Z; eng; Nature Portfolio; Scientific Data; 2052-4463; 2023-12-01; 101113; 10.1038/s41597-023-02813-9; Hong Kong Corpus of Chinese Sentence and Passage Reading; Yushu Wu (Department of Linguistics and Translation, City University of Hong Kong); Chunyu Kit (Department of Linguistics and Translation, City University of Hong Kong); https://doi.org/10.1038/s41597-023-02813-9
title Hong Kong Corpus of Chinese Sentence and Passage Reading
url https://doi.org/10.1038/s41597-023-02813-9