The Challenges of Large‐Scale, Web‐Based Language Datasets: Word Length and Predictability Revisited
Main Authors: | Meylan, Stephan C., Griffiths, Thomas L. |
---|---|
Other Authors: | Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences |
Format: | Article |
Language: | English |
Published: |
Wiley
2022
|
Online Access: | https://hdl.handle.net/1721.1/140541 |
Similar Items
-
Tone and word length across languages
by: Søren Wichmann
Published: (2023-06-01) -
Dataset of Karakalpak language stop words
by: Khabibulla Madatov, et al.
Published: (2023-06-01) -
Word-length algorithm for language identification of under-resourced languages
by: Selamat, A., et al.
Published: (2016) -
Large-scale evidence of dependency length minimization in 37 languages
by: Futrell, Richard Landy Jones, et al.
Published: (2016) -
MyWSL: Malaysian words sign language dataset
by: Rina Tasia Johari, et al.
Published: (2023-08-01)