Nonsemantic word graphs of texts spanning ∼ 4500 years, including pre-literate Amerindian oral narratives
Non-semantic word graphs obtained from oral reports are useful to describe cognitive decline in psychiatric conditions such as Schizophrenia, as well as education-related gains in discourse structure during typical development. Here we provide non-semantic word graph attributes of texts spanning app...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2021-10-01
|
Series: | Data in Brief |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2352340921005801 |
_version_ | 1819024182250307584 |
---|---|
author | Natália Bezerra Mota Sylvia Pinheiro Antonio Guerreiro Mauro Copelli Sidarta Ribeiro |
author_facet | Natália Bezerra Mota Sylvia Pinheiro Antonio Guerreiro Mauro Copelli Sidarta Ribeiro |
author_sort | Natália Bezerra Mota |
collection | DOAJ |
description | Non-semantic word graphs obtained from oral reports are useful to describe cognitive decline in psychiatric conditions such as Schizophrenia, as well as education-related gains in discourse structure during typical development. Here we provide non-semantic word graph attributes of texts spanning approximately 4500 years of history, and pre-literate Amerindian oral narratives. The dataset assessed comprises 707 literary texts representative of 9 different Afro-Eurasian traditions (Syro-Mesopotamian, Egyptian, Hinduist, Persian, Judeo-Christian, Greek-Roman, Medieval, Modern and Contemporary), and Amerindian narratives (N = 39) obtained from a single ethnic group from South America (Kalapalo, N = 18), or from a mixed ethnic group from South, Central and North America (non-Kalapalo, N = 21). The present article provides detailed information about each text or narrative, including measurements of four graph attributes of interest: number of nodes (lexical diversity), repeated edges (short-range recurrence), largest strongly connected component (long-range recurrence), and average shortest path (graph length). |
first_indexed | 2024-12-21T04:50:43Z |
format | Article |
id | doaj.art-7f6325cfb2ee4aa295b055e7ab704589 |
institution | Directory Open Access Journal |
issn | 2352-3409 |
language | English |
last_indexed | 2024-12-21T04:50:43Z |
publishDate | 2021-10-01 |
publisher | Elsevier |
record_format | Article |
series | Data in Brief |
spelling | doaj.art-7f6325cfb2ee4aa295b055e7ab7045892022-12-21T19:15:26ZengElsevierData in Brief2352-34092021-10-0138107296Nonsemantic word graphs of texts spanning ∼ 4500 years, including pre-literate Amerindian oral narrativesNatália Bezerra Mota0Sylvia Pinheiro1Antonio Guerreiro2Mauro Copelli3Sidarta Ribeiro4Instituto do Cérebro, Universidade Federal do Rio Grande do Norte, Natal, Brazil; Departamento de Física, Universidade Federal de Pernambuco, Recife, BrazilInstituto do Cérebro, Universidade Federal do Rio Grande do Norte, Natal, BrazilDepartamento de Antropologia, Universidade Estadual de Campinas, Campinas, BrazilDepartamento de Física, Universidade Federal de Pernambuco, Recife, BrazilInstituto do Cérebro, Universidade Federal do Rio Grande do Norte, Natal, Brazil; Corresponding author.Non-semantic word graphs obtained from oral reports are useful to describe cognitive decline in psychiatric conditions such as Schizophrenia, as well as education-related gains in discourse structure during typical development. Here we provide non-semantic word graph attributes of texts spanning approximately 4500 years of history, and pre-literate Amerindian oral narratives. The dataset assessed comprises 707 literary texts representative of 9 different Afro-Eurasian traditions (Syro-Mesopotamian, Egyptian, Hinduist, Persian, Judeo-Christian, Greek-Roman, Medieval, Modern and Contemporary), and Amerindian narratives (N = 39) obtained from a single ethnic group from South America (Kalapalo, N = 18), or from a mixed ethnic group from South, Central and North America (non-Kalapalo, N = 21). The present article provides detailed information about each text or narrative, including measurements of four graph attributes of interest: number of nodes (lexical diversity), repeated edges (short-range recurrence), largest strongly connected component (long-range recurrence), and average shortest path (graph length).http://www.sciencedirect.com/science/article/pii/S2352340921005801GraphLiteratureBronze ageAxial ageLanguage evolution |
spellingShingle | Natália Bezerra Mota Sylvia Pinheiro Antonio Guerreiro Mauro Copelli Sidarta Ribeiro Nonsemantic word graphs of texts spanning ∼ 4500 years, including pre-literate Amerindian oral narratives Data in Brief Graph Literature Bronze age Axial age Language evolution |
title | Nonsemantic word graphs of texts spanning ∼ 4500 years, including pre-literate Amerindian oral narratives |
title_full | Nonsemantic word graphs of texts spanning ∼ 4500 years, including pre-literate Amerindian oral narratives |
title_fullStr | Nonsemantic word graphs of texts spanning ∼ 4500 years, including pre-literate Amerindian oral narratives |
title_full_unstemmed | Nonsemantic word graphs of texts spanning ∼ 4500 years, including pre-literate Amerindian oral narratives |
title_short | Nonsemantic word graphs of texts spanning ∼ 4500 years, including pre-literate Amerindian oral narratives |
title_sort | nonsemantic word graphs of texts spanning ∼ 4500 years including pre literate amerindian oral narratives |
topic | Graph Literature Bronze age Axial age Language evolution |
url | http://www.sciencedirect.com/science/article/pii/S2352340921005801 |
work_keys_str_mv | AT nataliabezerramota nonsemanticwordgraphsoftextsspanning4500yearsincludingpreliterateamerindianoralnarratives AT sylviapinheiro nonsemanticwordgraphsoftextsspanning4500yearsincludingpreliterateamerindianoralnarratives AT antonioguerreiro nonsemanticwordgraphsoftextsspanning4500yearsincludingpreliterateamerindianoralnarratives AT maurocopelli nonsemanticwordgraphsoftextsspanning4500yearsincludingpreliterateamerindianoralnarratives AT sidartaribeiro nonsemanticwordgraphsoftextsspanning4500yearsincludingpreliterateamerindianoralnarratives |