Corpus of Spanish Golden-Age Sonnets

This paper reviews a TEI corpus of sonnets from the Spanish Golden Age. The 52 authors represented in the collection include Cervantes, Lope, Quevedo, Tirso, Calderón and Góngora. In total, the corpus contains more than 5000 sonnets. The project is currently under development at the Univ...

Bibliographic Details
Main Author: José Calvo Tello
Format: Article
Language: deu
Published: Institut für Dokumentologie und Editorik e. V. 2017-09-01
Series: RIDE
Subjects: poetry, siglo de oro, sonnets, spanish, tei, text collection
Online Access: https://ride.i-d-e.de/issues/issue-6/corpus-of-spanish-golden-age-sonnets/
author José Calvo Tello
collection DOAJ
description This paper reviews a TEI corpus of sonnets from the Spanish Golden Age. The 52 authors represented in the collection include Cervantes, Lope, Quevedo, Tirso, Calderón and Góngora. In total, the corpus contains more than 5000 sonnets. The project is currently under development at the University of Alicante, Spain. One of the strongest aspects of this corpus is the metrical annotation of each verse. The researchers have already analysed the corpus using topic modelling, a technique well suited to the structure of the collection and the size of the texts. The weakest aspect of the collection is the metadata of the files: most of it is redundant, and some important information (e.g. identifiers of texts, author, collection, source) is missing. The corpus is available as a GitHub repository, a good practice that facilitates cloning all the data, tracking changes and preserving the corpus.
first_indexed 2024-03-08T16:58:14Z
format Article
id doaj.art-bea7efdf226641efa5d21a4f26e40368
institution Directory Open Access Journal
issn 2363-4952
language deu
last_indexed 2024-03-08T16:58:14Z
publishDate 2017-09-01
publisher Institut für Dokumentologie und Editorik e. V.
record_format Article
series RIDE
spelling doaj.art-bea7efdf226641efa5d21a4f26e40368; 2024-01-04T18:19:37Z; deu; Institut für Dokumentologie und Editorik e. V.; RIDE; 2363-4952; 2017-09-01; 6; 10.18716/ride.a.6.4; Corpus of Spanish Golden-Age Sonnets; José Calvo Tello; https://orcid.org/0000-0002-1129-5604; University of Würzburg; https://ride.i-d-e.de/issues/issue-6/corpus-of-spanish-golden-age-sonnets/; poetry; siglo de oro; sonnets; spanish; tei; text collection
title Corpus of Spanish Golden-Age Sonnets
title_sort corpus of spanish golden age sonnets
topic poetry
siglo de oro
sonnets
spanish
tei
text collection
url https://ride.i-d-e.de/issues/issue-6/corpus-of-spanish-golden-age-sonnets/
work_keys_str_mv AT josecalvotello corpusofspanishgoldenagesonnets