Corpus of Spanish Golden-Age Sonnets
In this paper a TEI corpus with sonnets from the Spanish Golden-Age is reviewed. Some of the 52 authors represented in the collection are Cervantes, Lope, Quevedo, Tirso, Calderón or Góngora. In total, the corpus contains more than 5000 sonnets. The project is currently under development at the Univ...
Main Author: | |
---|---|
Format: | Article |
Language: | deu |
Published: |
Institut für Dokumentologie und Editorik e. V.
2017-09-01
|
Series: | RIDE |
Subjects: | |
Online Access: | https://ride.i-d-e.de/issues/issue-6/corpus-of-spanish-golden-age-sonnets/ |
_version_ | 1797366017637220352 |
---|---|
author | José Calvo Tello |
author_facet | José Calvo Tello |
author_sort | José Calvo Tello |
collection | DOAJ |
description | In this paper a TEI corpus with sonnets from the Spanish Golden-Age is reviewed. Some of the 52 authors represented in the collection are Cervantes, Lope, Quevedo, Tirso, Calderón or Góngora. In total, the corpus contains more than 5000 sonnets. The project is currently under development at the University of Alicante, Spain. One of the strongest aspects of this corpus is the metrical annotation of each verse. The researchers have already analysed the corpus using topic modelling, a suitable technique for the structure of the collection and the size of the texts. The weakest aspect of this collection is the metadata of the files: the majority of them are redundant and some important aspects (e.g. identifiers of texts, author, collection, source) are missing. The corpus is available as a GitHub repository, a good practice that facilitates cloning all the data, the track of changes and the preservation of the corpus. |
first_indexed | 2024-03-08T16:58:14Z |
format | Article |
id | doaj.art-bea7efdf226641efa5d21a4f26e40368 |
institution | Directory Open Access Journal |
issn | 2363-4952 |
language | deu |
last_indexed | 2024-03-08T16:58:14Z |
publishDate | 2017-09-01 |
publisher | Institut für Dokumentologie und Editorik e. V. |
record_format | Article |
series | RIDE |
spelling | doaj.art-bea7efdf226641efa5d21a4f26e403682024-01-04T18:19:37ZdeuInstitut für Dokumentologie und Editorik e. V.RIDE2363-49522017-09-01610.18716/ride.a.6.4Corpus of Spanish Golden-Age SonnetsJosé Calvo Tello0https://orcid.org/0000-0002-1129-5604University of WürzburgIn this paper a TEI corpus with sonnets from the Spanish Golden-Age is reviewed. Some of the 52 authors represented in the collection are Cervantes, Lope, Quevedo, Tirso, Calderón or Góngora. In total, the corpus contains more than 5000 sonnets. The project is currently under development at the University of Alicante, Spain. One of the strongest aspects of this corpus is the metrical annotation of each verse. The researchers have already analysed the corpus using topic modelling, a suitable technique for the structure of the collection and the size of the texts. The weakest aspect of this collection is the metadata of the files: the majority of them are redundant and some important aspects (e.g. identifiers of texts, author, collection, source) are missing. The corpus is available as a GitHub repository, a good practice that facilitates cloning all the data, the track of changes and the preservation of the corpus.https://ride.i-d-e.de/issues/issue-6/corpus-of-spanish-golden-age-sonnets/poetrysiglo de orosonnetsspanishteitext collection |
spellingShingle | José Calvo Tello Corpus of Spanish Golden-Age Sonnets RIDE poetry siglo de oro sonnets spanish tei text collection |
title | Corpus of Spanish Golden-Age Sonnets |
title_full | Corpus of Spanish Golden-Age Sonnets |
title_fullStr | Corpus of Spanish Golden-Age Sonnets |
title_full_unstemmed | Corpus of Spanish Golden-Age Sonnets |
title_short | Corpus of Spanish Golden-Age Sonnets |
title_sort | corpus of spanish golden age sonnets |
topic | poetry siglo de oro sonnets spanish tei text collection |
url | https://ride.i-d-e.de/issues/issue-6/corpus-of-spanish-golden-age-sonnets/ |
work_keys_str_mv | AT josecalvotello corpusofspanishgoldenagesonnets |