Obsah a značkování diachronního korpusu češtiny : The Content and Annotation of the Diachronic Corpus of Czech

Bibliographic Details
Main Author: Lehečka, Boris
Format: Article
Language: Czech (ces)
Published: Univerzita Karlova, Filozofická fakulta 2016-01-01
Series: Časopis pro Moderní Filologii
Online Access: http://casopispromodernifilologii.ff.cuni.cz/wp-content/uploads/sites/9/2015/07/Boris_Lehecka_70-77.pdf
Description
Summary: The paper discusses what content and annotation should be included in the diachronic corpus of Old Czech. Based on his analysis of the current state of DIAKORP and the Old Czech Text Bank, the author proposes how to treat the critical apparatus, foreign words in historical Czech texts, and contemporaneous or later marginal or interlinear notes. He also discusses some aspects of the methodology for computing statistics in the diachronic corpus.
ISSN: 0008-7386, 2336-6591