Integrated Sequence Tagging for Medieval Latin Using Deep Representation Learning

In this paper we consider two sequence tagging tasks for medieval Latin: part-of-speech tagging and lemmatization. These are both basic, yet foundational preprocessing steps in applications such as text re-use detection. Nevertheless, they are generally complicated by the considerable orthographic v...

Cijeli opis

Bibliografski detalji
Glavni autori: Mike Kestemont, Jeroen De Gussem
Format: Članak
Jezik:English
Izdano: Nicolas Turenne 2017-08-01
Serija:Journal of Data Mining and Digital Humanities
Teme:
Online pristup:https://jdmdh.episciences.org/1398/pdf