Publishing an OCR ground truth data set for reuse in an unclear copyright setting. Two case studies with legal and technical solutions to enable a collective OCR ground truth data set effort

We present an OCR ground truth data set for historical prints and show improvement of recognition results over baselines with training on this data. We reflect on reusability of the ground truth data set based on two experiments that look into the lega...

Bibliographic Details
Main Authors: David Lassner, Julius Coburger, Clemens Neudecker, Anne Baillot
Format: Article
Language: deu
Published: Forschungsverbund Marbach Weimar Wolfenbüttel 2021-09-01
Series: Zeitschrift für digitale Geisteswissenschaften
Online Access: https://www.zfdg.de/node/340