OCR17: Ground Truth and Models for 17th c. French Prints (and hopefully more)

Machine learning begins with machine teaching: in the following paper, we present the data that we have prepared to kick-start the training of reliable OCR models for 17th century prints written in French. The construction of a representative corpus is a major challenge: we need to gather documents...

Full description

Bibliographic Details
Main Authors: Simon Gabay, Thibault Clérice, Christian Reul
Format: Article
Language:English
Published: Nicolas Turenne 2023-06-01
Series:Journal of Data Mining and Digital Humanities
Subjects:
Online Access:https://jdmdh.episciences.org/6492/pdf