OCR17: Ground Truth and Models for 17th c. French Prints (and hopefully more)
Machine learning begins with machine teaching: in the following paper, we present the data that we have prepared to kick-start the training of reliable OCR models for 17th century prints written in French. The construction of a representative corpus is a major challenge: we need to gather documents...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Nicolas Turenne
2023-06-01
|
Series: | Journal of Data Mining and Digital Humanities |
Subjects: | |
Online Access: | https://jdmdh.episciences.org/6492/pdf |