Semantic Text Segmentation from Synthetic Images of Full-Text Documents

An algorithm (divided into multiple modules) for generating images of full-text documents is presented. These images can be used to train, test, and evaluate models for Optical Character Recognition (OCR). The algorithm is modular, individual parts can be changed and tweaked to generate desired i...

Full description

Bibliographic Details
Main Authors:	Lukáš Bureš, Ivan Gruber, Petr Neduchal, Miroslav Hlaváč, Marek Hrúz
Format:	Article
Language:	English
Published:	Russian Academy of Sciences, St. Petersburg Federal Research Center 2019-12-01
Series:	Информатика и автоматизация
Subjects:	generation of synthetic images semantic text segmentation variational autoencoder vae optical character recognition ocr aged-looking text generation.
Online Access:	http://ia.spcras.ru/index.php/sp/article/view/4527

Internet

http://ia.spcras.ru/index.php/sp/article/view/4527

Semantic Text Segmentation from Synthetic Images of Full-Text Documents

Internet

Similar Items