Improving the Accuracy of Tesseract 4.0 OCR Engine Using Convolution-Based Preprocessing

Optical Character Recognition (OCR) is the process of identifying and converting texts rendered in images using pixels to a more computer-friendly representation. The presented work aims to prove that the accuracy of the Tesseract 4.0 OCR engine can be further enhanced by employing convolution-based...

Full description

Bibliographic Details
Main Authors: Dan Sporici, Elena Cușnir, Costin-Anton Boiangiu
Format: Article
Language:English
Published: MDPI AG 2020-05-01
Series:Symmetry
Subjects:
Online Access:https://www.mdpi.com/2073-8994/12/5/715