Improving the Accuracy of Tesseract 4.0 OCR Engine Using Convolution-Based Preprocessing
Optical Character Recognition (OCR) is the process of identifying and converting texts rendered in images using pixels to a more computer-friendly representation. The presented work aims to prove that the accuracy of the Tesseract 4.0 OCR engine can be further enhanced by employing convolution-based...
Main Authors: | Dan Sporici, Elena Cușnir, Costin-Anton Boiangiu |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2020-05-01
|
Series: | Symmetry |
Subjects: | |
Online Access: | https://www.mdpi.com/2073-8994/12/5/715 |
Similar Items
-
Implementation of OCR (Optical Character Recognition) Using Tesseract in Detecting Character in Quotes Text Images
by: Ikha Novie Tri Lestari, et al.
Published: (2022-09-01) -
Exploiting Script Similarities to Compensate for the Large Amount of Data in Training Tesseract LSTM: Towards Kurdish OCR
by: Saman Idrees, et al.
Published: (2021-10-01) -
Japanese Letter Pattern Recognition Application with Tesseract Engine
by: Akhmad Imam Fahrizal, et al.
Published: (2015-12-01) -
De la bibliothèque au « Tesseract » : une représentation borgésienne de la littérature dans le film de Christopher Nolan Interstellar (2014)
by: Emmanuel Buzay
Published: (2021-06-01) -
OCR with Google Vision API and Tesseract
by: Isabelle Gribomont
Published: (2023-03-01)