Two bigrams based language model for auto correction of Arabic OCR errors

In Optical character recognition (OCR), the characteristics of Arabic text cause more errors than in English text.In this paper, a two bi-grams based language model that uses Wikipedia's database is presented.The method can perform auto detection and correction of non-word errors in Arabic OCR...

Full description

Bibliographic Details
Main Authors:	Habeeb, Imad Q., Mohd Yusof, Shahrul Azmi, Ahmad, Faudziah
Format:	Article
Language:	English
Published:	AICIT, Korea 2014
Subjects:	QA76 Computer software
Online Access:	https://repo.uum.edu.my/id/eprint/12602/1/JDCTA3630PPL.pdf

Internet

https://repo.uum.edu.my/id/eprint/12602/1/JDCTA3630PPL.pdf

Two bigrams based language model for auto correction of Arabic OCR errors

Internet

Similar Items