Two bigrams based language model for auto correction of Arabic OCR errors
In optical character recognition (OCR), the characteristics of Arabic text cause more errors than in English text. In this paper, a two bi-grams based language model that uses Wikipedia's database is presented. The method can perform auto detection and correction of non-word errors in Arabic OCR...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: | AICIT, Korea, 2014 |
Subjects: | |
Online Access: | https://repo.uum.edu.my/id/eprint/12602/1/JDCTA3630PPL.pdf |