Automatic Spell-Checking System for Spanish Based on the Ar2p Neural Network Model

Currently, approaches to correcting misspelled words have problems when the words are complex or massive. This is even more serious in the case of Spanish, where there are very few studies in this regard. So, proposing new approaches to word recognition and correction remains a research topic of int...

Full description

Bibliographic Details
Main Authors: Eduard Puerto, Jose Aguilar, Angel Pinto
Format: Article
Language:English
Published: MDPI AG 2024-03-01
Series:Computers
Subjects:
Online Access:https://www.mdpi.com/2073-431X/13/3/76
_version_ 1797241592901271552
author Eduard Puerto
Jose Aguilar
Angel Pinto
author_facet Eduard Puerto
Jose Aguilar
Angel Pinto
author_sort Eduard Puerto
collection DOAJ
description Currently, approaches to correcting misspelled words have problems when the words are complex or massive. This is even more serious in the case of Spanish, where there are very few studies in this regard. So, proposing new approaches to word recognition and correction remains a research topic of interest. In particular, an interesting approach is to computationally simulate the brain process for recognizing misspelled words and their automatic correction. Thus, this article presents an automatic recognition and correction system of misspelled words in Spanish texts, for the detection of misspelled words, and their automatic amendments, based on the systematic theory of pattern recognition of the mind (PRTM). The main innovation of the research is the use of the PRTM theory in this context. Particularly, a corrective system of misspelled words in Spanish based on this theory, called Ar2p-Text, was designed and built. Ar2p-Text carries out a recursive process of analysis of words by a disaggregation/integration mechanism, using specialized hierarchical recognition modules that define formal strategies to determine if a word is well or poorly written. A comparative evaluation shows that the precision and coverage of our Ar2p-Text model are competitive with other spell-checkers. In the experiments, the system achieves better performance than the three other systems. In general, Ar2p-Text obtains an F-measure of 83%, above the 73% achieved by the other spell-checkers. Our hierarchical approach reuses a lot of information, allowing for the improvement of the text analysis processes in both quality and efficiency. Preliminary results show that the above will allow for future developments of technologies for the correction of words inspired by this hierarchical approach.
first_indexed 2024-04-24T18:25:47Z
format Article
id doaj.art-858457f962ac4ca3ba668a5a2eef7b91
institution Directory Open Access Journal
issn 2073-431X
language English
last_indexed 2024-04-24T18:25:47Z
publishDate 2024-03-01
publisher MDPI AG
record_format Article
series Computers
spelling doaj.art-858457f962ac4ca3ba668a5a2eef7b912024-03-27T13:32:02ZengMDPI AGComputers2073-431X2024-03-011337610.3390/computers13030076Automatic Spell-Checking System for Spanish Based on the Ar2p Neural Network ModelEduard Puerto0Jose Aguilar1Angel Pinto2Grupo de Investigación en Inteligencia Artificial (GIA), Facultad de Ingeniería, Universidad Francisco de Paula Santander, Cúcuta 540001, ColombiaCentro de Estudio en Microcomputación y Sistemas Distribuidos (CEMISID), Facultad de Ingeniería, Universidad de Los Andes, Mérida 5101, VenezuelaGrupo de Investigación TESEEO, Universidad del Sinú, Montería 230001, ColombiaCurrently, approaches to correcting misspelled words have problems when the words are complex or massive. This is even more serious in the case of Spanish, where there are very few studies in this regard. So, proposing new approaches to word recognition and correction remains a research topic of interest. In particular, an interesting approach is to computationally simulate the brain process for recognizing misspelled words and their automatic correction. Thus, this article presents an automatic recognition and correction system of misspelled words in Spanish texts, for the detection of misspelled words, and their automatic amendments, based on the systematic theory of pattern recognition of the mind (PRTM). The main innovation of the research is the use of the PRTM theory in this context. Particularly, a corrective system of misspelled words in Spanish based on this theory, called Ar2p-Text, was designed and built. Ar2p-Text carries out a recursive process of analysis of words by a disaggregation/integration mechanism, using specialized hierarchical recognition modules that define formal strategies to determine if a word is well or poorly written. A comparative evaluation shows that the precision and coverage of our Ar2p-Text model are competitive with other spell-checkers. In the experiments, the system achieves better performance than the three other systems. In general, Ar2p-Text obtains an F-measure of 83%, above the 73% achieved by the other spell-checkers. Our hierarchical approach reuses a lot of information, allowing for the improvement of the text analysis processes in both quality and efficiency. Preliminary results show that the above will allow for future developments of technologies for the correction of words inspired by this hierarchical approach.https://www.mdpi.com/2073-431X/13/3/76spell-checkertext recognitionAr2p-Text
spellingShingle Eduard Puerto
Jose Aguilar
Angel Pinto
Automatic Spell-Checking System for Spanish Based on the Ar2p Neural Network Model
Computers
spell-checker
text recognition
Ar2p-Text
title Automatic Spell-Checking System for Spanish Based on the Ar2p Neural Network Model
title_full Automatic Spell-Checking System for Spanish Based on the Ar2p Neural Network Model
title_fullStr Automatic Spell-Checking System for Spanish Based on the Ar2p Neural Network Model
title_full_unstemmed Automatic Spell-Checking System for Spanish Based on the Ar2p Neural Network Model
title_short Automatic Spell-Checking System for Spanish Based on the Ar2p Neural Network Model
title_sort automatic spell checking system for spanish based on the ar2p neural network model
topic spell-checker
text recognition
Ar2p-Text
url https://www.mdpi.com/2073-431X/13/3/76
work_keys_str_mv AT eduardpuerto automaticspellcheckingsystemforspanishbasedonthear2pneuralnetworkmodel
AT joseaguilar automaticspellcheckingsystemforspanishbasedonthear2pneuralnetworkmodel
AT angelpinto automaticspellcheckingsystemforspanishbasedonthear2pneuralnetworkmodel