An Automatic Document Classifier System Based on Genetic Algorithm and Taxonomy

The use of the Web has increased the creation of digital information in an accelerated way and about multiple subjects. Text classification is widely used to filter emails, classify Web pages, and organize results retrieved by Web browsers. In this paper, we propose to raise the problem of automatic...

Full description

Bibliographic Details
Main Authors: Alan Diaz-Manriquez, Ana Bertha Rios-Alvarado, Jose Hugo Barron-Zambrano, Tania Yukary Guerrero-Melendez, Juan Carlos Elizondo-Leal
Format: Article
Language:English
Published: IEEE 2018-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8318385/
_version_ 1818331526276841472
author Alan Diaz-Manriquez
Ana Bertha Rios-Alvarado
Jose Hugo Barron-Zambrano
Tania Yukary Guerrero-Melendez
Juan Carlos Elizondo-Leal
author_facet Alan Diaz-Manriquez
Ana Bertha Rios-Alvarado
Jose Hugo Barron-Zambrano
Tania Yukary Guerrero-Melendez
Juan Carlos Elizondo-Leal
author_sort Alan Diaz-Manriquez
collection DOAJ
description The use of the Web has increased the creation of digital information in an accelerated way and about multiple subjects. Text classification is widely used to filter emails, classify Web pages, and organize results retrieved by Web browsers. In this paper, we propose to raise the problem of automatic classification of scientific texts as an optimization problem, which will allow obtaining groups from a data set. The use of evolutionary algorithms to solve classification problems has been a recurrent approach. However, there are a few approaches in which classification problems are solved, where the data attributes to be classified are text-type. In this way, it is proposed to use the association for computing machinery taxonomy to obtain the similarity between documents, where each document consists of a set of keywords. According to the results obtained, the algorithm is competitive, which indicates that the proposal of a knowledge-based genetic algorithm is a viable approach to solve the classification problem.
first_indexed 2024-12-13T13:21:15Z
format Article
id doaj.art-03ad0ac3c6d3498db3792f25c5507433
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-13T13:21:15Z
publishDate 2018-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-03ad0ac3c6d3498db3792f25c55074332022-12-21T23:44:24ZengIEEEIEEE Access2169-35362018-01-016215522155910.1109/ACCESS.2018.28159928318385An Automatic Document Classifier System Based on Genetic Algorithm and TaxonomyAlan Diaz-Manriquez0https://orcid.org/0000-0003-2847-8316Ana Bertha Rios-Alvarado1Jose Hugo Barron-Zambrano2Tania Yukary Guerrero-Melendez3Juan Carlos Elizondo-Leal4Facultad de Ingeniera y Ciencias, Universidad Autónoma de Tamaulipas, Ciudad Victoria, MéxicoFacultad de Ingeniera y Ciencias, Universidad Autónoma de Tamaulipas, Ciudad Victoria, MéxicoFacultad de Ingeniera y Ciencias, Universidad Autónoma de Tamaulipas, Ciudad Victoria, MéxicoFacultad de Ingeniera y Ciencias, Universidad Autónoma de Tamaulipas, Ciudad Victoria, MéxicoFacultad de Ingeniera y Ciencias, Universidad Autónoma de Tamaulipas, Ciudad Victoria, MéxicoThe use of the Web has increased the creation of digital information in an accelerated way and about multiple subjects. Text classification is widely used to filter emails, classify Web pages, and organize results retrieved by Web browsers. In this paper, we propose to raise the problem of automatic classification of scientific texts as an optimization problem, which will allow obtaining groups from a data set. The use of evolutionary algorithms to solve classification problems has been a recurrent approach. However, there are a few approaches in which classification problems are solved, where the data attributes to be classified are text-type. In this way, it is proposed to use the association for computing machinery taxonomy to obtain the similarity between documents, where each document consists of a set of keywords. According to the results obtained, the algorithm is competitive, which indicates that the proposal of a knowledge-based genetic algorithm is a viable approach to solve the classification problem.https://ieeexplore.ieee.org/document/8318385/Classification algorithmsgenetic algorithmsevolutionary computationoptimization
spellingShingle Alan Diaz-Manriquez
Ana Bertha Rios-Alvarado
Jose Hugo Barron-Zambrano
Tania Yukary Guerrero-Melendez
Juan Carlos Elizondo-Leal
An Automatic Document Classifier System Based on Genetic Algorithm and Taxonomy
IEEE Access
Classification algorithms
genetic algorithms
evolutionary computation
optimization
title An Automatic Document Classifier System Based on Genetic Algorithm and Taxonomy
title_full An Automatic Document Classifier System Based on Genetic Algorithm and Taxonomy
title_fullStr An Automatic Document Classifier System Based on Genetic Algorithm and Taxonomy
title_full_unstemmed An Automatic Document Classifier System Based on Genetic Algorithm and Taxonomy
title_short An Automatic Document Classifier System Based on Genetic Algorithm and Taxonomy
title_sort automatic document classifier system based on genetic algorithm and taxonomy
topic Classification algorithms
genetic algorithms
evolutionary computation
optimization
url https://ieeexplore.ieee.org/document/8318385/
work_keys_str_mv AT alandiazmanriquez anautomaticdocumentclassifiersystembasedongeneticalgorithmandtaxonomy
AT anaberthariosalvarado anautomaticdocumentclassifiersystembasedongeneticalgorithmandtaxonomy
AT josehugobarronzambrano anautomaticdocumentclassifiersystembasedongeneticalgorithmandtaxonomy
AT taniayukaryguerreromelendez anautomaticdocumentclassifiersystembasedongeneticalgorithmandtaxonomy
AT juancarloselizondoleal anautomaticdocumentclassifiersystembasedongeneticalgorithmandtaxonomy
AT alandiazmanriquez automaticdocumentclassifiersystembasedongeneticalgorithmandtaxonomy
AT anaberthariosalvarado automaticdocumentclassifiersystembasedongeneticalgorithmandtaxonomy
AT josehugobarronzambrano automaticdocumentclassifiersystembasedongeneticalgorithmandtaxonomy
AT taniayukaryguerreromelendez automaticdocumentclassifiersystembasedongeneticalgorithmandtaxonomy
AT juancarloselizondoleal automaticdocumentclassifiersystembasedongeneticalgorithmandtaxonomy