An Automatic Document Classifier System Based on Genetic Algorithm and Taxonomy
The use of the Web has increased the creation of digital information in an accelerated way and about multiple subjects. Text classification is widely used to filter emails, classify Web pages, and organize results retrieved by Web browsers. In this paper, we propose to raise the problem of automatic...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2018-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8318385/ |
_version_ | 1818331526276841472 |
---|---|
author | Alan Diaz-Manriquez Ana Bertha Rios-Alvarado Jose Hugo Barron-Zambrano Tania Yukary Guerrero-Melendez Juan Carlos Elizondo-Leal |
author_facet | Alan Diaz-Manriquez Ana Bertha Rios-Alvarado Jose Hugo Barron-Zambrano Tania Yukary Guerrero-Melendez Juan Carlos Elizondo-Leal |
author_sort | Alan Diaz-Manriquez |
collection | DOAJ |
description | The use of the Web has increased the creation of digital information in an accelerated way and about multiple subjects. Text classification is widely used to filter emails, classify Web pages, and organize results retrieved by Web browsers. In this paper, we propose to raise the problem of automatic classification of scientific texts as an optimization problem, which will allow obtaining groups from a data set. The use of evolutionary algorithms to solve classification problems has been a recurrent approach. However, there are a few approaches in which classification problems are solved, where the data attributes to be classified are text-type. In this way, it is proposed to use the association for computing machinery taxonomy to obtain the similarity between documents, where each document consists of a set of keywords. According to the results obtained, the algorithm is competitive, which indicates that the proposal of a knowledge-based genetic algorithm is a viable approach to solve the classification problem. |
first_indexed | 2024-12-13T13:21:15Z |
format | Article |
id | doaj.art-03ad0ac3c6d3498db3792f25c5507433 |
institution | Directory Open Access Journal |
issn | 2169-3536 |
language | English |
last_indexed | 2024-12-13T13:21:15Z |
publishDate | 2018-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj.art-03ad0ac3c6d3498db3792f25c55074332022-12-21T23:44:24ZengIEEEIEEE Access2169-35362018-01-016215522155910.1109/ACCESS.2018.28159928318385An Automatic Document Classifier System Based on Genetic Algorithm and TaxonomyAlan Diaz-Manriquez0https://orcid.org/0000-0003-2847-8316Ana Bertha Rios-Alvarado1Jose Hugo Barron-Zambrano2Tania Yukary Guerrero-Melendez3Juan Carlos Elizondo-Leal4Facultad de Ingeniera y Ciencias, Universidad Autónoma de Tamaulipas, Ciudad Victoria, MéxicoFacultad de Ingeniera y Ciencias, Universidad Autónoma de Tamaulipas, Ciudad Victoria, MéxicoFacultad de Ingeniera y Ciencias, Universidad Autónoma de Tamaulipas, Ciudad Victoria, MéxicoFacultad de Ingeniera y Ciencias, Universidad Autónoma de Tamaulipas, Ciudad Victoria, MéxicoFacultad de Ingeniera y Ciencias, Universidad Autónoma de Tamaulipas, Ciudad Victoria, MéxicoThe use of the Web has increased the creation of digital information in an accelerated way and about multiple subjects. Text classification is widely used to filter emails, classify Web pages, and organize results retrieved by Web browsers. In this paper, we propose to raise the problem of automatic classification of scientific texts as an optimization problem, which will allow obtaining groups from a data set. The use of evolutionary algorithms to solve classification problems has been a recurrent approach. However, there are a few approaches in which classification problems are solved, where the data attributes to be classified are text-type. In this way, it is proposed to use the association for computing machinery taxonomy to obtain the similarity between documents, where each document consists of a set of keywords. According to the results obtained, the algorithm is competitive, which indicates that the proposal of a knowledge-based genetic algorithm is a viable approach to solve the classification problem.https://ieeexplore.ieee.org/document/8318385/Classification algorithmsgenetic algorithmsevolutionary computationoptimization |
spellingShingle | Alan Diaz-Manriquez Ana Bertha Rios-Alvarado Jose Hugo Barron-Zambrano Tania Yukary Guerrero-Melendez Juan Carlos Elizondo-Leal An Automatic Document Classifier System Based on Genetic Algorithm and Taxonomy IEEE Access Classification algorithms genetic algorithms evolutionary computation optimization |
title | An Automatic Document Classifier System Based on Genetic Algorithm and Taxonomy |
title_full | An Automatic Document Classifier System Based on Genetic Algorithm and Taxonomy |
title_fullStr | An Automatic Document Classifier System Based on Genetic Algorithm and Taxonomy |
title_full_unstemmed | An Automatic Document Classifier System Based on Genetic Algorithm and Taxonomy |
title_short | An Automatic Document Classifier System Based on Genetic Algorithm and Taxonomy |
title_sort | automatic document classifier system based on genetic algorithm and taxonomy |
topic | Classification algorithms genetic algorithms evolutionary computation optimization |
url | https://ieeexplore.ieee.org/document/8318385/ |
work_keys_str_mv | AT alandiazmanriquez anautomaticdocumentclassifiersystembasedongeneticalgorithmandtaxonomy AT anaberthariosalvarado anautomaticdocumentclassifiersystembasedongeneticalgorithmandtaxonomy AT josehugobarronzambrano anautomaticdocumentclassifiersystembasedongeneticalgorithmandtaxonomy AT taniayukaryguerreromelendez anautomaticdocumentclassifiersystembasedongeneticalgorithmandtaxonomy AT juancarloselizondoleal anautomaticdocumentclassifiersystembasedongeneticalgorithmandtaxonomy AT alandiazmanriquez automaticdocumentclassifiersystembasedongeneticalgorithmandtaxonomy AT anaberthariosalvarado automaticdocumentclassifiersystembasedongeneticalgorithmandtaxonomy AT josehugobarronzambrano automaticdocumentclassifiersystembasedongeneticalgorithmandtaxonomy AT taniayukaryguerreromelendez automaticdocumentclassifiersystembasedongeneticalgorithmandtaxonomy AT juancarloselizondoleal automaticdocumentclassifiersystembasedongeneticalgorithmandtaxonomy |