Text Classification Algorithms: A Survey
In recent years, there has been an exponential growth in the number of complex documents and texts that require a deeper understanding of machine learning methods to be able to accurately classify texts in many applications. Many machine learning approaches have achieved surpassing results in natura...
Main Authors: | Kamran Kowsari, Kiana Jafari Meimandi, Mojtaba Heidarysafa, Sanjana Mendu, Laura Barnes, Donald Brown |
---|---|
Format: | Article |
Language: | English |
Published: | MDPI AG, 2019-04-01 |
Series: | Information |
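Subjects: | text classification, text mining, text representation, text categorization, text analysis, document classification |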
Online Access: | https://www.mdpi.com/2078-2489/10/4/150 |
_version_ | 1819142074418593792 |
---|---|
author | Kamran Kowsari; Kiana Jafari Meimandi; Mojtaba Heidarysafa; Sanjana Mendu; Laura Barnes; Donald Brown |
author_sort | Kamran Kowsari |
collection | DOAJ |
description | In recent years, there has been an exponential growth in the number of complex documents and texts that require a deeper understanding of machine learning methods to be able to accurately classify texts in many applications. Many machine learning approaches have achieved surpassing results in natural language processing. The success of these learning algorithms relies on their capacity to understand complex models and non-linear relationships within data. However, finding suitable structures, architectures, and techniques for text classification is a challenge for researchers. In this paper, a brief overview of text classification algorithms is discussed. This overview covers different text feature extraction methods, dimensionality reduction methods, existing algorithms and techniques, and evaluation methods. Finally, the limitations of each technique and their application in real-world problems are discussed. |
first_indexed | 2024-12-22T12:04:34Z |
format | Article |
id | doaj.art-b54e8b9115f34702bf82f8516f509198 |
institution | Directory of Open Access Journals |
issn | 2078-2489 |
language | English |
last_indexed | 2024-12-22T12:04:34Z |
publishDate | 2019-04-01 |
publisher | MDPI AG |
record_format | Article |
series | Information |
spelling | doaj.art-b54e8b9115f34702bf82f8516f509198; 2022-12-21T18:26:29Z; eng; MDPI AG; Information; 2078-2489; 2019-04-01; 10; 4; 150; 10.3390/info10040150; info10040150; Text Classification Algorithms: A Survey; Kamran Kowsari (0), Kiana Jafari Meimandi (1), Mojtaba Heidarysafa (2), Sanjana Mendu (3), Laura Barnes (4), Donald Brown (5); Department of Systems and Information Engineering, University of Virginia, Charlottesville, VA 22904, USA (all six authors); https://www.mdpi.com/2078-2489/10/4/150; text classification; text mining; text representation; text categorization; text analysis; document classification |
title | Text Classification Algorithms: A Survey |
topic | text classification; text mining; text representation; text categorization; text analysis; document classification |
url | https://www.mdpi.com/2078-2489/10/4/150 |