Text Classification Algorithms: A Survey

In recent years, there has been an exponential growth in the number of complex documents and texts that require a deeper understanding of machine learning methods to be able to accurately classify texts in many applications. Many machine learning approaches have achieved surpassing results in natura...

Full description

Bibliographic Details
Main Authors: Kamran Kowsari, Kiana Jafari Meimandi, Mojtaba Heidarysafa, Sanjana Mendu, Laura Barnes, Donald Brown
Format: Article
Language:English
Published: MDPI AG 2019-04-01
Series:Information
Subjects:
Online Access:https://www.mdpi.com/2078-2489/10/4/150
_version_ 1819142074418593792
author Kamran Kowsari
Kiana Jafari Meimandi
Mojtaba Heidarysafa
Sanjana Mendu
Laura Barnes
Donald Brown
author_facet Kamran Kowsari
Kiana Jafari Meimandi
Mojtaba Heidarysafa
Sanjana Mendu
Laura Barnes
Donald Brown
author_sort Kamran Kowsari
collection DOAJ
description In recent years, there has been an exponential growth in the number of complex documents and texts that require a deeper understanding of machine learning methods to be able to accurately classify texts in many applications. Many machine learning approaches have achieved surpassing results in natural language processing. The success of these learning algorithms relies on their capacity to understand complex models and non-linear relationships within data. However, finding suitable structures, architectures, and techniques for text classification is a challenge for researchers. In this paper, a brief overview of text classification algorithms is discussed. This overview covers different text feature extractions, dimensionality reduction methods, existing algorithms and techniques, and evaluations methods. Finally, the limitations of each technique and their application in real-world problems are discussed.
first_indexed 2024-12-22T12:04:34Z
format Article
id doaj.art-b54e8b9115f34702bf82f8516f509198
institution Directory Open Access Journal
issn 2078-2489
language English
last_indexed 2024-12-22T12:04:34Z
publishDate 2019-04-01
publisher MDPI AG
record_format Article
series Information
spelling doaj.art-b54e8b9115f34702bf82f8516f5091982022-12-21T18:26:29ZengMDPI AGInformation2078-24892019-04-0110415010.3390/info10040150info10040150Text Classification Algorithms: A SurveyKamran Kowsari0Kiana Jafari Meimandi1Mojtaba Heidarysafa2Sanjana Mendu3Laura Barnes4Donald Brown5Department of Systems and Information Engineering, University of Virginia, Charlottesville, VA 22904, USADepartment of Systems and Information Engineering, University of Virginia, Charlottesville, VA 22904, USADepartment of Systems and Information Engineering, University of Virginia, Charlottesville, VA 22904, USADepartment of Systems and Information Engineering, University of Virginia, Charlottesville, VA 22904, USADepartment of Systems and Information Engineering, University of Virginia, Charlottesville, VA 22904, USADepartment of Systems and Information Engineering, University of Virginia, Charlottesville, VA 22904, USAIn recent years, there has been an exponential growth in the number of complex documents and texts that require a deeper understanding of machine learning methods to be able to accurately classify texts in many applications. Many machine learning approaches have achieved surpassing results in natural language processing. The success of these learning algorithms relies on their capacity to understand complex models and non-linear relationships within data. However, finding suitable structures, architectures, and techniques for text classification is a challenge for researchers. In this paper, a brief overview of text classification algorithms is discussed. This overview covers different text feature extractions, dimensionality reduction methods, existing algorithms and techniques, and evaluations methods. Finally, the limitations of each technique and their application in real-world problems are discussed.https://www.mdpi.com/2078-2489/10/4/150text classificationtext miningtext representationtext categorizationtext analysisdocument classification
spellingShingle Kamran Kowsari
Kiana Jafari Meimandi
Mojtaba Heidarysafa
Sanjana Mendu
Laura Barnes
Donald Brown
Text Classification Algorithms: A Survey
Information
text classification
text mining
text representation
text categorization
text analysis
document classification
title Text Classification Algorithms: A Survey
title_full Text Classification Algorithms: A Survey
title_fullStr Text Classification Algorithms: A Survey
title_full_unstemmed Text Classification Algorithms: A Survey
title_short Text Classification Algorithms: A Survey
title_sort text classification algorithms a survey
topic text classification
text mining
text representation
text categorization
text analysis
document classification
url https://www.mdpi.com/2078-2489/10/4/150
work_keys_str_mv AT kamrankowsari textclassificationalgorithmsasurvey
AT kianajafarimeimandi textclassificationalgorithmsasurvey
AT mojtabaheidarysafa textclassificationalgorithmsasurvey
AT sanjanamendu textclassificationalgorithmsasurvey
AT laurabarnes textclassificationalgorithmsasurvey
AT donaldbrown textclassificationalgorithmsasurvey