Arabic Text Clustering Using Self-Organizing Maps and Grey Wolf Optimization

Arabic text clustering is an essential topic in Arabic Natural Language Processing (ANLP). Its significance resides in various applications, such as document indexing, categorization, user review analysis, and others. After inspecting the current work on clustering Arabic text, it is observed that m...

Full description

Bibliographic Details
Main Authors: Souad Larabi-Marie-Sainte, Mashael Bin Alamir, Abdulmajeed Alameer
Format: Article
Language:English
Published: MDPI AG 2023-09-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/13/18/10168
_version_ 1797581461593784320
author Souad Larabi-Marie-Sainte
Mashael Bin Alamir
Abdulmajeed Alameer
author_facet Souad Larabi-Marie-Sainte
Mashael Bin Alamir
Abdulmajeed Alameer
author_sort Souad Larabi-Marie-Sainte
collection DOAJ
description Arabic text clustering is an essential topic in Arabic Natural Language Processing (ANLP). Its significance resides in various applications, such as document indexing, categorization, user review analysis, and others. After inspecting the current work on clustering Arabic text, it is observed that most researchers focus on applying K-Means clustering while hindering other clustering techniques. Our evaluation shows that K-Means has a weakness of inconsistent clustering results and weak clustering performance when the data dimensionality increases. Unlike K-Means clustering, Artificial Neural Networks (ANN) models such as Self-Organizing Maps (SOM) demonstrated higher accuracy and efficiency in clustering even with high dimensional datasets. In this paper, we introduce a new clustering model based on an optimization technique called Grey Wolf Optimization (GWO) used conjointly with SOM clustering to enhance its clustering performance and accuracy. The evaluation results of our proposed technique show an improvement in the effectiveness and efficiency in comparison with state-of-the-art approaches.
first_indexed 2024-03-10T23:05:00Z
format Article
id doaj.art-becca76398a344fb8ddb040ad88fda3e
institution Directory Open Access Journal
issn 2076-3417
language English
last_indexed 2024-03-10T23:05:00Z
publishDate 2023-09-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-becca76398a344fb8ddb040ad88fda3e2023-11-19T09:23:57ZengMDPI AGApplied Sciences2076-34172023-09-0113181016810.3390/app131810168Arabic Text Clustering Using Self-Organizing Maps and Grey Wolf OptimizationSouad Larabi-Marie-Sainte0Mashael Bin Alamir1Abdulmajeed Alameer2Computer Science Department, College of Computer and Information Sciences, Prince Sultan University, Riyadh 11586, Saudi ArabiaGraduate Unit, College of Computer and Information Sciences, Prince Sultan University, Riyadh 11586, Saudi ArabiaComputer Science Department, College of Computer and Information Sciences, King Saud University, Riyadh 11543, Saudi ArabiaArabic text clustering is an essential topic in Arabic Natural Language Processing (ANLP). Its significance resides in various applications, such as document indexing, categorization, user review analysis, and others. After inspecting the current work on clustering Arabic text, it is observed that most researchers focus on applying K-Means clustering while hindering other clustering techniques. Our evaluation shows that K-Means has a weakness of inconsistent clustering results and weak clustering performance when the data dimensionality increases. Unlike K-Means clustering, Artificial Neural Networks (ANN) models such as Self-Organizing Maps (SOM) demonstrated higher accuracy and efficiency in clustering even with high dimensional datasets. In this paper, we introduce a new clustering model based on an optimization technique called Grey Wolf Optimization (GWO) used conjointly with SOM clustering to enhance its clustering performance and accuracy. The evaluation results of our proposed technique show an improvement in the effectiveness and efficiency in comparison with state-of-the-art approaches.https://www.mdpi.com/2076-3417/13/18/10168natural language processingtext clusteringArabic textgrey wolf optimizationdocument categorizationtopic extraction
spellingShingle Souad Larabi-Marie-Sainte
Mashael Bin Alamir
Abdulmajeed Alameer
Arabic Text Clustering Using Self-Organizing Maps and Grey Wolf Optimization
Applied Sciences
natural language processing
text clustering
Arabic text
grey wolf optimization
document categorization
topic extraction
title Arabic Text Clustering Using Self-Organizing Maps and Grey Wolf Optimization
title_full Arabic Text Clustering Using Self-Organizing Maps and Grey Wolf Optimization
title_fullStr Arabic Text Clustering Using Self-Organizing Maps and Grey Wolf Optimization
title_full_unstemmed Arabic Text Clustering Using Self-Organizing Maps and Grey Wolf Optimization
title_short Arabic Text Clustering Using Self-Organizing Maps and Grey Wolf Optimization
title_sort arabic text clustering using self organizing maps and grey wolf optimization
topic natural language processing
text clustering
Arabic text
grey wolf optimization
document categorization
topic extraction
url https://www.mdpi.com/2076-3417/13/18/10168
work_keys_str_mv AT souadlarabimariesainte arabictextclusteringusingselforganizingmapsandgreywolfoptimization
AT mashaelbinalamir arabictextclusteringusingselforganizingmapsandgreywolfoptimization
AT abdulmajeedalameer arabictextclusteringusingselforganizingmapsandgreywolfoptimization