Revisiting K-Means and Topic Modeling, a Comparison Study to Cluster Arabic Documents
Clustering Arabic text documents is of high importance for many natural language technologies. This paper uses a combined method to cluster Arabic text documents. Mainly, we use generative models and clustering techniques. The study uses latent Dirichlet allocation and k-means clustering algorithm a...
Main Authors: | M. Alhawarat, M. Hegazi |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2018-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8402221/ |
Similar Items
-
Urdu Documents Clustering with Unsupervised and Semi-Supervised Probabilistic Topic Modeling
by: Mubashar Mustafa, et al.
Published: (2020-11-01) -
Leveraging Global and Local Topic Popularities for LDA-Based Document Clustering
by: Peng Yang, et al.
Published: (2020-01-01) -
Survey of Automatic Labeling Methods for Topic Models
by: HE Dongbin, TAO Sha, ZHU Yanhong, REN Yanzhao, CHU Yunxia
Published: (2023-12-01) -
Social-Child-Case Document Clustering based on Topic Modeling using Latent Dirichlet Allocation
by: Nur Annisa Tresnasari, et al.
Published: (2020-04-01) -
Identification of Hot Topics and Trends in Knowledge and Information Science, Based on Text Mining Techniques
by: Elahe Akhavanhariri, et al.
Published: (2022-12-01)