Dynamic Sub-Swarm Approach of PSO Algorithms for Text Document Clustering

Text document clustering is one of the data mining techniques used in many real-world applications such as information retrieval from IoT Sensors data, duplicate content detection, and document organization. Swarm intelligence (SI) algorithms are suitable for solving complex text document clustering...

Full description

Bibliographic Details
Main Authors: Suganya Selvaraj, Eunmi Choi
Format: Article
Language:English
Published: MDPI AG 2022-12-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/22/24/9653
_version_ 1797455402075422720
author Suganya Selvaraj
Eunmi Choi
author_facet Suganya Selvaraj
Eunmi Choi
author_sort Suganya Selvaraj
collection DOAJ
description Text document clustering is one of the data mining techniques used in many real-world applications such as information retrieval from IoT Sensors data, duplicate content detection, and document organization. Swarm intelligence (SI) algorithms are suitable for solving complex text document clustering problems compared to traditional clustering algorithms. The previous studies show that in SI algorithms, particle swarm optimization (PSO) provides an effective solution to text document clustering problems. This PSO still needs to be improved to avoid the problems such as premature convergence to local optima. In this paper, an approach called dynamic sub-swarm of PSO (subswarm-PSO) is proposed to improve the results of PSO for text document clustering problems and avoid the local optimum by improving the global search capabilities of PSO. The results of this proposed approach were compared with the standard PSO algorithm and K-means algorithm. As for performance assurance, the evaluation metric purity is used with six benchmark data sets. The experimental results of this study show that our proposed subswarm-PSO algorithm performs best with high purity comparing the standard PSO and K-means traditional algorithms and also the execution time of subswarm-PSO comparatively takes a little less than the standard PSO algorithm.
first_indexed 2024-03-09T15:52:54Z
format Article
id doaj.art-c9181edf30034a3d98a0b0180ebf619b
institution Directory Open Access Journal
issn 1424-8220
language English
last_indexed 2024-03-09T15:52:54Z
publishDate 2022-12-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj.art-c9181edf30034a3d98a0b0180ebf619b2023-11-24T17:53:07ZengMDPI AGSensors1424-82202022-12-012224965310.3390/s22249653Dynamic Sub-Swarm Approach of PSO Algorithms for Text Document ClusteringSuganya Selvaraj0Eunmi Choi1Department of Financial Information Security, Kookmin University, Seoul 02707, Republic of KoreaDepartment of Financial Information Security, Kookmin University, Seoul 02707, Republic of KoreaText document clustering is one of the data mining techniques used in many real-world applications such as information retrieval from IoT Sensors data, duplicate content detection, and document organization. Swarm intelligence (SI) algorithms are suitable for solving complex text document clustering problems compared to traditional clustering algorithms. The previous studies show that in SI algorithms, particle swarm optimization (PSO) provides an effective solution to text document clustering problems. This PSO still needs to be improved to avoid the problems such as premature convergence to local optima. In this paper, an approach called dynamic sub-swarm of PSO (subswarm-PSO) is proposed to improve the results of PSO for text document clustering problems and avoid the local optimum by improving the global search capabilities of PSO. The results of this proposed approach were compared with the standard PSO algorithm and K-means algorithm. As for performance assurance, the evaluation metric purity is used with six benchmark data sets. The experimental results of this study show that our proposed subswarm-PSO algorithm performs best with high purity comparing the standard PSO and K-means traditional algorithms and also the execution time of subswarm-PSO comparatively takes a little less than the standard PSO algorithm.https://www.mdpi.com/1424-8220/22/24/9653swarm intelligenceparticle swarm optimizationsub-swarm PSOtext document clustering
spellingShingle Suganya Selvaraj
Eunmi Choi
Dynamic Sub-Swarm Approach of PSO Algorithms for Text Document Clustering
Sensors
swarm intelligence
particle swarm optimization
sub-swarm PSO
text document clustering
title Dynamic Sub-Swarm Approach of PSO Algorithms for Text Document Clustering
title_full Dynamic Sub-Swarm Approach of PSO Algorithms for Text Document Clustering
title_fullStr Dynamic Sub-Swarm Approach of PSO Algorithms for Text Document Clustering
title_full_unstemmed Dynamic Sub-Swarm Approach of PSO Algorithms for Text Document Clustering
title_short Dynamic Sub-Swarm Approach of PSO Algorithms for Text Document Clustering
title_sort dynamic sub swarm approach of pso algorithms for text document clustering
topic swarm intelligence
particle swarm optimization
sub-swarm PSO
text document clustering
url https://www.mdpi.com/1424-8220/22/24/9653
work_keys_str_mv AT suganyaselvaraj dynamicsubswarmapproachofpsoalgorithmsfortextdocumentclustering
AT eunmichoi dynamicsubswarmapproachofpsoalgorithmsfortextdocumentclustering