Dynamic Sub-Swarm Approach of PSO Algorithms for Text Document Clustering
Text document clustering is one of the data mining techniques used in many real-world applications such as information retrieval from IoT Sensors data, duplicate content detection, and document organization. Swarm intelligence (SI) algorithms are suitable for solving complex text document clustering...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2022-12-01
|
Series: | Sensors |
Subjects: | |
Online Access: | https://www.mdpi.com/1424-8220/22/24/9653 |
_version_ | 1797455402075422720 |
---|---|
author | Suganya Selvaraj Eunmi Choi |
author_facet | Suganya Selvaraj Eunmi Choi |
author_sort | Suganya Selvaraj |
collection | DOAJ |
description | Text document clustering is one of the data mining techniques used in many real-world applications such as information retrieval from IoT Sensors data, duplicate content detection, and document organization. Swarm intelligence (SI) algorithms are suitable for solving complex text document clustering problems compared to traditional clustering algorithms. The previous studies show that in SI algorithms, particle swarm optimization (PSO) provides an effective solution to text document clustering problems. This PSO still needs to be improved to avoid the problems such as premature convergence to local optima. In this paper, an approach called dynamic sub-swarm of PSO (subswarm-PSO) is proposed to improve the results of PSO for text document clustering problems and avoid the local optimum by improving the global search capabilities of PSO. The results of this proposed approach were compared with the standard PSO algorithm and K-means algorithm. As for performance assurance, the evaluation metric purity is used with six benchmark data sets. The experimental results of this study show that our proposed subswarm-PSO algorithm performs best with high purity comparing the standard PSO and K-means traditional algorithms and also the execution time of subswarm-PSO comparatively takes a little less than the standard PSO algorithm. |
first_indexed | 2024-03-09T15:52:54Z |
format | Article |
id | doaj.art-c9181edf30034a3d98a0b0180ebf619b |
institution | Directory Open Access Journal |
issn | 1424-8220 |
language | English |
last_indexed | 2024-03-09T15:52:54Z |
publishDate | 2022-12-01 |
publisher | MDPI AG |
record_format | Article |
series | Sensors |
spelling | doaj.art-c9181edf30034a3d98a0b0180ebf619b2023-11-24T17:53:07ZengMDPI AGSensors1424-82202022-12-012224965310.3390/s22249653Dynamic Sub-Swarm Approach of PSO Algorithms for Text Document ClusteringSuganya Selvaraj0Eunmi Choi1Department of Financial Information Security, Kookmin University, Seoul 02707, Republic of KoreaDepartment of Financial Information Security, Kookmin University, Seoul 02707, Republic of KoreaText document clustering is one of the data mining techniques used in many real-world applications such as information retrieval from IoT Sensors data, duplicate content detection, and document organization. Swarm intelligence (SI) algorithms are suitable for solving complex text document clustering problems compared to traditional clustering algorithms. The previous studies show that in SI algorithms, particle swarm optimization (PSO) provides an effective solution to text document clustering problems. This PSO still needs to be improved to avoid the problems such as premature convergence to local optima. In this paper, an approach called dynamic sub-swarm of PSO (subswarm-PSO) is proposed to improve the results of PSO for text document clustering problems and avoid the local optimum by improving the global search capabilities of PSO. The results of this proposed approach were compared with the standard PSO algorithm and K-means algorithm. As for performance assurance, the evaluation metric purity is used with six benchmark data sets. The experimental results of this study show that our proposed subswarm-PSO algorithm performs best with high purity comparing the standard PSO and K-means traditional algorithms and also the execution time of subswarm-PSO comparatively takes a little less than the standard PSO algorithm.https://www.mdpi.com/1424-8220/22/24/9653swarm intelligenceparticle swarm optimizationsub-swarm PSOtext document clustering |
spellingShingle | Suganya Selvaraj Eunmi Choi Dynamic Sub-Swarm Approach of PSO Algorithms for Text Document Clustering Sensors swarm intelligence particle swarm optimization sub-swarm PSO text document clustering |
title | Dynamic Sub-Swarm Approach of PSO Algorithms for Text Document Clustering |
title_full | Dynamic Sub-Swarm Approach of PSO Algorithms for Text Document Clustering |
title_fullStr | Dynamic Sub-Swarm Approach of PSO Algorithms for Text Document Clustering |
title_full_unstemmed | Dynamic Sub-Swarm Approach of PSO Algorithms for Text Document Clustering |
title_short | Dynamic Sub-Swarm Approach of PSO Algorithms for Text Document Clustering |
title_sort | dynamic sub swarm approach of pso algorithms for text document clustering |
topic | swarm intelligence particle swarm optimization sub-swarm PSO text document clustering |
url | https://www.mdpi.com/1424-8220/22/24/9653 |
work_keys_str_mv | AT suganyaselvaraj dynamicsubswarmapproachofpsoalgorithmsfortextdocumentclustering AT eunmichoi dynamicsubswarmapproachofpsoalgorithmsfortextdocumentclustering |