Semantic Web and Web Page Clustering Algorithms: A Landscape View

The major evolution of the semantic web has become exchanging data between applications in all domains of activities. Based on this vision, different applications in recent days, e.g. in the fields of community web portals, social networking, e-learning, multimedia retrieval, etc. have been designe...

Full description

Bibliographic Details
Main Authors: Ahmed J. Obaid, Tanusree Chatterjee, Abhishek Bhattacharya
Format: Article
Language:English
Published: European Alliance for Innovation (EAI) 2020-11-01
Series:EAI Endorsed Transactions on Energy Web
Subjects:
Online Access:https://publications.eai.eu/index.php/ew/article/view/803
Description
Summary:The major evolution of the semantic web has become exchanging data between applications in all domains of activities. Based on this vision, different applications in recent days, e.g. in the fields of community web portals, social networking, e-learning, multimedia retrieval, etc. have been designed. Due to growing number of web services, clustering of web resources becomes a valuable tool for semantic web mining. Clustering of internet objects like Internet web pages’ intimate new methods for grouping correlated content for better understanding and satisfies massive user query results in web pages’ search. Hence, web pages clustering algorithms should be able to handle massive irregular content and discover knowledge regardless of the web page complexity. These algorithms vary depending on the characteristics and data types. So, choosing the most appropriate algorithm is not an easy process as it should be accurate in terms of time and space complexity. Therefore, this paper rigorously surveys the most important algorithms of different types used for web page clustering. In addition, a comparative analysis of all such algorithms are provided in terms of several parameters. Finally, a brief discussion is provided on why web page clustering isimportant in emerging era of Semantic Web of Thing (SWoT) applications.
ISSN:2032-944X