Extracting network structure for international and Malaysia website via random walk

World Wide Web is an information retrieval system accessible via the Internet. Since all the web resources and documents are interlinks with hypertext links, it formed a huge and complex information network. Besides information, the web is also a primary tool for commercial, entertainment and connec...

Full description

Bibliographic Details
Main Authors: Liang, Y. S. J., Chan, K. T., Zainuddin, H., M. Shah, N.
Format: Article
Published: Academy of Sciences Malaysia 2019
_version_ 1825951215905669120
author Liang, Y. S. J.
Chan, K. T.
Zainuddin, H.
M. Shah, N.
author_facet Liang, Y. S. J.
Chan, K. T.
Zainuddin, H.
M. Shah, N.
author_sort Liang, Y. S. J.
collection UPM
description World Wide Web is an information retrieval system accessible via the Internet. Since all the web resources and documents are interlinks with hypertext links, it formed a huge and complex information network. Besides information, the web is also a primary tool for commercial, entertainment and connecting people around the world. Hence, studying its network topology will give us a better understanding of the sociology of content on the web as well as the possibility of predicting new emerging phenomena. In this paper, we construct networks by using random walk process that traverses the web at two popular websites, namely google.com (global) and mudah.my (local). We perform measurement such as degree distribution, diameter and average path length on the networks to determine various structural properties. We also analyse the network at the domain level to identify some top-level domains appearing in both networks in order to understand the connectivity of the web in different regions. Using centrality analysis, we also reveal some important and popular websites and domain from the networks.
first_indexed 2024-03-06T10:26:36Z
format Article
id upm.eprints-79835
institution Universiti Putra Malaysia
last_indexed 2024-03-06T10:26:36Z
publishDate 2019
publisher Academy of Sciences Malaysia
record_format dspace
spelling upm.eprints-798352022-11-14T03:08:00Z http://psasir.upm.edu.my/id/eprint/79835/ Extracting network structure for international and Malaysia website via random walk Liang, Y. S. J. Chan, K. T. Zainuddin, H. M. Shah, N. World Wide Web is an information retrieval system accessible via the Internet. Since all the web resources and documents are interlinks with hypertext links, it formed a huge and complex information network. Besides information, the web is also a primary tool for commercial, entertainment and connecting people around the world. Hence, studying its network topology will give us a better understanding of the sociology of content on the web as well as the possibility of predicting new emerging phenomena. In this paper, we construct networks by using random walk process that traverses the web at two popular websites, namely google.com (global) and mudah.my (local). We perform measurement such as degree distribution, diameter and average path length on the networks to determine various structural properties. We also analyse the network at the domain level to identify some top-level domains appearing in both networks in order to understand the connectivity of the web in different regions. Using centrality analysis, we also reveal some important and popular websites and domain from the networks. Academy of Sciences Malaysia 2019 Article PeerReviewed Liang, Y. S. J. and Chan, K. T. and Zainuddin, H. and M. Shah, N. (2019) Extracting network structure for international and Malaysia website via random walk. ASM Science Journal, 12. pp. 1-10. ISSN 1823-6782; ESSN: 2682-8901 https://www.akademisains.gov.my/asmsj/asm-sc-j-12-2019/
spellingShingle Liang, Y. S. J.
Chan, K. T.
Zainuddin, H.
M. Shah, N.
Extracting network structure for international and Malaysia website via random walk
title Extracting network structure for international and Malaysia website via random walk
title_full Extracting network structure for international and Malaysia website via random walk
title_fullStr Extracting network structure for international and Malaysia website via random walk
title_full_unstemmed Extracting network structure for international and Malaysia website via random walk
title_short Extracting network structure for international and Malaysia website via random walk
title_sort extracting network structure for international and malaysia website via random walk
work_keys_str_mv AT liangysj extractingnetworkstructureforinternationalandmalaysiawebsiteviarandomwalk
AT chankt extractingnetworkstructureforinternationalandmalaysiawebsiteviarandomwalk
AT zainuddinh extractingnetworkstructureforinternationalandmalaysiawebsiteviarandomwalk
AT mshahn extractingnetworkstructureforinternationalandmalaysiawebsiteviarandomwalk