Extracting network structure for international and Malaysia website via random walk
World Wide Web is an information retrieval system accessible via the Internet. Since all the web resources and documents are interlinks with hypertext links, it formed a huge and complex information network. Besides information, the web is also a primary tool for commercial, entertainment and connec...
Main Authors: | , , , |
---|---|
Format: | Article |
Published: |
Academy of Sciences Malaysia
2019
|
_version_ | 1825951215905669120 |
---|---|
author | Liang, Y. S. J. Chan, K. T. Zainuddin, H. M. Shah, N. |
author_facet | Liang, Y. S. J. Chan, K. T. Zainuddin, H. M. Shah, N. |
author_sort | Liang, Y. S. J. |
collection | UPM |
description | World Wide Web is an information retrieval system accessible via the Internet. Since all the web resources and documents are interlinks with hypertext links, it formed a huge and complex information network. Besides information, the web is also a primary tool for commercial, entertainment and connecting people around the world. Hence, studying its network topology will give us a better understanding of the sociology of content on the web as well as the possibility of predicting new emerging phenomena. In this paper, we construct networks by using random walk process that traverses the web at two popular websites, namely google.com (global) and mudah.my (local). We perform measurement such as degree distribution, diameter and average path length on the networks to determine various structural properties. We also analyse the network at the domain level to identify some top-level domains appearing in both networks in order to understand the connectivity of the web in different regions. Using centrality analysis, we also reveal some important and popular websites and domain from the networks. |
first_indexed | 2024-03-06T10:26:36Z |
format | Article |
id | upm.eprints-79835 |
institution | Universiti Putra Malaysia |
last_indexed | 2024-03-06T10:26:36Z |
publishDate | 2019 |
publisher | Academy of Sciences Malaysia |
record_format | dspace |
spelling | upm.eprints-798352022-11-14T03:08:00Z http://psasir.upm.edu.my/id/eprint/79835/ Extracting network structure for international and Malaysia website via random walk Liang, Y. S. J. Chan, K. T. Zainuddin, H. M. Shah, N. World Wide Web is an information retrieval system accessible via the Internet. Since all the web resources and documents are interlinks with hypertext links, it formed a huge and complex information network. Besides information, the web is also a primary tool for commercial, entertainment and connecting people around the world. Hence, studying its network topology will give us a better understanding of the sociology of content on the web as well as the possibility of predicting new emerging phenomena. In this paper, we construct networks by using random walk process that traverses the web at two popular websites, namely google.com (global) and mudah.my (local). We perform measurement such as degree distribution, diameter and average path length on the networks to determine various structural properties. We also analyse the network at the domain level to identify some top-level domains appearing in both networks in order to understand the connectivity of the web in different regions. Using centrality analysis, we also reveal some important and popular websites and domain from the networks. Academy of Sciences Malaysia 2019 Article PeerReviewed Liang, Y. S. J. and Chan, K. T. and Zainuddin, H. and M. Shah, N. (2019) Extracting network structure for international and Malaysia website via random walk. ASM Science Journal, 12. pp. 1-10. ISSN 1823-6782; ESSN: 2682-8901 https://www.akademisains.gov.my/asmsj/asm-sc-j-12-2019/ |
spellingShingle | Liang, Y. S. J. Chan, K. T. Zainuddin, H. M. Shah, N. Extracting network structure for international and Malaysia website via random walk |
title | Extracting network structure for international and Malaysia website via random walk |
title_full | Extracting network structure for international and Malaysia website via random walk |
title_fullStr | Extracting network structure for international and Malaysia website via random walk |
title_full_unstemmed | Extracting network structure for international and Malaysia website via random walk |
title_short | Extracting network structure for international and Malaysia website via random walk |
title_sort | extracting network structure for international and malaysia website via random walk |
work_keys_str_mv | AT liangysj extractingnetworkstructureforinternationalandmalaysiawebsiteviarandomwalk AT chankt extractingnetworkstructureforinternationalandmalaysiawebsiteviarandomwalk AT zainuddinh extractingnetworkstructureforinternationalandmalaysiawebsiteviarandomwalk AT mshahn extractingnetworkstructureforinternationalandmalaysiawebsiteviarandomwalk |