Website Hosting Data and Analysis

We have collected a large dataset – more than 21 000 websites – through web-crawling the public resources of the Czech Internet. The proposed method for website hosting detection along with their geographic location and software were applied on the collected data to extend basic statistical informat...

Full description

Bibliographic Details
Main Authors: Petr Ilgner, Dan Komosný, Saeed Ur Rehman
Format: Article
Language:English
Published: Czech Statistical Office 2019-03-01
Series:Statistika: Statistics and Economy Journal
Subjects:
Online Access:https://www.czso.cz/documents/10180/88506450/32019719q1_033.pdf/eb1f0c0d-d789-461c-b0e9-16fee4bdf69e?version=1.0
Description
Summary:We have collected a large dataset – more than 21 000 websites – through web-crawling the public resources of the Czech Internet. The proposed method for website hosting detection along with their geographic location and software were applied on the collected data to extend basic statistical information about the Czech websites published by the national domain registrar CZ.NIC. For analysis, we divided the data into nine categories to show differences between them, for example, between the public and private sector. The procedures used in this paper may also be applied for an extended analysis of websites in other countries, for example, for verification of fulfillment of legal directives to be implemented by public sector.
ISSN:0322-788X
1804-8765