Using Web Crawler Technology for Text Analysis of Geo-Events: A Case Study of the Huangyan Island Incident

With the social networking and network socialisation have brought more text information and social relationships into our daily lives, the question of whether big data can be fully used to study the phenomenon and discipline of natural sciences has prompted many specialists and scholars to innovate...

Full description

Bibliographic Details
Main Authors: H. Hu, Y. J. Ge
Format: Article
Language:English
Published: Copernicus Publications 2013-11-01
Series:The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Online Access:http://www.int-arch-photogramm-remote-sens-spatial-inf-sci.net/XL-4-W3/71/2013/isprsarchives-XL-4-W3-71-2013.pdf
_version_ 1811322966870327296
author H. Hu
Y. J. Ge
author_facet H. Hu
Y. J. Ge
author_sort H. Hu
collection DOAJ
description With the social networking and network socialisation have brought more text information and social relationships into our daily lives, the question of whether big data can be fully used to study the phenomenon and discipline of natural sciences has prompted many specialists and scholars to innovate their research. Though politics were integrally involved in the hyperlinked word issues since 1990s, automatic assembly of different geospatial web and distributed geospatial information systems utilizing service chaining have explored and built recently, the information collection and data visualisation of geo-events have always faced the bottleneck of traditional manual analysis because of the sensibility, complexity, relativity, timeliness and unexpected characteristics of political events. Based on the framework of Heritrix and the analysis of web-based text, word frequency, sentiment tendency and dissemination path of the Huangyan Island incident is studied here by combining web crawler technology and the text analysis method. The results indicate that tag cloud, frequency map, attitudes pie, individual mention ratios and dissemination flow graph based on the data collection and processing not only highlight the subject and theme vocabularies of related topics but also certain issues and problems behind it. Being able to express the time-space relationship of text information and to disseminate the information regarding geo-events, the text analysis of network information based on focused web crawler technology can be a tool for understanding the formation and diffusion of web-based public opinions in political events.
first_indexed 2024-04-13T13:45:42Z
format Article
id doaj.art-78fca722af92448ea1df3d60e59fd4d6
institution Directory Open Access Journal
issn 1682-1750
2194-9034
language English
last_indexed 2024-04-13T13:45:42Z
publishDate 2013-11-01
publisher Copernicus Publications
record_format Article
series The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
spelling doaj.art-78fca722af92448ea1df3d60e59fd4d62022-12-22T02:44:30ZengCopernicus PublicationsThe International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences1682-17502194-90342013-11-01XL-4/W3717810.5194/isprsarchives-XL-4-W3-71-2013Using Web Crawler Technology for Text Analysis of Geo-Events: A Case Study of the Huangyan Island IncidentH. Hu0Y. J. Ge1School of Geography Beijing Normal University 100875 No. 19 Xin Jie Kou Wai Street, Haidian District, Beijing, P.R. ChinaSchool of Geography Beijing Normal University 100875 No. 19 Xin Jie Kou Wai Street, Haidian District, Beijing, P.R. ChinaWith the social networking and network socialisation have brought more text information and social relationships into our daily lives, the question of whether big data can be fully used to study the phenomenon and discipline of natural sciences has prompted many specialists and scholars to innovate their research. Though politics were integrally involved in the hyperlinked word issues since 1990s, automatic assembly of different geospatial web and distributed geospatial information systems utilizing service chaining have explored and built recently, the information collection and data visualisation of geo-events have always faced the bottleneck of traditional manual analysis because of the sensibility, complexity, relativity, timeliness and unexpected characteristics of political events. Based on the framework of Heritrix and the analysis of web-based text, word frequency, sentiment tendency and dissemination path of the Huangyan Island incident is studied here by combining web crawler technology and the text analysis method. The results indicate that tag cloud, frequency map, attitudes pie, individual mention ratios and dissemination flow graph based on the data collection and processing not only highlight the subject and theme vocabularies of related topics but also certain issues and problems behind it. Being able to express the time-space relationship of text information and to disseminate the information regarding geo-events, the text analysis of network information based on focused web crawler technology can be a tool for understanding the formation and diffusion of web-based public opinions in political events.http://www.int-arch-photogramm-remote-sens-spatial-inf-sci.net/XL-4-W3/71/2013/isprsarchives-XL-4-W3-71-2013.pdf
spellingShingle H. Hu
Y. J. Ge
Using Web Crawler Technology for Text Analysis of Geo-Events: A Case Study of the Huangyan Island Incident
The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
title Using Web Crawler Technology for Text Analysis of Geo-Events: A Case Study of the Huangyan Island Incident
title_full Using Web Crawler Technology for Text Analysis of Geo-Events: A Case Study of the Huangyan Island Incident
title_fullStr Using Web Crawler Technology for Text Analysis of Geo-Events: A Case Study of the Huangyan Island Incident
title_full_unstemmed Using Web Crawler Technology for Text Analysis of Geo-Events: A Case Study of the Huangyan Island Incident
title_short Using Web Crawler Technology for Text Analysis of Geo-Events: A Case Study of the Huangyan Island Incident
title_sort using web crawler technology for text analysis of geo events a case study of the huangyan island incident
url http://www.int-arch-photogramm-remote-sens-spatial-inf-sci.net/XL-4-W3/71/2013/isprsarchives-XL-4-W3-71-2013.pdf
work_keys_str_mv AT hhu usingwebcrawlertechnologyfortextanalysisofgeoeventsacasestudyofthehuangyanislandincident
AT yjge usingwebcrawlertechnologyfortextanalysisofgeoeventsacasestudyofthehuangyanislandincident