SInFo – Structure-Driven Incremental Forum Crawler That Optimizes User-Generated Content Retrieval
In this paper, we present a Structure-driven Incremental Forum crawler (SInFo) that targets the latest content in each crawling cycle. On a Web forum, user-generated content is almost never changed or deleted, but new content is constantly added. There is a wide spectrum of forum technologies that have different...
Main Authors: | Milos Pavkovic, Jelica Protic |
---|---|
Format: | Article |
Language: | English |
Published: | IEEE 2019-01-01 |
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8832156/ |
Similar Items
- Design and Implementation of a Language Specific Crawler to Improve Crawling of Persian Web Documents
  by: Masomeh Azimzadeh, et al.
  Published: (2009-12-01)
- Effective Web Page Crawler
  by: Hilal Hadi Saleh, et al.
  Published: (2011-02-01)
- Bot crawler to retrieve data from Facebook based on the selection of posts and the extraction of user profiles
  by: Ariel Guillermo Sánchez Paipilla, et al.
  Published: (2022-09-01)
- esCorpius-m: A Massive Multilingual Crawling Corpus with a Focus on Spanish
  by: Asier Gutiérrez-Fandiño, et al.
  Published: (2023-11-01)
- Crawling PubMed with web agents for literature search and alerting services
  by: Carlos CARVALHAL, et al.
  Published: (2013-05-01)