Data Extraction using Content-Based Handles

In this paper, we present an approach and a visual tool, called HWrap (Handle Based Wrapper), for creating web wrappers to extract data records from web pages. In our approach, we mainly rely on the visible page content to identify data regions on a web page. In our extraction algorithm, we inspired...

Bibliographic Details
Main Authors: A. Pouramini, S. Khaje Hassani, Sh. Nasiri
Format: Article
Language: English
Published: Shahrood University of Technology 2018-07-01
Series: Journal of Artificial Intelligence and Data Mining
Online Access: http://jad.shahroodut.ac.ir/article_990_3d26710f01637300b6a97ff0bf1441ac.pdf