WEB CONTENT EXTRACTION USING HYBRID APPROACH

The World Wide Web has rich source of voluminous and heterogeneous information which continues to expand in size and complexity. Many Web pages are unstructured and semi-structured, so it consists of noisy information like advertisement, links, headers, footers etc. This noisy information makes extr...

Full description

Bibliographic Details
Main Authors: K. Nethra, J. Anitha, G. Thilagavathi
Format: Article
Language:English
Published: ICT Academy of Tamil Nadu 2014-01-01
Series:ICTACT Journal on Soft Computing
Subjects:
Online Access:http://ictactjournals.in/paper/3_Paper_692_696.pdf