Multi-Purpose Dataset of Webpages and Its Content Blocks: Design and Structure Validation

The need for automated data extraction is continuously growing due to the constant addition of information to the worldwide web. Researchers are developing new data extraction methods to achieve increased performance compared to existing methods. Comparing algorithms to evaluate their performance is...

Full description

Bibliographic Details
Main Authors: Kiril Griazev, Simona Ramanauskaitė
Format: Article
Language:English
Published: MDPI AG 2021-04-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/11/8/3319