QURMA: A TABLE EXTRACTION PIPELINE FOR KNOWLEDGE BASE POPULATION
In this paper, we propose a pipeline aimed at automatically extracting tables from heterogeneousWeb sources, such as HTML pages, pdf files and images. Table extraction is one of the activelydeveloping areas of Information Extraction, for which many applications, libraries and frameworksare currently...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Al-Farabi Kazakh National University
2022-06-01
|
Series: | Вестник КазНУ. Серия математика, механика, информатика |
Subjects: | |
Online Access: | https://bm.kaznu.kz/index.php/kaznu/article/view/1086/664 |