QURMA: A TABLE EXTRACTION PIPELINE FOR KNOWLEDGE BASE POPULATION

In this paper, we propose a pipeline aimed at automatically extracting tables from heterogeneousWeb sources, such as HTML pages, pdf files and images. Table extraction is one of the activelydeveloping areas of Information Extraction, for which many applications, libraries and frameworksare currently...

Full description

Bibliographic Details
Main Authors: A. B. Nugumanova, K. S. Apayev, Y. M. Baiburin, M. Mansurova, A. G. Ospan
Format: Article
Language:English
Published: Al-Farabi Kazakh National University 2022-06-01
Series:Вестник КазНУ. Серия математика, механика, информатика
Subjects:
Online Access:https://bm.kaznu.kz/index.php/kaznu/article/view/1086/664