Automated Pipelines for Information Extraction from Semi-Structured Documents in Structured Format

As documents are one of the main tools for storing and communicating information, there have been a large amount of eff orts towards developing methods to parse information from them automatically. While many parts of this industry are automated, there are still scenarios where certain types of docu...

Full description

Bibliographic Details
Main Author: Chu, Jung Soo
Other Authors: Gupta, Amar
Format: Thesis
Published: Massachusetts Institute of Technology 2023
Online Access:https://hdl.handle.net/1721.1/151614