Towards abstractive captioning of infographics

Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2018.

Bibliographic Details
Main Author: Landman, Nathan, M. Eng. Massachusetts Institute of Technology
Other Authors: Frédo Durand
Format: Thesis
Language:eng
Published: Massachusetts Institute of Technology 2018
Subjects:
Online Access:http://hdl.handle.net/1721.1/119743
_version_ 1810977369788252160
author Landman, Nathan, M. Eng. Massachusetts Institute of Technology
author2 Frédo Durand
author_facet Frédo Durand
Landman, Nathan, M. Eng. Massachusetts Institute of Technology
author_sort Landman, Nathan, M. Eng. Massachusetts Institute of Technology
collection MIT
description Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2018.
first_indexed 2024-09-23T09:10:05Z
format Thesis
id mit-1721.1/119743
institution Massachusetts Institute of Technology
language eng
last_indexed 2024-09-23T09:10:05Z
publishDate 2018
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/1197432019-04-10T21:08:32Z Towards abstractive captioning of infographics Landman, Nathan, M. Eng. Massachusetts Institute of Technology Frédo Durand Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science. Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science. Electrical Engineering and Computer Science. Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2018. This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections. Cataloged from student-submitted PDF version of thesis. Includes bibliographical references (pages 91-94). Machine understanding of text-based narratives have predominantly focused on documents with rigid hierarchical structures and sequentially ordered inputs. These inputs include documents such as news stories, encyclopedia entries, books, and many others. However, little research has focused on understanding text-based information without this structure. Current text understanding models fail when information is presented in less structured ways, without a clear and pre-defined spatial arrangement of the content. This thesis explores a subset of components required for understanding infographics -- documents whose structure is not necessarily linear and whose content may involve a variety of images. We expand on state-of-the-art methodologies in character recognition and text summarization in order to better understand how to process content without a pre-determined spatial arrangement, and subsequently generate captions for given infographics automatically. To shine light at the reasoning behind the captions being generated, we develop a graphical user interface that helps visualize the portions of a document being used when generating specific parts of a caption. by Nathan Landman. M. Eng. 2018-12-18T19:48:09Z 2018-12-18T19:48:09Z 2018 2018 Thesis http://hdl.handle.net/1721.1/119743 1078689853 eng MIT theses are protected by copyright. They may be viewed, downloaded, or printed from this source but further reproduction or distribution in any format is prohibited without written permission. http://dspace.mit.edu/handle/1721.1/7582 94 pages application/pdf Massachusetts Institute of Technology
spellingShingle Electrical Engineering and Computer Science.
Landman, Nathan, M. Eng. Massachusetts Institute of Technology
Towards abstractive captioning of infographics
title Towards abstractive captioning of infographics
title_full Towards abstractive captioning of infographics
title_fullStr Towards abstractive captioning of infographics
title_full_unstemmed Towards abstractive captioning of infographics
title_short Towards abstractive captioning of infographics
title_sort towards abstractive captioning of infographics
topic Electrical Engineering and Computer Science.
url http://hdl.handle.net/1721.1/119743
work_keys_str_mv AT landmannathanmengmassachusettsinstituteoftechnology towardsabstractivecaptioningofinfographics