Towards abstractive captioning of infographics
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2018.
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis |
Language: | eng |
Published: |
Massachusetts Institute of Technology
2018
|
Subjects: | |
Online Access: | http://hdl.handle.net/1721.1/119743 |
_version_ | 1810977369788252160 |
---|---|
author | Landman, Nathan, M. Eng. Massachusetts Institute of Technology |
author2 | Frédo Durand |
author_facet | Frédo Durand Landman, Nathan, M. Eng. Massachusetts Institute of Technology |
author_sort | Landman, Nathan, M. Eng. Massachusetts Institute of Technology |
collection | MIT |
description | Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2018. |
first_indexed | 2024-09-23T09:10:05Z |
format | Thesis |
id | mit-1721.1/119743 |
institution | Massachusetts Institute of Technology |
language | eng |
last_indexed | 2024-09-23T09:10:05Z |
publishDate | 2018 |
publisher | Massachusetts Institute of Technology |
record_format | dspace |
spelling | mit-1721.1/1197432019-04-10T21:08:32Z Towards abstractive captioning of infographics Landman, Nathan, M. Eng. Massachusetts Institute of Technology Frédo Durand Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science. Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science. Electrical Engineering and Computer Science. Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2018. This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections. Cataloged from student-submitted PDF version of thesis. Includes bibliographical references (pages 91-94). Machine understanding of text-based narratives have predominantly focused on documents with rigid hierarchical structures and sequentially ordered inputs. These inputs include documents such as news stories, encyclopedia entries, books, and many others. However, little research has focused on understanding text-based information without this structure. Current text understanding models fail when information is presented in less structured ways, without a clear and pre-defined spatial arrangement of the content. This thesis explores a subset of components required for understanding infographics -- documents whose structure is not necessarily linear and whose content may involve a variety of images. We expand on state-of-the-art methodologies in character recognition and text summarization in order to better understand how to process content without a pre-determined spatial arrangement, and subsequently generate captions for given infographics automatically. To shine light at the reasoning behind the captions being generated, we develop a graphical user interface that helps visualize the portions of a document being used when generating specific parts of a caption. by Nathan Landman. M. Eng. 2018-12-18T19:48:09Z 2018-12-18T19:48:09Z 2018 2018 Thesis http://hdl.handle.net/1721.1/119743 1078689853 eng MIT theses are protected by copyright. They may be viewed, downloaded, or printed from this source but further reproduction or distribution in any format is prohibited without written permission. http://dspace.mit.edu/handle/1721.1/7582 94 pages application/pdf Massachusetts Institute of Technology |
spellingShingle | Electrical Engineering and Computer Science. Landman, Nathan, M. Eng. Massachusetts Institute of Technology Towards abstractive captioning of infographics |
title | Towards abstractive captioning of infographics |
title_full | Towards abstractive captioning of infographics |
title_fullStr | Towards abstractive captioning of infographics |
title_full_unstemmed | Towards abstractive captioning of infographics |
title_short | Towards abstractive captioning of infographics |
title_sort | towards abstractive captioning of infographics |
topic | Electrical Engineering and Computer Science. |
url | http://hdl.handle.net/1721.1/119743 |
work_keys_str_mv | AT landmannathanmengmassachusettsinstituteoftechnology towardsabstractivecaptioningofinfographics |