Visual datasets for artificial intelligence agents
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2018.
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis |
Language: | eng |
Published: |
Massachusetts Institute of Technology
2018
|
Subjects: | |
Online Access: | http://hdl.handle.net/1721.1/119553 |
_version_ | 1826204646245400576 |
---|---|
author | Hilton, Erwin |
author2 | Tomaso Poggio. |
author_facet | Tomaso Poggio. Hilton, Erwin |
author_sort | Hilton, Erwin |
collection | MIT |
description | Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2018. |
first_indexed | 2024-09-23T12:58:45Z |
format | Thesis |
id | mit-1721.1/119553 |
institution | Massachusetts Institute of Technology |
language | eng |
last_indexed | 2024-09-23T12:58:45Z |
publishDate | 2018 |
publisher | Massachusetts Institute of Technology |
record_format | dspace |
spelling | mit-1721.1/1195532019-04-10T22:19:22Z Visual datasets for artificial intelligence agents Hilton, Erwin Tomaso Poggio. Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science. Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science. Electrical Engineering and Computer Science. Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2018. This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections. Cataloged from PDF version of thesis. Includes bibliographical references (page 41). In this thesis, I designed and implemented two visual dataset generation tool frameworks. With these tools, I introduce procedurally generated new data to test VQA agents and other visual Al models on. The first tool is Spatial IQ Generative Dataset (SIQGD). This tool generates images based on the Raven's Progressive Matrices spatial IQ examination metric. The second tool is a collection of 3D models along with a Blender3D extension that renders images of the models from multiple viewpoints along with their depth maps. by Erwin Hilton. M. Eng. 2018-12-11T20:39:52Z 2018-12-11T20:39:52Z 2018 2018 Thesis http://hdl.handle.net/1721.1/119553 1076273503 eng MIT theses are protected by copyright. They may be viewed, downloaded, or printed from this source but further reproduction or distribution in any format is prohibited without written permission. http://dspace.mit.edu/handle/1721.1/7582 41 pages application/pdf Massachusetts Institute of Technology |
spellingShingle | Electrical Engineering and Computer Science. Hilton, Erwin Visual datasets for artificial intelligence agents |
title | Visual datasets for artificial intelligence agents |
title_full | Visual datasets for artificial intelligence agents |
title_fullStr | Visual datasets for artificial intelligence agents |
title_full_unstemmed | Visual datasets for artificial intelligence agents |
title_short | Visual datasets for artificial intelligence agents |
title_sort | visual datasets for artificial intelligence agents |
topic | Electrical Engineering and Computer Science. |
url | http://hdl.handle.net/1721.1/119553 |
work_keys_str_mv | AT hiltonerwin visualdatasetsforartificialintelligenceagents |