The data science machine : emulating human intelligence in data science endeavors
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2015.
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis |
Language: | eng |
Published: |
Massachusetts Institute of Technology
2017
|
Subjects: | |
Online Access: | http://hdl.handle.net/1721.1/107031 |
_version_ | 1826209688869404672 |
---|---|
author | Kanter, Max (James Max) |
author2 | Kalyan Veeramachaneni. |
author_facet | Kalyan Veeramachaneni. Kanter, Max (James Max) |
author_sort | Kanter, Max (James Max) |
collection | MIT |
description | Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2015. |
first_indexed | 2024-09-23T14:27:15Z |
format | Thesis |
id | mit-1721.1/107031 |
institution | Massachusetts Institute of Technology |
language | eng |
last_indexed | 2024-09-23T14:27:15Z |
publishDate | 2017 |
publisher | Massachusetts Institute of Technology |
record_format | dspace |
spelling | mit-1721.1/1070312019-04-11T06:07:24Z The data science machine : emulating human intelligence in data science endeavors Kanter, Max (James Max) Kalyan Veeramachaneni. Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science. Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science. Electrical Engineering and Computer Science. Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2015. This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections. Cataloged from student-submitted PDF version of thesis. Includes bibliographical references (pages 87-88). Data scientists are responsible for many tasks in the data analysis process including formulating the question, generating features, building a model, and disseminating the results. The Data Science Machine is a automated system that emulates a human data scientist's ability to generate predictive models from raw data. In this thesis, we propose the Deep Feature Synthesis algorithm for automatically generating features for relational datasets. We implement this algorithm and test it on 3 data science competitions that have participation from nearly 1000 data science enthusiasts. In 2 of the 3 competitions we beat a majority of competitors, and in the third, we achieve 94% of the best competitor's score. Finally, we take steps towards incorporating the Data Science Machine into the data science process by implementing and evaluating an interface for users to interact with the Data Science Machine. by Max Kanter M. Eng. 2017-02-22T15:59:47Z 2017-02-22T15:59:47Z 2015 2015 Thesis http://hdl.handle.net/1721.1/107031 971493308 eng MIT theses are protected by copyright. They may be viewed, downloaded, or printed from this source but further reproduction or distribution in any format is prohibited without written permission. http://dspace.mit.edu/handle/1721.1/7582 88 pages application/pdf Massachusetts Institute of Technology |
spellingShingle | Electrical Engineering and Computer Science. Kanter, Max (James Max) The data science machine : emulating human intelligence in data science endeavors |
title | The data science machine : emulating human intelligence in data science endeavors |
title_full | The data science machine : emulating human intelligence in data science endeavors |
title_fullStr | The data science machine : emulating human intelligence in data science endeavors |
title_full_unstemmed | The data science machine : emulating human intelligence in data science endeavors |
title_short | The data science machine : emulating human intelligence in data science endeavors |
title_sort | data science machine emulating human intelligence in data science endeavors |
topic | Electrical Engineering and Computer Science. |
url | http://hdl.handle.net/1721.1/107031 |
work_keys_str_mv | AT kantermaxjamesmax thedatasciencemachineemulatinghumanintelligenceindatascienceendeavors AT kantermaxjamesmax datasciencemachineemulatinghumanintelligenceindatascienceendeavors |