How Open Source Machine Learning Software Shapes AI

If we want a future where AI serves a plurality of interests, then we should pay attention to the factors that drive its success. While others have studied the importance of data, hardware, and models in directing the trajectory of AI, I argue that open source software is a neglected factor shaping...

Повний опис

Бібліографічні деталі
Автор: Langenkamp, Maximillian
Інші автори: Hadfield-Menell, Dylan
Формат: Дисертація
Опубліковано: Massachusetts Institute of Technology 2022
Онлайн доступ:https://hdl.handle.net/1721.1/145076
_version_ 1826217870646837248
author Langenkamp, Maximillian
author2 Hadfield-Menell, Dylan
author_facet Hadfield-Menell, Dylan
Langenkamp, Maximillian
author_sort Langenkamp, Maximillian
collection MIT
description If we want a future where AI serves a plurality of interests, then we should pay attention to the factors that drive its success. While others have studied the importance of data, hardware, and models in directing the trajectory of AI, I argue that open source software is a neglected factor shaping AI as a discipline. I start with the observation that almost all AI research and applications are built on machine learning open source software (MLOSS). This thesis presents four contributions. First, it quantifies the outsized impact of MLOSS by using Github contributions data. By contrasting the costs of MLOSS and its economic benefits, I find that the average dollar of MLOSS investment corresponds to at least $100 of global economic value created, corresponding to $30B of economic value created this year. Second, I leverage interviews with AI researchers and developers to develop a causal model of the effect of open sourcing on economic value. I argue that open sourcing creates value through three primary mechanisms: standardization of MLOSS tools, increased experimentation in AI research, and creation of commuities. Third, I analyze the various incentives behind MLOSS by examining three key factors: business strategy, sociotechnical factors, and ideological motivations. In the last section, I explore how MLOSS may help us understand the future of AI and make a number of probabilistic predictions. I intend this thesis to be useful for technologists and academics who want to analyze and critique AI, and policymakers who want to better understand and regulate AI systems.
first_indexed 2024-09-23T17:10:10Z
format Thesis
id mit-1721.1/145076
institution Massachusetts Institute of Technology
last_indexed 2024-09-23T17:10:10Z
publishDate 2022
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/1450762022-08-30T03:01:03Z How Open Source Machine Learning Software Shapes AI Langenkamp, Maximillian Hadfield-Menell, Dylan Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science If we want a future where AI serves a plurality of interests, then we should pay attention to the factors that drive its success. While others have studied the importance of data, hardware, and models in directing the trajectory of AI, I argue that open source software is a neglected factor shaping AI as a discipline. I start with the observation that almost all AI research and applications are built on machine learning open source software (MLOSS). This thesis presents four contributions. First, it quantifies the outsized impact of MLOSS by using Github contributions data. By contrasting the costs of MLOSS and its economic benefits, I find that the average dollar of MLOSS investment corresponds to at least $100 of global economic value created, corresponding to $30B of economic value created this year. Second, I leverage interviews with AI researchers and developers to develop a causal model of the effect of open sourcing on economic value. I argue that open sourcing creates value through three primary mechanisms: standardization of MLOSS tools, increased experimentation in AI research, and creation of commuities. Third, I analyze the various incentives behind MLOSS by examining three key factors: business strategy, sociotechnical factors, and ideological motivations. In the last section, I explore how MLOSS may help us understand the future of AI and make a number of probabilistic predictions. I intend this thesis to be useful for technologists and academics who want to analyze and critique AI, and policymakers who want to better understand and regulate AI systems. M.Eng. 2022-08-29T16:31:18Z 2022-08-29T16:31:18Z 2022-05 2022-05-27T16:19:20.659Z Thesis https://hdl.handle.net/1721.1/145076 In Copyright - Educational Use Permitted Copyright retained by author(s) https://rightsstatements.org/page/InC-EDU/1.0/ application/pdf Massachusetts Institute of Technology
spellingShingle Langenkamp, Maximillian
How Open Source Machine Learning Software Shapes AI
title How Open Source Machine Learning Software Shapes AI
title_full How Open Source Machine Learning Software Shapes AI
title_fullStr How Open Source Machine Learning Software Shapes AI
title_full_unstemmed How Open Source Machine Learning Software Shapes AI
title_short How Open Source Machine Learning Software Shapes AI
title_sort how open source machine learning software shapes ai
url https://hdl.handle.net/1721.1/145076
work_keys_str_mv AT langenkampmaximillian howopensourcemachinelearningsoftwareshapesai