Diversity in sociotechnical machine learning systems

There has been a surge of recent interest in sociocultural diversity in machine learning research. Currently, however, there is a gap between discussions of measures and benefits of diversity in machine learning, on the one hand, and the broader research on the underlying concepts of diversity and t...

Full description

Bibliographic Details
Main Authors:	Sina Fazelpour, Maria De-Arteaga
Format:	Article
Language:	English
Published:	SAGE Publishing 2022-01-01
Series:	Big Data & Society
Online Access:	https://doi.org/10.1177/20539517221082027

_version_	1811272791137189888
author	Sina Fazelpour Maria De-Arteaga
author_facet	Sina Fazelpour Maria De-Arteaga
author_sort	Sina Fazelpour
collection	DOAJ
description	There has been a surge of recent interest in sociocultural diversity in machine learning research. Currently, however, there is a gap between discussions of measures and benefits of diversity in machine learning, on the one hand, and the broader research on the underlying concepts of diversity and the precise mechanisms of its functional benefits, on the other. This gap is problematic because diversity is not a monolithic concept. Rather, different concepts of diversity are based on distinct rationales that should inform how we measure diversity in a given context. Similarly, the lack of specificity about the precise mechanisms underpinning diversity’s potential benefits can result in uninformative generalities, invalid experimental designs, and illicit interpretations of findings. In this work, we draw on research in philosophy, psychology, and social and organizational sciences to make three contributions: First, we introduce a taxonomy of different diversity concepts from philosophy of science, and explicate the distinct epistemic and political rationales underlying these concepts. Second, we provide an overview of mechanisms by which diversity can benefit group performance. Third, we situate these taxonomies of concepts and mechanisms in the lifecycle of sociotechnical machine learning systems and make a case for their usefulness in fair and accountable machine learning. We do so by illustrating how they clarify the discourse around diversity in the context of machine learning systems, promote the formulation of more precise research questions about diversity’s impact, and provide conceptual tools to further advance research and practice.
first_indexed	2024-04-12T22:46:38Z
format	Article
id	doaj.art-17190a0a04314fc08ac35aebbac13586
institution	Directory Open Access Journal
issn	2053-9517
language	English
last_indexed	2024-04-12T22:46:38Z
publishDate	2022-01-01
publisher	SAGE Publishing
record_format	Article
series	Big Data & Society
spelling	doaj.art-17190a0a04314fc08ac35aebbac135862022-12-22T03:13:30ZengSAGE PublishingBig Data & Society2053-95172022-01-01910.1177/20539517221082027Diversity in sociotechnical machine learning systemsSina Fazelpour0Maria De-Arteaga1 , USA , USAThere has been a surge of recent interest in sociocultural diversity in machine learning research. Currently, however, there is a gap between discussions of measures and benefits of diversity in machine learning, on the one hand, and the broader research on the underlying concepts of diversity and the precise mechanisms of its functional benefits, on the other. This gap is problematic because diversity is not a monolithic concept. Rather, different concepts of diversity are based on distinct rationales that should inform how we measure diversity in a given context. Similarly, the lack of specificity about the precise mechanisms underpinning diversity’s potential benefits can result in uninformative generalities, invalid experimental designs, and illicit interpretations of findings. In this work, we draw on research in philosophy, psychology, and social and organizational sciences to make three contributions: First, we introduce a taxonomy of different diversity concepts from philosophy of science, and explicate the distinct epistemic and political rationales underlying these concepts. Second, we provide an overview of mechanisms by which diversity can benefit group performance. Third, we situate these taxonomies of concepts and mechanisms in the lifecycle of sociotechnical machine learning systems and make a case for their usefulness in fair and accountable machine learning. We do so by illustrating how they clarify the discourse around diversity in the context of machine learning systems, promote the formulation of more precise research questions about diversity’s impact, and provide conceptual tools to further advance research and practice.https://doi.org/10.1177/20539517221082027
spellingShingle	Sina Fazelpour Maria De-Arteaga Diversity in sociotechnical machine learning systems Big Data & Society
title	Diversity in sociotechnical machine learning systems
title_full	Diversity in sociotechnical machine learning systems
title_fullStr	Diversity in sociotechnical machine learning systems
title_full_unstemmed	Diversity in sociotechnical machine learning systems
title_short	Diversity in sociotechnical machine learning systems
title_sort	diversity in sociotechnical machine learning systems
url	https://doi.org/10.1177/20539517221082027
work_keys_str_mv	AT sinafazelpour diversityinsociotechnicalmachinelearningsystems AT mariadearteaga diversityinsociotechnicalmachinelearningsystems

Diversity in sociotechnical machine learning systems

Similar Items