Diversity in sociotechnical machine learning systems

There has been a surge of recent interest in sociocultural diversity in machine learning research. Currently, however, there is a gap between discussions of measures and benefits of diversity in machine learning, on the one hand, and the broader research on the underlying concepts of diversity and t...

Full description

Bibliographic Details
Main Authors: Sina Fazelpour, Maria De-Arteaga
Format: Article
Language:English
Published: SAGE Publishing 2022-01-01
Series:Big Data & Society
Online Access:https://doi.org/10.1177/20539517221082027
_version_ 1811272791137189888
author Sina Fazelpour
Maria De-Arteaga
author_facet Sina Fazelpour
Maria De-Arteaga
author_sort Sina Fazelpour
collection DOAJ
description There has been a surge of recent interest in sociocultural diversity in machine learning research. Currently, however, there is a gap between discussions of measures and benefits of diversity in machine learning, on the one hand, and the broader research on the underlying concepts of diversity and the precise mechanisms of its functional benefits, on the other. This gap is problematic because diversity is not a monolithic concept. Rather, different concepts of diversity are based on distinct rationales that should inform how we measure diversity in a given context. Similarly, the lack of specificity about the precise mechanisms underpinning diversity’s potential benefits can result in uninformative generalities, invalid experimental designs, and illicit interpretations of findings. In this work, we draw on research in philosophy, psychology, and social and organizational sciences to make three contributions: First, we introduce a taxonomy of different diversity concepts from philosophy of science, and explicate the distinct epistemic and political rationales underlying these concepts. Second, we provide an overview of mechanisms by which diversity can benefit group performance. Third, we situate these taxonomies of concepts and mechanisms in the lifecycle of sociotechnical machine learning systems and make a case for their usefulness in fair and accountable machine learning. We do so by illustrating how they clarify the discourse around diversity in the context of machine learning systems, promote the formulation of more precise research questions about diversity’s impact, and provide conceptual tools to further advance research and practice.
first_indexed 2024-04-12T22:46:38Z
format Article
id doaj.art-17190a0a04314fc08ac35aebbac13586
institution Directory Open Access Journal
issn 2053-9517
language English
last_indexed 2024-04-12T22:46:38Z
publishDate 2022-01-01
publisher SAGE Publishing
record_format Article
series Big Data & Society
spelling doaj.art-17190a0a04314fc08ac35aebbac135862022-12-22T03:13:30ZengSAGE PublishingBig Data & Society2053-95172022-01-01910.1177/20539517221082027Diversity in sociotechnical machine learning systemsSina Fazelpour0Maria De-Arteaga1 , USA , USAThere has been a surge of recent interest in sociocultural diversity in machine learning research. Currently, however, there is a gap between discussions of measures and benefits of diversity in machine learning, on the one hand, and the broader research on the underlying concepts of diversity and the precise mechanisms of its functional benefits, on the other. This gap is problematic because diversity is not a monolithic concept. Rather, different concepts of diversity are based on distinct rationales that should inform how we measure diversity in a given context. Similarly, the lack of specificity about the precise mechanisms underpinning diversity’s potential benefits can result in uninformative generalities, invalid experimental designs, and illicit interpretations of findings. In this work, we draw on research in philosophy, psychology, and social and organizational sciences to make three contributions: First, we introduce a taxonomy of different diversity concepts from philosophy of science, and explicate the distinct epistemic and political rationales underlying these concepts. Second, we provide an overview of mechanisms by which diversity can benefit group performance. Third, we situate these taxonomies of concepts and mechanisms in the lifecycle of sociotechnical machine learning systems and make a case for their usefulness in fair and accountable machine learning. We do so by illustrating how they clarify the discourse around diversity in the context of machine learning systems, promote the formulation of more precise research questions about diversity’s impact, and provide conceptual tools to further advance research and practice.https://doi.org/10.1177/20539517221082027
spellingShingle Sina Fazelpour
Maria De-Arteaga
Diversity in sociotechnical machine learning systems
Big Data & Society
title Diversity in sociotechnical machine learning systems
title_full Diversity in sociotechnical machine learning systems
title_fullStr Diversity in sociotechnical machine learning systems
title_full_unstemmed Diversity in sociotechnical machine learning systems
title_short Diversity in sociotechnical machine learning systems
title_sort diversity in sociotechnical machine learning systems
url https://doi.org/10.1177/20539517221082027
work_keys_str_mv AT sinafazelpour diversityinsociotechnicalmachinelearningsystems
AT mariadearteaga diversityinsociotechnicalmachinelearningsystems