Gendered behavior as a disadvantage in open source software development

Abstract Women are severely marginalized in software development, especially in open source. In this article we argue that disadvantage is more due to gendered behavior than to categorical discrimination: women are at a disadvantage because of what they do, rather than because of who they are. Using...

Full description

Bibliographic Details
Main Authors: Balazs Vedres, Orsolya Vasarhelyi
Format: Article
Language:English
Published: SpringerOpen 2019-07-01
Series:EPJ Data Science
Subjects:
Online Access:http://link.springer.com/article/10.1140/epjds/s13688-019-0202-z
_version_ 1818658593407238144
author Balazs Vedres
Orsolya Vasarhelyi
author_facet Balazs Vedres
Orsolya Vasarhelyi
author_sort Balazs Vedres
collection DOAJ
description Abstract Women are severely marginalized in software development, especially in open source. In this article we argue that disadvantage is more due to gendered behavior than to categorical discrimination: women are at a disadvantage because of what they do, rather than because of who they are. Using data on entire careers of users from GitHub.com, we develop a measure to capture the gendered pattern of behavior: We use a random forest prediction of being female (as opposed to being male) by behavioral choices in the level of activity, specialization in programming languages, and choice of partners. We test differences in success and survival along both categorical gender and the gendered pattern of behavior. We find that 84.5% of women’s disadvantage (compared to men) in success and 34.8% of their disadvantage in survival are due to the female pattern of their behavior. Men are also disadvantaged along their interquartile range of the female pattern of their behavior, and users who don’t reveal their gender suffer an even more drastic disadvantage in survival probability. Moreover, we do not see evidence for any reduction of these inequalities in time. Our findings are robust to noise in gender recognition, and to taking into account particular programming languages, or decision tree classes of gendered behavior. Our results suggest that fighting categorical gender discrimination will have a limited impact on gender inequalities in open source software development, and that gender hiding is not a viable strategy for women.
first_indexed 2024-12-17T03:59:50Z
format Article
id doaj.art-df77f048d96f40669ccc775a5049c81b
institution Directory Open Access Journal
issn 2193-1127
language English
last_indexed 2024-12-17T03:59:50Z
publishDate 2019-07-01
publisher SpringerOpen
record_format Article
series EPJ Data Science
spelling doaj.art-df77f048d96f40669ccc775a5049c81b2022-12-21T22:04:33ZengSpringerOpenEPJ Data Science2193-11272019-07-018111810.1140/epjds/s13688-019-0202-zGendered behavior as a disadvantage in open source software developmentBalazs Vedres0Orsolya Vasarhelyi1Oxford Internet Institute, University of OxfordDepartment of Network and Data Science, Central European UniversityAbstract Women are severely marginalized in software development, especially in open source. In this article we argue that disadvantage is more due to gendered behavior than to categorical discrimination: women are at a disadvantage because of what they do, rather than because of who they are. Using data on entire careers of users from GitHub.com, we develop a measure to capture the gendered pattern of behavior: We use a random forest prediction of being female (as opposed to being male) by behavioral choices in the level of activity, specialization in programming languages, and choice of partners. We test differences in success and survival along both categorical gender and the gendered pattern of behavior. We find that 84.5% of women’s disadvantage (compared to men) in success and 34.8% of their disadvantage in survival are due to the female pattern of their behavior. Men are also disadvantaged along their interquartile range of the female pattern of their behavior, and users who don’t reveal their gender suffer an even more drastic disadvantage in survival probability. Moreover, we do not see evidence for any reduction of these inequalities in time. Our findings are robust to noise in gender recognition, and to taking into account particular programming languages, or decision tree classes of gendered behavior. Our results suggest that fighting categorical gender discrimination will have a limited impact on gender inequalities in open source software development, and that gender hiding is not a viable strategy for women.http://link.springer.com/article/10.1140/epjds/s13688-019-0202-zGender inequalityGendered behaviorSoftware developmentOpen source
spellingShingle Balazs Vedres
Orsolya Vasarhelyi
Gendered behavior as a disadvantage in open source software development
EPJ Data Science
Gender inequality
Gendered behavior
Software development
Open source
title Gendered behavior as a disadvantage in open source software development
title_full Gendered behavior as a disadvantage in open source software development
title_fullStr Gendered behavior as a disadvantage in open source software development
title_full_unstemmed Gendered behavior as a disadvantage in open source software development
title_short Gendered behavior as a disadvantage in open source software development
title_sort gendered behavior as a disadvantage in open source software development
topic Gender inequality
Gendered behavior
Software development
Open source
url http://link.springer.com/article/10.1140/epjds/s13688-019-0202-z
work_keys_str_mv AT balazsvedres genderedbehaviorasadisadvantageinopensourcesoftwaredevelopment
AT orsolyavasarhelyi genderedbehaviorasadisadvantageinopensourcesoftwaredevelopment