Programmed differently? Testing for gender differences in Python programming style and quality on GitHub

Open Access
Authors
Publication date 01-01-2024
Journal Journal of Computer-Mediated Communication
Article number zmad049
Volume | Issue number 29 | 1
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract

The underrepresentation of women in open-source software is frequently attributed to women’s lack of innate aptitude compared to men: natural gender differences in technical ability (Trinkenreich et al., 2021). Approaching code as a form of communication, I conduct a novel empirical study of gender differences in Python programming on GitHub. Based on 1,728 open-source projects, I ask if there is a gender difference in the quality and style of Python code measured in adherence to PEP-8 guidelines. I found significant gender differences in structure and how Python files are organized. While there is gendered variation in programming style, there is no evidence of gender difference in code quality. Using a Random Forest model, I show that the gender of a programmer can be predicted from the style of their Python code. The study concludes that gender differences in Python code are a matter of style, not quality.

Document type Article
Note Publisher Copyright: # The Author(s) 2024. Published by Oxford University Press on behalf of International Communication Association.
Language English
Published at https://doi.org/10.1093/jcmc/zmad049
Other links https://www.scopus.com/pages/publications/85184850664 https://github.com/SianJMBrooke/ProgrammedDifferently https://www.sianbrooke.com/programmed-differently
Downloads
zmad049 (Final published version)
Permalink to this page
Back