Revisiting the Berkeley Admissions data: Statistical Tests for Causal Hypotheses

Open Access
Authors
Publication date 2025
Journal Proceedings of Machine Learning Research
Event 41st Conference on Uncertainty in Artificial Intelligence, UAI 2025
Volume | Issue number 286
Pages (from-to) 271-295
Organisations
  • Faculty of Science (FNWI) - Korteweg-de Vries Institute for Mathematics (KdVI)
Abstract
Reasoning about fairness through correlation-based notions is rife with pitfalls. The 1973 University of California, Berkeley graduate school admissions case from [BickelHO75] is a classic example of one such pitfall, namely Simpson’s paradox. The discrepancy in admission rates among male and female applicants, in the aggregate data over all departments, vanishes when admission rates per department are examined. We reason about the Berkeley graduate school admissions case through a causal lens. In the process, we introduce a statistical test for causal hypothesis testing based on Pearl’s instrumental-variable inequalities [Pearl95]. We compare different causal notions of fairness that are based on graphical, counterfactual and interventional queries on the causal model, and develop statistical tests for these notions that use only observational data. We study the logical relations between notions, and show that while notions may not be equivalent, their corresponding statistical tests coincide for the case at hand. We believe that a thorough case-based causal analysis helps develop a more principled understanding of both causal hypothesis testing and fairness.
Document type Article
Note Proceedings of the Forty-first Conference on Uncertainty in Artificial Intelligence : 21-25 July 2025, Rio Othon Palace, Rio de Janeiro, Brazil
Language English
Published at https://proceedings.mlr.press/v286/bhadane25a.html
Other links https://github.com/SourbhBh/BerkeleyCode
Downloads
bhadane25a (Final published version)
Permalink to this page
Back