Fairness-Aware Instrumentation of Preprocessing Pipelines for Machine Learning
| Authors |
|
|---|---|
| Publication date | 2020 |
| Book title | HILDA 2020 |
| Book subtitle | Workshop on Human-In-the-Loop Data Analytics : co-located with SIGMOD 2020 : 19 June 2020, Portland, OR, USA |
| Event | Workshop on Human-In-the-Loop Data Analytics |
| Article number | 9 |
| Number of pages | 4 |
| Publisher | HILDA |
| Organisations |
|
| Abstract |
Surfacing and mitigating bias in ML pipelines is a complex topic, with a dire need to provide system-level support to data scientists. Humans should be empowered to debug these pipelines, in order to control for bias and to improve data quality and representativeness. We propose fair-DAGs, an open-source library that extracts directed acyclic graph (DAG) representations of the data flow in preprocessing pipelines for ML. The library subsequently instruments the pipelines with tracing and visualization code to capture changes in data distributions and identify distortions with respect to protected group membership as the data travels through the pipeline. We illustrate the utility of fair-DAGs with experiments on publicly available ML pipelines.
|
| Document type | Conference contribution |
| Language | English |
| Published at | https://ssc.io/publication/fairness-aware-instrumentation-of-preprocessing-pipelines-forml-hilda20/ https://hilda.io/2020/proceedings/HILDA2020_paper9.pdf |
| Other links | https://hilda.io/2020/ |
| Downloads |
HILDA2020_paper9
(Final published version)
|
| Permalink to this page | |
