Fairness-Aware Instrumentation of Preprocessing Pipelines for Machine Learning

K. Yang; B. Huang; J. Stoyanovich; S. Schelter

Fairness-Aware Instrumentation of Preprocessing Pipelines for Machine Learning

Authors	K. Yang B. Huang J. Stoyanovich S. Schelter
Publication date	2020
Book title	HILDA 2020
Book subtitle	Workshop on Human-In-the-Loop Data Analytics : co-located with SIGMOD 2020 : 19 June 2020, Portland, OR, USA
Event	Workshop on Human-In-the-Loop Data Analytics
Article number	9
Number of pages	4
Publisher	HILDA
Organisations	Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract	Surfacing and mitigating bias in ML pipelines is a complex topic, with a dire need to provide system-level support to data scientists. Humans should be empowered to debug these pipelines, in order to control for bias and to improve data quality and representativeness. We propose fair-DAGs, an open-source library that extracts directed acyclic graph (DAG) representations of the data flow in preprocessing pipelines for ML. The library subsequently instruments the pipelines with tracing and visualization code to capture changes in data distributions and identify distortions with respect to protected group membership as the data travels through the pipeline. We illustrate the utility of fair-DAGs with experiments on publicly available ML pipelines.
Document type	Conference contribution
Language	English
Published at	https://ssc.io/publication/fairness-aware-instrumentation-of-preprocessing-pipelines-forml-hilda20/ (Accepted author manuscript) https://hilda.io/2020/proceedings/HILDA2020_paper9.pdf (Final published version)
Other links	https://hilda.io/2020/
Downloads	HILDA2020_paper9 (Final published version)
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

Fairness-Aware Instrumentation of Preprocessing Pipelines for Machine Learning