Fairness-Aware Instrumentation of Preprocessing Pipelines for Machine Learning

Open Access
Authors
Publication date 2020
Book title HILDA 2020
Book subtitle Workshop on Human-In-the-Loop Data Analytics : co-located with SIGMOD 2020 : 19 June 2020, Portland, OR, USA
Event Workshop on Human-In-the-Loop Data Analytics
Article number 9
Number of pages 4
Publisher HILDA
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
Surfacing and mitigating bias in ML pipelines is complex, and data scientists urgently need system-level support for it. Humans should be empowered to debug these pipelines in order to control for bias and to improve data quality and representativeness. We propose fair-DAGs, an open-source library that extracts directed acyclic graph (DAG) representations of the data flow in preprocessing pipelines for ML. The library then instruments the pipelines with tracing and visualization code that captures changes in data distributions and identifies distortions with respect to protected group membership as the data travels through the pipeline. We illustrate the utility of fair-DAGs with experiments on publicly available ML pipelines.
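The idea in the abstract, recording how the share of each protected group shifts after every preprocessing step, can be illustrated with a minimal sketch. This is not the fair-DAGs API: the function names (group_shares, run_instrumented), the toy records, and the two filter steps are all hypothetical, and the pipeline is a simple linear chain (the degenerate case of a DAG) written with the Python standard library only.

```python
from collections import Counter


def group_shares(rows, attribute):
    """Return the fraction of rows that fall into each value of `attribute`."""
    counts = Counter(row[attribute] for row in rows)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {group: count / total for group, count in counts.items()}


def run_instrumented(rows, steps, attribute):
    """Run (name, function) steps in order, tracing group shares after each one."""
    trace = [("input", group_shares(rows, attribute))]
    for name, step in steps:
        rows = step(rows)
        trace.append((name, group_shares(rows, attribute)))
    return rows, trace


# Hypothetical toy data: "group" stands in for a protected attribute.
records = [
    {"age": 25, "income": 30000, "group": "a"},
    {"age": 40, "income": None, "group": "b"},
    {"age": 35, "income": 52000, "group": "a"},
    {"age": 51, "income": None, "group": "b"},
    {"age": 29, "income": 41000, "group": "b"},
    {"age": 62, "income": 58000, "group": "a"},
]


def drop_missing_income(rows):
    # Hypothetical step: discard rows with a missing income value.
    return [r for r in rows if r["income"] is not None]


def keep_under_60(rows):
    # Hypothetical step: keep only rows with age below 60.
    return [r for r in rows if r["age"] < 60]


steps = [
    ("drop_missing_income", drop_missing_income),
    ("keep_under_60", keep_under_60),
]

result, trace = run_instrumented(records, steps, "group")
for name, shares in trace:
    print(name, shares)
```

In this toy run, dropping rows with a missing income moves group "b" from half of the data to a quarter of it, which is the kind of distortion that per-step tracing is meant to surface.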
Document type Conference contribution
Language English
Published at https://ssc.io/publication/fairness-aware-instrumentation-of-preprocessing-pipelines-forml-hilda20/ https://hilda.io/2020/proceedings/HILDA2020_paper9.pdf
Other links https://hilda.io/2020/
Downloads
HILDA2020_paper9 (Final published version)