Learning to Validate the Predictions of Black Box Classifiers on Unseen Data

Authors
Publication date 2020
Book title SIGMOD '20
Book subtitle proceedings of the 2020 ACM SIGMOD International Conference on Management of Data : June 14-19, 2020, Portland, OR, USA
ISBN (electronic)
  • 9781450367356
Event 2020 ACM SIGMOD International Conference on Management of Data, SIGMOD 2020
Pages (from-to) 1289-1299
Number of pages 11
Publisher New York, NY: Association for Computing Machinery
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract

Machine Learning (ML) models are difficult to maintain in production settings. In particular, deviations of the unseen serving data (for which we want to compute predictions) from the source data (on which the model was trained) pose a central challenge, especially when model training and prediction are outsourced via cloud services. Errors or shifts in the serving data can affect the predictive quality of a model, but are hard to detect for engineers operating ML deployments. We propose a simple approach to automate the validation of deployed ML models by estimating the model's predictive performance on unseen, unlabeled serving data. In contrast to existing work, we do not require explicit distributional assumptions on the dataset shift between the source and serving data. Instead, we rely on a programmatic specification of typical cases of dataset shift and data errors. We use this information to learn a performance predictor for a pretrained black box model that automatically raises alarms when it detects performance drops on unseen serving data. We experimentally evaluate our approach on various datasets, models and error types. We find that it reliably predicts the performance of black box models in the majority of cases, and outperforms several baselines even in the presence of unspecified data errors.

Document type Conference contribution
Note With supplemental video
Language English
Published at https://doi.org/10.1145/3318464.3380604
Other links https://www.scopus.com/pages/publications/85103484526
Permalink to this page
Back