Learning to Validate the Predictions of Black Box Classifiers on Unseen Data
| Authors | |
|---|---|
| Publication date | 2020 |
| Book title | SIGMOD '20 |
| Book subtitle | proceedings of the 2020 ACM SIGMOD International Conference on Management of Data : June 14-19, 2020, Portland, OR, USA |
| ISBN (electronic) | |
| Event | 2020 ACM SIGMOD International Conference on Management of Data, SIGMOD 2020 |
| Pages (from-to) | 1289-1299 |
| Number of pages | 11 |
| Publisher | New York, NY: Association for Computing Machinery |
| Organisations | |
| Abstract | Machine Learning (ML) models are difficult to maintain in production settings. In particular, deviations of the unseen serving data (for which we want to compute predictions) from the source data (on which the model was trained) pose a central challenge, especially when model training and prediction are outsourced via cloud services. Errors or shifts in the serving data can affect the predictive quality of a model, but are hard to detect for engineers operating ML deployments. We propose a simple approach to automate the validation of deployed ML models by estimating the model's predictive performance on unseen, unlabeled serving data. In contrast to existing work, we do not require explicit distributional assumptions on the dataset shift between the source and serving data. Instead, we rely on a programmatic specification of typical cases of dataset shift and data errors. We use this information to learn a performance predictor for a pretrained black box model that automatically raises alarms when it detects performance drops on unseen serving data. We experimentally evaluate our approach on various datasets, models and error types. We find that it reliably predicts the performance of black box models in the majority of cases, and outperforms several baselines even in the presence of unspecified data errors. |
| Document type | Conference contribution |
| Note | With supplemental video |
| Language | English |
| Published at | https://doi.org/10.1145/3318464.3380604 |
| Other links | https://www.scopus.com/pages/publications/85103484526 |
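
The idea summarised in the abstract (corrupt held-out labeled data according to a programmatic error specification, then learn to predict the black box model's performance from its outputs alone) can be illustrated with a minimal sketch. This is not the authors' implementation. The toy model `black_box_proba`, the error generator `corrupt`, the single mean-confidence feature, the hand-written least-squares fit and the alarm threshold of 0.05 are all assumptions chosen to keep the example small and dependency-free.

```python
import math
import random


def black_box_proba(x):
    """Hypothetical stand-in for a pretrained black box: P(y=1 | x) for a scalar feature."""
    return 1.0 / (1.0 + math.exp(-2.0 * x))


def sample_labeled(n, rng):
    """Toy labeled data: class 1 is centered at +1, class 0 at -1."""
    data = []
    for _ in range(n):
        y = rng.randint(0, 1)
        data.append((rng.gauss(1.0 if y == 1 else -1.0, 1.0), y))
    return data


def corrupt(data, fraction, rng):
    """Assumed error specification: each feature value is lost with
    probability `fraction` and replaced by the default value 0.0."""
    return [((0.0 if rng.random() < fraction else x), y) for x, y in data]


def mean_confidence(xs):
    """Label-free feature: the model's average confidence in its own prediction."""
    probs = [black_box_proba(x) for x in xs]
    return sum(max(p, 1.0 - p) for p in probs) / len(probs)


def accuracy(data):
    """True accuracy; needs labels, so only available on held-out data."""
    return sum((black_box_proba(x) >= 0.5) == (y == 1) for x, y in data) / len(data)


def fit_performance_predictor(held_out, rng, rounds=50):
    """Corrupt the held-out data at random severities, then fit
    accuracy ~ intercept + slope * mean_confidence by ordinary least squares."""
    feats, accs = [], []
    for _ in range(rounds):
        corrupted = corrupt(held_out, rng.random(), rng)
        feats.append(mean_confidence([x for x, _ in corrupted]))
        accs.append(accuracy(corrupted))
    mean_f = sum(feats) / len(feats)
    mean_a = sum(accs) / len(accs)
    slope = (sum((f - mean_f) * (a - mean_a) for f, a in zip(feats, accs))
             / sum((f - mean_f) ** 2 for f in feats))
    intercept = mean_a - slope * mean_f
    return lambda xs: intercept + slope * mean_confidence(xs)


rng = random.Random(0)
held_out = sample_labeled(2000, rng)
predictor = fit_performance_predictor(held_out, rng)

# Serving data: in practice unlabeled; labels are kept here only to check the estimate.
serving = corrupt(sample_labeled(2000, rng), 0.6, rng)
estimate = predictor([x for x, _ in serving])

# Raise an alarm when the estimated accuracy drops by more than an assumed threshold.
alarm = accuracy(held_out) - estimate > 0.05
print(f"estimated accuracy {estimate:.3f}, true accuracy {accuracy(serving):.3f}, alarm={alarm}")
```

The predictor only ever sees model outputs on the serving data, so no serving labels are needed at validation time. A fuller version would use several error generators and a richer feature set than one summary statistic.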
