Bayesian Deep Learning via Subnetwork Inference

E. Daxberger; E. Nalisnick; J.U. Allingham; J. Antorán; J.M. Hernández-Lobato

Bayesian Deep Learning via Subnetwork Inference

Authors	E. Daxberger E. Nalisnick J.U. Allingham J. Antorán J.M. Hernández-Lobato
Publication date	2021
Journal	Proceedings of Machine Learning Research
Event	38th International Conference on Machine Learning
Volume \| Issue number	139
Pages (from-to)	2510-2521
Organisations	Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract	The Bayesian paradigm has the potential to solve core issues of deep neural networks such as poor calibration and data inefficiency. Alas, scaling Bayesian inference to large weight spaces often requires restrictive approximations. In this work, we show that it suffices to perform inference over a small subset of model weights in order to obtain accurate predictive posteriors. The other weights are kept as point estimates. This subnetwork inference framework enables us to use expressive, otherwise intractable, posterior approximations over such subsets. In particular, we implement subnetwork linearized Laplace as a simple, scalable Bayesian deep learning method: We first obtain a MAP estimate of all weights and then infer a full-covariance Gaussian posterior over a subnetwork using the linearized Laplace approximation. We propose a subnetwork selection strategy that aims to maximally preserve the model’s predictive uncertainty. Empirically, our approach compares favorably to ensembles and less expressive posterior approximations over full networks.
Document type	Article
Note	International Conference on Machine Learning, 18-24 July 2021, Virtual. - With supplementary file.
Language	English
Published at	https://proceedings.mlr.press/v139/daxberger21a.html
Downloads	daxberger21a (Final published version)
Supplementary materials	daxberger21a-supp
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

Bayesian Deep Learning via Subnetwork Inference