Adapting the Linearised Laplace Model Evidence for Modern Deep Learning

J. Antorán; D. Janz; J.U. Allingham; E. Daxberger; R. Barbano; E. Nalisnick; J.M. Hernández-Lobato

doi:https://doi.org/10.48550/arXiv.2206.08900

Adapting the Linearised Laplace Model Evidence for Modern Deep Learning

Authors	J. Antorán D. Janz J.U. Allingham E. Daxberger R. Barbano E. Nalisnick J.M. Hernández-Lobato
Publication date	2022
Journal	Proceedings of Machine Learning Research
Event	39th International Conference on Machine Learning
Volume \| Issue number	162
Pages (from-to)	796-821
Organisations	Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract	The linearised Laplace method for estimating model uncertainty has received renewed attention in the Bayesian deep learning community. The method provides reliable error bars and admits a closed-form expression for the model evidence, allowing for scalable selection of model hyperparameters. In this work, we examine the assumptions behind this method, particularly in conjunction with model selection. We show that these interact poorly with some now-standard tools of deep learning–stochastic approximation methods and normalisation layers–and make recommendations for how to better adapt this classic method to the modern setting. We provide theoretical support for our recommendations and validate them empirically on MLPs, classic CNNs, residual networks with and without normalisation layers, generative autoencoders and transformers.
Document type	Article
Note	International Conference on Machine Learning, 17-23 July 2022, Baltimore, Maryland, USA
Language	English
Published at	https://doi.org/10.48550/arXiv.2206.08900
Published at	https://proceedings.mlr.press/v162/antoran22a.html
Downloads	antoran22a (Final published version)
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

Adapting the Linearised Laplace Model Evidence for Modern Deep Learning