Inconsistency of Bayesian Inference for Misspecified Linear Models, and a Proposal for Repairing It

Open Access
Authors
Publication date 2017
Journal Bayesian Analysis
Volume | Issue number 12 | 4
Pages (from-to) 1069-1103
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
We empirically show that Bayesian inference can be inconsistent under misspecification in simple linear regression problems, both in a model averaging/selection and in a Bayesian ridge regression setting. We use the standard linear model, which assumes homoskedasticity, whereas the data are heteroskedastic (though, significantly, there are no outliers). As sample size increases, the posterior puts its mass on worse and worse models of ever higher dimension. This is caused by hypercompression, the phenomenon that the posterior puts its mass on distributions that have much larger KL divergence from the ground truth than their average, i.e. the Bayes predictive distribution. To remedy the problem, we equip the likelihood in Bayes’ theorem with an exponent called the learning rate, and we propose the SafeBayesian method to learn the learning rate from the data. SafeBayes tends to select small learning rates, and regularizes more, as soon as hypercompression takes place. Its results on our data are quite encouraging.
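As an illustration of what equipping the likelihood with an exponent can look like, here is a minimal sketch under assumptions the abstract does not state: a zero-mean Gaussian prior on the coefficients with variance tau2, and a known, fixed noise variance sigma2. The function name tempered_posterior and all its parameters are hypothetical. The sketch only computes the posterior for a given learning rate eta; it does not implement the SafeBayesian procedure for learning eta from the data.

```python
import numpy as np

def tempered_posterior(X, y, eta, sigma2=1.0, tau2=1.0):
    """Posterior over regression coefficients with the likelihood raised to the power eta.

    Assumptions (not from the abstract): prior beta ~ N(0, tau2 * I),
    likelihood y ~ N(X @ beta, sigma2 * I) with sigma2 known.
    eta = 1 recovers the standard Bayesian posterior.
    """
    d = X.shape[1]
    # Raising a Gaussian likelihood to the power eta scales its precision by eta.
    precision = eta * (X.T @ X) / sigma2 + np.eye(d) / tau2
    cov = np.linalg.inv(precision)
    mean = cov @ (eta * (X.T @ y) / sigma2)
    return mean, cov

# Synthetic data purely for illustration.
rng = np.random.default_rng(0)
X = rng.normal(size=(50, 3))
y = X @ np.array([1.0, -2.0, 0.5]) + rng.normal(size=50)

mean_bayes, cov_bayes = tempered_posterior(X, y, eta=1.0)  # standard Bayes
mean_small, cov_small = tempered_posterior(X, y, eta=0.1)  # smaller learning rate
```

In this conjugate setting the posterior mean equals a ridge estimate with penalty sigma2 / (eta * tau2), so a smaller eta shrinks the coefficients more strongly toward the prior. This is consistent with the abstract's remark that small learning rates regularize more.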
Document type Article
Note With supplemental materials
Language English
Published at https://doi.org/10.1214/17-BA1085 https://doi.org/10.1214/17-BA1085SUPP
Downloads
1510974325 (Final published version)
euclid.ba.1510974325 (Other version)