The Learning Rate Is Not a Constant

Jasper A. Vrugt; Cees G.H. Diks

doi:https://doi.org/10.3390/e27100999

The Learning Rate Is Not a Constant Sandwich-Adjusted Markov Chain Monte Carlo Simulation

Authors	Jasper A. Vrugt Cees G.H. Diks
Publication date	25-09-2025
Journal	Entropy
Article number	999
Volume \| Issue number	27 \| 10
Number of pages	57
Organisations	Faculty of Economics and Business (FEB) - Amsterdam School of Economics Research Institute (ASE-RI) Faculty of Economics and Business (FEB)
Abstract	A fundamental limitation of maximum likelihood and Bayesian methods under model misspecification is that the asymptotic covariance matrix of the pseudo-true parameter vector (Formula presented.) is not the inverse of the Fisher information, but rather the sandwich covariance matrix (Formula presented.), where (Formula presented.) and (Formula presented.) are the sensitivity and variability matrices, respectively, evaluated at (Formula presented.) for training data record (Formula presented.). This paper makes three contributions. First, we review existing approaches to robust posterior sampling, including the open-faced sandwich adjustment and magnitude- and curvature-adjusted Markov chain Monte Carlo (MCMC) simulation. Second, we introduce a new sandwich-adjusted MCMC method. Unlike existing approaches that rely on arbitrary matrix square roots, eigendecompositions or a single scaling factor applied uniformly across the parameter space, our method employs a parameter-dependent learning rate (Formula presented.) that enables direction-specific tempering of the likelihood. This allows the sampler to capture directional asymmetries in the sandwich distribution, particularly under model misspecification or in small-sample regimes, and yields credible regions that remain valid when standard Bayesian inference underestimates uncertainty. Third, we propose information-theoretic diagnostics for quantifying model misspecification, including a strictly proper divergence score and scalar summaries based on the Frobenius norm, Earth mover’s distance, and the Herfindahl index. These principled diagnostics complement residual-based metrics for model evaluation by directly assessing the degree of misalignment between the sensitivity and variability matrices, (Formula presented.) and (Formula presented.). Applications to two parametric distributions and a rainfall-runoff case study with the Xinanjiang watershed model show that conventional Bayesian methods systematically underestimate uncertainty, while the proposed method yields asymptotically valid and robust uncertainty estimates. Together, these findings advocate for sandwich-based adjustments in Bayesian practice and workflows.
Document type	Article
Language	English
Published at	https://doi.org/10.3390/e27100999
Other links	https://www.scopus.com/pages/publications/105020281043
Downloads	entropy-27-00999 (Final published version)
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

The Learning Rate Is Not a Constant Sandwich-Adjusted Markov Chain Monte Carlo Simulation