The Learning Rate Is Not a Constant Sandwich-Adjusted Markov Chain Monte Carlo Simulation
| Authors |
|
|---|---|
| Publication date | 25-09-2025 |
| Journal | Entropy |
| Article number | 999 |
| Volume | Issue number | 27 | 10 |
| Number of pages | 57 |
| Organisations |
|
| Abstract |
A fundamental limitation of maximum likelihood and Bayesian methods under model misspecification is that the asymptotic covariance matrix of the pseudo-true parameter vector (Formula presented.) is not the inverse of the Fisher information, but rather the sandwich covariance matrix (Formula presented.), where (Formula presented.) and (Formula presented.) are the sensitivity and variability matrices, respectively, evaluated at (Formula presented.) for training data record (Formula presented.). This paper makes three contributions. First, we review existing approaches to robust posterior sampling, including the open-faced sandwich adjustment and magnitude- and curvature-adjusted Markov chain Monte Carlo (MCMC) simulation. Second, we introduce a new sandwich-adjusted MCMC method. Unlike existing approaches that rely on arbitrary matrix square roots, eigendecompositions or a single scaling factor applied uniformly across the parameter space, our method employs a parameter-dependent learning rate (Formula presented.) that enables direction-specific tempering of the likelihood. This allows the sampler to capture directional asymmetries in the sandwich distribution, particularly under model misspecification or in small-sample regimes, and yields credible regions that remain valid when standard Bayesian inference underestimates uncertainty. Third, we propose information-theoretic diagnostics for quantifying model misspecification, including a strictly proper divergence score and scalar summaries based on the Frobenius norm, Earth mover’s distance, and the Herfindahl index. These principled diagnostics complement residual-based metrics for model evaluation by directly assessing the degree of misalignment between the sensitivity and variability matrices, (Formula presented.) and (Formula presented.). Applications to two parametric distributions and a rainfall-runoff case study with the Xinanjiang watershed model show that conventional Bayesian methods systematically underestimate uncertainty, while the proposed method yields asymptotically valid and robust uncertainty estimates. Together, these findings advocate for sandwich-based adjustments in Bayesian practice and workflows. |
| Document type | Article |
| Language | English |
| Published at | https://doi.org/10.3390/e27100999 |
| Other links | https://www.scopus.com/pages/publications/105020281043 |
| Downloads |
entropy-27-00999
(Final published version)
|
| Permalink to this page | |