2012
DOI: 10.1111/j.1467-9868.2011.01025.x
|View full text |Cite
|
Sign up to set email alerts
|

Catching up Faster by Switching Sooner: A Predictive Approach to Adaptive Estimation with an Application to the AIC–BIC Dilemma

Abstract: Summary. Prediction and estimation based on Bayesian model selection and model averaging, and derived methods such as the Bayesian information criterion BIC, do not always converge at the fastest possible rate. We identify the catch-up phenomenon as a novel explanation for the slow convergence of Bayesian methods, which inspires a modification of the Bayesian predictive distribution, called the switch distribution. When used as an adaptive estimator, the switch distribution does achieve optimal cumulative risk… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
48
0

Year Published

2013
2013
2023
2023

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 69 publications
(48 citation statements)
references
References 86 publications
0
48
0
Order By: Relevance
“…Information criteria are valuable tools for model selection (Burnham and Anderson, ; Claeskens and Hjort, ; Konishi and Kitagawa, ). At a high level, they fall into two categories (Yang, ; van Erven et al ., ; Wit et al ., ). On one side, there are criteria that target good predictive behaviour of the selected model.…”
Section: Introductionmentioning
confidence: 99%
“…Information criteria are valuable tools for model selection (Burnham and Anderson, ; Claeskens and Hjort, ; Konishi and Kitagawa, ). At a high level, they fall into two categories (Yang, ; van Erven et al ., ; Wit et al ., ). On one side, there are criteria that target good predictive behaviour of the selected model.…”
Section: Introductionmentioning
confidence: 99%
“…However as conjectured in [19], it can probably be extended to other universal codes on the inner level, such as NML (for some special cases, it was proven in [41] that this is indeed the case). Note that (as we keep emphasising) we do not make any assumptions about the true distribution P * .…”
Section: Theorem 42 (Model Selection Consistencymentioning
confidence: 99%
“…For example, in many situations, one can achieve both consistency and optimal convergence rates; see Section 4.2.2. For more details, please refer to [41].…”
Section: Model Switchingmentioning
confidence: 99%
“…That is, it has been shown that the model with the highest posterior probability is also the one with the smallest accumulative prediction error even when none of the models under consideration is the data-generating model (Kass and Raftery, 1995; Wagenmakers et al, 2006; van Erven et al, 2012). …”
mentioning
confidence: 99%