
arXiv: math/0406464
handle: 11590/153555
Often the goal of model selection is to choose a model for future prediction, and it is natural to measure the accuracy of a future prediction by squared error loss. Under the Bayesian approach, it is commonly perceived that the optimal predictive model is the model with highest posterior probability, but this is not necessarily the case. In this paper we show that, for selection among normal linear models, the optimal predictive model is often the median probability model, which is defined as the model consisting of those variables which have overall posterior probability greater than or equal to 1/2 of being in a model. The median probability model often differs from the highest probability model.
ANOVA, 62C10, Linear regression; mixed models, Bayesian linear models, Estimation in multivariate analysis, Bayesian inference, predictive distribution, Mathematics - Statistics Theory, Statistics Theory (math.ST), Bayesian problems; characterization of Bayes procedures, FOS: Mathematics, squared error loss, 62F15, MANOVA, 62F15 (Primary) 62C10. (Secondary), Bayesian linear models; predictive distribution; squared error loss; variable selection, variable selection
ANOVA, 62C10, Linear regression; mixed models, Bayesian linear models, Estimation in multivariate analysis, Bayesian inference, predictive distribution, Mathematics - Statistics Theory, Statistics Theory (math.ST), Bayesian problems; characterization of Bayes procedures, FOS: Mathematics, squared error loss, 62F15, MANOVA, 62F15 (Primary) 62C10. (Secondary), Bayesian linear models; predictive distribution; squared error loss; variable selection, variable selection
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 751 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 0.1% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 0.1% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
