Optimal predictive model selection

descriptionPublicationkeyboard_double_arrow_right Article , Preprint , Other literature type 01 Jun 2004Embargo end date: 01 Jan 2004 Italy Publisher:Institute of Mathematical StatisticsJournal:The Annals of Statistics, volume 32 (issn: 0090-5364,

Copyright policy )

Authors: Barbieri, Maria Maddalena; Berger, James O.;

doi: 10.1214/009053604000000238 , 10.48550/arxiv.math/0406464

arXiv: math/0406464

handle: 11590/153555

Optimal predictive model selection

- Summary
- Subjects
- Metrics

Abstract

Often the goal of model selection is to choose a model for future prediction, and it is natural to measure the accuracy of a future prediction by squared error loss. Under the Bayesian approach, it is commonly perceived that the optimal predictive model is the model with highest posterior probability, but this is not necessarily the case. In this paper we show that, for selection among normal linear models, the optimal predictive model is often the median probability model, which is defined as the model consisting of those variables which have overall posterior probability greater than or equal to 1/2 of being in a model. The median probability model often differs from the highest probability model.

Country

Italy

Related Organizations

Duke University
United States
Roma Tre University
Italy

Keywords

ANOVA, 62C10, Linear regression; mixed models, Bayesian linear models, Estimation in multivariate analysis, Bayesian inference, predictive distribution, Mathematics - Statistics Theory, Statistics Theory (math.ST), Bayesian problems; characterization of Bayes procedures, FOS: Mathematics, squared error loss, 62F15, MANOVA, 62F15 (Primary) 62C10. (Secondary), Bayesian linear models; predictive distribution; squared error loss; variable selection, variable selection

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	751
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 0.1%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 0.1%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%

Found an issue? Give us feedback

751

Top 0.1%

Top 10%

Green

hybrid

Fields of Science (4) View all

Fields of Science