
AbstractWe give a permutation approach to validation (estimation of out‐sample error). One typical use of validation is model selection. We establish the legitimacy of the proposed permutation complexity by proving a uniform bound on the out‐sample error, similar to a Vapnik‐Chervonenkis (VC)‐style bound. We extensively demonstrate this approach experimentally on synthetic data, standard data sets from the UCI‐repository, and a novel diffusion data set. The out‐of‐sample error estimates are comparable to cross validation (CV); yet, the method is more efficient and robust, being less susceptible to overfitting during model selection. Copyright © 2010 Wiley Periodicals, Inc. Statistical Analysis and Data Mining 3: 361‐380 2010
model selection, uniform bounds, Statistics, out-sample error, Computer science
model selection, uniform bounds, Statistics, out-sample error, Computer science
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 5 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
