
arXiv: 0801.2934
Let $(X,Y)$ be a random variable consisting of an observed feature vector $X\in \mathcal{X}$ and an unobserved class label $Y\in \{1,2,...,L\}$ with unknown joint distribution. In addition, let $\mathcal{D}$ be a training data set consisting of $n$ completely observed independent copies of $(X,Y)$. Usual classification procedures provide point predictors (classifiers) $\widehat{Y}(X,\mathcal{D})$ of $Y$ or estimate the conditional distribution of $Y$ given $X$. In order to quantify the certainty of classifying $X$ we propose to construct for each $��=1,2,...,L$ a p-value $��_��(X,\mathcal{D})$ for the null hypothesis that $Y=��$, treating $Y$ temporarily as a fixed parameter. In other words, the point predictor $\widehat{Y}(X,\mathcal{D})$ is replaced with a prediction region for $Y$ with a certain confidence. We argue that (i) this approach is advantageous over traditional approaches and (ii) any reasonable classifier can be modified to yield nonparametric p-values. We discuss issues such as optimality, single use and multiple use validity, as well as computational and graphical aspects.
Published in at http://dx.doi.org/10.1214/08-EJS245 the Electronic Journal of Statistics (http://www.i-journals.org/ejs/) by the Institute of Mathematical Statistics (http://www.imstat.org)
FOS: Computer and information sciences, validity, Mathematics - Statistics Theory, Machine Learning (stat.ML), Statistics Theory (math.ST), 62C05, 62F25, 62G09, 62G15, 62H30 (Primary), Nonparametric tolerance and confidence regions, nearest neighbors, Statistics - Machine Learning, 62G09, FOS: Mathematics, Nonparametric statistical resampling methods, 62F25, nonparametric, typicality index, 62C05, Parametric tolerance and confidence regions, Classification and discrimination; cluster analysis (statistical aspects), General considerations in statistical decision theory, prediction region, ROC curve, optimality, 62G15, 62H30, permutation test
FOS: Computer and information sciences, validity, Mathematics - Statistics Theory, Machine Learning (stat.ML), Statistics Theory (math.ST), 62C05, 62F25, 62G09, 62G15, 62H30 (Primary), Nonparametric tolerance and confidence regions, nearest neighbors, Statistics - Machine Learning, 62G09, FOS: Mathematics, Nonparametric statistical resampling methods, 62F25, nonparametric, typicality index, 62C05, Parametric tolerance and confidence regions, Classification and discrimination; cluster analysis (statistical aspects), General considerations in statistical decision theory, prediction region, ROC curve, optimality, 62G15, 62H30, permutation test
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 10 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
