P-values for classification

descriptionPublicationkeyboard_double_arrow_right Article , Preprint , Other literature type 01 Jan 2008Embargo end date: 01 Jan 2008 Germany Publisher:Institute of Mathematical StatisticsJournal:Electronic Journal of Statistics, volume 2 (issn: 1935-7524,

Copyright policy )Funded by:SNSF | Quantifizierung der Unsic...

Authors: Dümbgen Lutz; Igl Bernd-Wolfgang; Munk Axel;

doi: 10.1214/08-ejs245 , 10.48550/arxiv.0801.2934 , 10.7892/boris.36879

arXiv: 0801.2934

P-values for classification

- Summary
- Subjects
- Metrics

Abstract

Let $(X,Y)$ be a random variable consisting of an observed feature vector $X\in \mathcal{X}$ and an unobserved class label $Y\in \{1,2,...,L\}$ with unknown joint distribution. In addition, let $\mathcal{D}$ be a training data set consisting of $n$ completely observed independent copies of $(X,Y)$. Usual classification procedures provide point predictors (classifiers) $\widehat{Y}(X,\mathcal{D})$ of $Y$ or estimate the conditional distribution of $Y$ given $X$. In order to quantify the certainty of classifying $X$ we propose to construct for each $θ=1,2,...,L$ a p-value $π_θ(X,\mathcal{D})$ for the null hypothesis that $Y=θ$, treating $Y$ temporarily as a fixed parameter. In other words, the point predictor $\widehat{Y}(X,\mathcal{D})$ is replaced with a prediction region for $Y$ with a certain confidence. We argue that (i) this approach is advantageous over traditional approaches and (ii) any reasonable classifier can be modified to yield nonparametric p-values. We discuss issues such as optimality, single use and multiple use validity, as well as computational and graphical aspects.

Published in at http://dx.doi.org/10.1214/08-EJS245 the Electronic Journal of Statistics (http://www.i-journals.org/ejs/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Country

Germany

Related Organizations

Keywords

FOS: Computer and information sciences, validity, Mathematics - Statistics Theory, Machine Learning (stat.ML), Statistics Theory (math.ST), 62C05, 62F25, 62G09, 62G15, 62H30 (Primary), Nonparametric tolerance and confidence regions, nearest neighbors, Statistics - Machine Learning, 62G09, FOS: Mathematics, Nonparametric statistical resampling methods, 62F25, nonparametric, typicality index, 62C05, Parametric tolerance and confidence regions, Classification and discrimination; cluster analysis (statistical aspects), General considerations in statistical decision theory, prediction region, ROC curve, optimality, 62G15, 62H30, permutation test

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	10
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average