
The problem of ranking/ordering instances, instead of simply classifying them, has recently gained much attention in machine learning. In this paper we formulate the ranking problem in a rigorous statistical framework. The goal is to learn a ranking rule for deciding, among two instances, which one is "better," with minimum ranking risk. Since the natural estimates of the risk are of the form of a U-statistic, results of the theory of U-processes are required for investigating the consistency of empirical risk minimizers. We establish in particular a tail inequality for degenerate U-processes, and apply it for showing that fast rates of convergence may be achieved under specific noise assumptions, just like in classification. Convex risk minimization methods are also studied.
32 pages
[MATH.MATH-PR] Mathematics [math]/Probability [math.PR], 68Q32, moment inequalities, Mathematics - Statistics Theory, [MATH] Mathematics [math], Statistics Theory (math.ST), FOS: Mathematics, 60C05, Inequalities; stochastic orderings, 60G25, Prediction theory (aspects of stochastic processes), theory of classification, [MATH.MATH-ST] Mathematics [math]/Statistics [math.ST], Combinatorial probability, Classification and discrimination; cluster analysis (statistical aspects), Computational learning theory, VC classes, convex risk minimization, Statistical learning, [STAT.ML] Statistics [stat]/Machine Learning [stat.ML], fast rates, \(U\)-processes, 68Q32, 60G99, 62G99, 62M99, U-processes, statistical learning, 60E15
[MATH.MATH-PR] Mathematics [math]/Probability [math.PR], 68Q32, moment inequalities, Mathematics - Statistics Theory, [MATH] Mathematics [math], Statistics Theory (math.ST), FOS: Mathematics, 60C05, Inequalities; stochastic orderings, 60G25, Prediction theory (aspects of stochastic processes), theory of classification, [MATH.MATH-ST] Mathematics [math]/Statistics [math.ST], Combinatorial probability, Classification and discrimination; cluster analysis (statistical aspects), Computational learning theory, VC classes, convex risk minimization, Statistical learning, [STAT.ML] Statistics [stat]/Machine Learning [stat.ML], fast rates, \(U\)-processes, 68Q32, 60G99, 62G99, 62M99, U-processes, statistical learning, 60E15
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 169 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 1% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 1% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
