
The concept of biased data is well known and its practical applications range from social sciences and biology to economics and quality control. These observations arise when a sampling procedure chooses an observation with probability that depends on the value of the observation. This is an interesting sampling procedure because it favors some observations and neglects others. It is known that biasing does not change rates of nonparametric density estimation, but no results are available about sharp constants. This article presents asymptotic results on sharp minimax density estimation. In particular, a coefficient of difficulty is introduced that shows the relationship between sample sizes of direct and biased samples that imply the same accuracy of estimation. The notion of the restricted local minimax, where a low-frequency part of the estimated density is known, is introduced; it sheds new light on the phenomenon of nonparametric superefficiency. Results of a numerical study are presented.
62E20, 625G07 (Primary) 62C05, 62E20. (Secondary), Asymptotic distribution theory in statistics, coefficient of difficulty, 625G07, Mathematics - Statistics Theory, adaptation, Statistics Theory (math.ST), small sample, Density estimation, Asymptotic properties of nonparametric inference, FOS: Mathematics, nonparametric, Adaptation, 62C05, restricted minimax, average risk
62E20, 625G07 (Primary) 62C05, 62E20. (Secondary), Asymptotic distribution theory in statistics, coefficient of difficulty, 625G07, Mathematics - Statistics Theory, adaptation, Statistics Theory (math.ST), small sample, Density estimation, Asymptotic properties of nonparametric inference, FOS: Mathematics, nonparametric, Adaptation, 62C05, restricted minimax, average risk
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 37 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
