
arXiv: 2006.11201
handle: 10419/241905
We consider both $\ell _{0}$-penalized and $\ell _{0}$-constrained quantile regression estimators. For the $\ell _{0}$-penalized estimator, we derive an exponential inequality on the tail probability of excess quantile prediction risk and apply it to obtain non-asymptotic upper bounds on the mean-square parameter and regression function estimation errors. We also derive analogous results for the $\ell _{0}$-constrained estimator. The resulting rates of convergence are nearly minimax-optimal and the same as those for $\ell _{1}$-penalized and non-convex penalized estimators. Further, we characterize expected Hamming loss for the $\ell _{0}$-penalized estimator. We implement the proposed procedure via mixed integer linear programming and also a more scalable first-order approximation algorithm. We illustrate the finite-sample performance of our approach in Monte Carlo experiments and its usefulness in a real data application concerning conformal prediction of infant birth weights (with $n\approx 10^{3}$ and up to $p>10^{3}$). In sum, our $\ell _{0}$-based method produces a much sparser estimator than the $\ell _{1}$-penalized and non-convex penalized approaches without compromising precision.
51 pages, 3 figures, 3 tables
FOS: Computer and information sciences, quantile regression, ddc:330, finitesample property, Statistics, Game theory, economics, finance, and other social and behavioral sciences, Econometrics (econ.EM), mixed integer optimization, finite sample property, conformal prediction, Methodology (stat.ME), FOS: Economics and business, Hamming distance, sparse estimation, Statistics - Methodology, Economics - Econometrics
FOS: Computer and information sciences, quantile regression, ddc:330, finitesample property, Statistics, Game theory, economics, finance, and other social and behavioral sciences, Econometrics (econ.EM), mixed integer optimization, finite sample property, conformal prediction, Methodology (stat.ME), FOS: Economics and business, Hamming distance, sparse estimation, Statistics - Methodology, Economics - Econometrics
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 4 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
