Debiasing the debiased Lasso with bootstrap

Publication type: Article, Preprint, Other literature type
Publication date: 01 Jan 2020
Embargo end date: 01 Jan 2017
Publisher: Institute of Mathematical Statistics
Journal: Electronic Journal of Statistics, volume 14 (ISSN: 1935-7524)

Authors: Li, Sai

doi: 10.1214/20-ejs1713 , 10.48550/arxiv.1711.03613

arXiv: 1711.03613


Abstract

We consider statistical inference for a single coordinate of the regression coefficient vector in high-dimensional linear models. Debiased estimators have recently become popular for constructing confidence intervals and testing hypotheses in such models. However, representative numerical experiments show that they tend to be biased for large coefficients, especially when large coefficients outnumber small ones. In this paper, we propose a modified debiased Lasso estimator based on the bootstrap, which we denote BS-DB for short. We show that, under the irrepresentable condition and other mild technical conditions, the BS-DB has bias of smaller order than the debiased Lasso when a large proportion of the signals are strong. If the irrepresentable condition does not hold, the BS-DB is guaranteed to perform no worse than the debiased Lasso asymptotically. Confidence intervals based on the BS-DB are proposed and proved to be asymptotically valid under mild conditions. Our study of the inference problem integrates the variable selection and estimation properties of the Lasso in a novel way. Extensive numerical studies demonstrate the superior performance of the BS-DB over the debiased Lasso.
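
The abstract does not spell out how the BS-DB is constructed, so the following is only a rough illustration of the general idea, not the paper's algorithm. It assumes a standard debiased Lasso of the form b_j + z_j'(y - Xb) / (z_j'x_j), where z_j is the residual from a Lasso regression of column j on the other columns, and it assumes a residual bootstrap in which the bias of that estimator is estimated and then subtracted. The function names (`lasso_cd`, `score_vector`, `debiased`, `bs_db`), the tuning parameters `lam` and `lam_node`, and the use of the bootstrap mean are all hypothetical choices made for this sketch.

```python
import random

def soft(v, t):
    """Soft-thresholding operator used in the Lasso coordinate update."""
    return v - t if v > t else v + t if v < -t else 0.0

def lasso_cd(X, y, lam, n_iter=100, tol=1e-8):
    """Coordinate descent for (1/2n)||y - Xb||^2 + lam * ||b||_1."""
    n, p = len(X), len(X[0])
    b = [0.0] * p
    r = list(y)  # running residual y - Xb
    col_sq = [sum(X[i][k] ** 2 for i in range(n)) / n for k in range(p)]
    for _ in range(n_iter):
        max_step = 0.0
        for k in range(p):
            if col_sq[k] == 0.0:
                continue
            # (1/n) * x_k' (partial residual with coordinate k removed)
            rho = sum(X[i][k] * r[i] for i in range(n)) / n + col_sq[k] * b[k]
            new = soft(rho, lam) / col_sq[k]
            d = new - b[k]
            if d != 0.0:
                for i in range(n):
                    r[i] -= d * X[i][k]
                b[k] = new
                max_step = max(max_step, abs(d))
        if max_step < tol:
            break
    return b

def score_vector(X, j, lam_node):
    """z_j: residual of a Lasso regression of column j on the other columns."""
    n, p = len(X), len(X[0])
    others = [k for k in range(p) if k != j]
    X_rest = [[row[k] for k in others] for row in X]
    x_j = [row[j] for row in X]
    g = lasso_cd(X_rest, x_j, lam_node)
    z = [x_j[i] - sum(X_rest[i][m] * g[m] for m in range(p - 1)) for i in range(n)]
    return z, x_j

def debiased(X, y, j, lam, z, x_j):
    """Return the Lasso fit, its residuals, and the debiased estimate of b_j."""
    n = len(X)
    b = lasso_cd(X, y, lam)
    resid = [y[i] - sum(X[i][k] * b[k] for k in range(len(b))) for i in range(n)]
    zx = sum(z[i] * x_j[i] for i in range(n))
    b_db = b[j] + sum(z[i] * resid[i] for i in range(n)) / zx
    return b, resid, b_db

def bs_db(X, y, j, lam, lam_node, n_boot=20, seed=0):
    """Debiased Lasso and a bootstrap-bias-corrected version (assumed scheme)."""
    rng = random.Random(seed)
    n = len(X)
    z, x_j = score_vector(X, j, lam_node)  # depends on X only, so reused below
    b, resid, b_db = debiased(X, y, j, lam, z, x_j)
    mean_r = sum(resid) / n
    centered = [e - mean_r for e in resid]
    fitted = [y[i] - resid[i] for i in range(n)]
    boot = []
    for _ in range(n_boot):
        # Residual bootstrap: the Lasso fit plays the role of the truth.
        y_star = [fitted[i] + rng.choice(centered) for i in range(n)]
        boot.append(debiased(X, y_star, j, lam, z, x_j)[2])
    # In the bootstrap world the "true" coefficient is the Lasso estimate b[j].
    bias_hat = sum(boot) / n_boot - b[j]
    return b_db, b_db - bias_hat
```

Calling `bs_db(X, y, j, lam, lam_node)` with `X` as a list of rows and `y` as a list returns the plain debiased estimate and the bias-corrected one for coordinate `j`. The pure-Python loops are meant for readability on small problems only; how the paper chooses tuning parameters, and how the confidence intervals are formed, is not covered by this sketch.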

Accepted version

Related Organizations

University of Pennsylvania
United States
Rutgers, The State University of New Jersey
United States
Rutgers University

Keywords

Ridge regression; shrinkage estimators (Lasso), Confidence intervals, Mathematics - Statistics Theory, Statistics Theory (math.ST), high-dimensional models, Nonparametric tolerance and confidence regions, FOS: Mathematics, Nonparametric statistical resampling methods, debiased Lasso

Impact by BIP!

selected citations: 5
popularity: Top 10%
influence: Average
impulse: Top 10%
