
arXiv: 1008.0526
Consider the standard Gaussian linear regression model $Y = X\beta_0 + \varepsilon$, where $Y \in \mathbb{R}^n$ is a response vector and $X \in \mathbb{R}^{n\times p}$ is a design matrix. Much work has been devoted to building efficient estimators of $\beta_0$ when $p$ is much larger than $n$. In this situation, a classical approach is to assume that $\beta_0$ is approximately sparse. This paper studies the minimax risks of estimation and testing over classes of $k$-sparse vectors $\beta_0$. These bounds shed light on the limitations due to high dimensionality. The results encompass the problem of prediction (estimation of $X\beta_0$), the inverse problem (estimation of $\beta_0$), and linear testing (testing $X\beta_0 = 0$). Interestingly, an elbow effect occurs when $k\log(p/k)$ becomes large compared to $n$: the minimax risks and hypothesis separation distances blow up in this ultra-high dimensional setting. We also prove that even dimension-reduction techniques cannot provide satisfactory results in an ultra-high dimensional setting. Moreover, we compute the minimax risks when the variance of the noise is unknown. The knowledge of this variance is shown to play a significant role in the optimal rates of estimation and testing. All these minimax bounds provide a characterization of statistical problems that are so difficult that no procedure can provide satisfactory results.
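The sparse regression setup described above can be sketched in a few lines. This is a hypothetical illustration (the dimensions $n$, $p$, $k$ and the unit signal/noise values are arbitrary choices, not taken from the paper); it generates one instance of the model and computes the ratio $k\log(p/k)/n$ that governs the elbow effect.

```python
import numpy as np

# Sketch of the model Y = X beta_0 + eps, with beta_0 a k-sparse
# vector in R^p and n much smaller than p.
rng = np.random.default_rng(0)

n, p, k = 50, 1000, 5                 # sample size, ambient dimension, sparsity
X = rng.standard_normal((n, p))       # Gaussian design matrix
beta0 = np.zeros(p)
support = rng.choice(p, size=k, replace=False)
beta0[support] = 1.0                  # k non-zero coordinates (arbitrary value)
eps = rng.standard_normal(n)          # Gaussian noise, variance 1
Y = X @ beta0 + eps

# The minimax rates change regime when k*log(p/k) becomes large
# compared to n (the ultra-high dimensional setting).
ratio = k * np.log(p / k) / n
print(f"k log(p/k) / n = {ratio:.2f}")
```

With these particular values the ratio is about 0.53, i.e. still below the ultra-high dimensional regime; increasing $p$ or $k$ (or shrinking $n$) pushes the instance past the elbow.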
Keywords: model selection; high-dimensional geometry; dimension reduction; minimax procedures in statistical decision theory; adaptive estimation; minimax risk; risk estimation; minimax hypothesis testing; high-dimensional regression; sparse vectors; linear regression; mixed models. Subjects: Mathematics - Statistics Theory (math.ST); MSC 62J05, 62C20, 62F35.
| Indicator | Description | Value |
|-----------|-------------|-------|
| Selected citations | Citations derived from selected sources; an alternative to the "influence" indicator, which reflects the overall/total impact based on the underlying citation network (diachronically). | 70 |
| Popularity | Reflects the current impact/attention (the "hype") of the article in the research community at large, based on the underlying citation network. | Top 10% |
| Influence | Reflects the overall/total impact of the article in the research community at large, based on the underlying citation network (diachronically). | Top 10% |
| Impulse | Reflects the initial momentum of the article directly after its publication, based on the underlying citation network. | Top 10% |
