
arXiv: 0803.1909
This paper considers estimating a covariance matrix of $p$ variables from $n$ observations by either banding or tapering the sample covariance matrix, or estimating a banded version of the inverse of the covariance. We show that these estimates are consistent in the operator norm as long as $(\log p)/n\to0$, and obtain explicit rates. The results are uniform over some fairly natural well-conditioned families of covariance matrices. We also introduce an analogue of the Gaussian white noise model and show that if the population covariance is embeddable in that model and well-conditioned, then the banded approximations produce consistent estimates of the eigenvalues and associated eigenvectors of the covariance matrix. The results can be extended to smooth versions of banding and to non-Gaussian distributions with sufficiently short tails. A resampling approach is proposed for choosing the banding parameter in practice. This approach is illustrated numerically on both simulated and real data.
Published in at http://dx.doi.org/10.1214/009053607000000758 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)
covariance matrix, Eigenvalues, singular values, and eigenvectors, Covariance matrix, Estimation in multivariate analysis, Bootstrap, jackknife and other resampling methods, Mathematics - Statistics Theory, Statistics Theory (math.ST), regularization, 62G09, FOS: Mathematics, 62H12, 62F12, banding, Cholesky decomposition, Asymptotic properties of parametric estimators, 62H12 (Primary) 62F12, 62G09 (Secondary)
covariance matrix, Eigenvalues, singular values, and eigenvectors, Covariance matrix, Estimation in multivariate analysis, Bootstrap, jackknife and other resampling methods, Mathematics - Statistics Theory, Statistics Theory (math.ST), regularization, 62G09, FOS: Mathematics, 62H12, 62F12, banding, Cholesky decomposition, Asymptotic properties of parametric estimators, 62H12 (Primary) 62F12, 62G09 (Secondary)
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 869 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 0.1% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 0.1% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 1% |
