Random matrix-improved estimation of covariance matrix distances

Name: Random matrix-improved estimation of covariance matrix distances
Keywords: FOS: Computer and information sciences, covariance estimation, Computer Science - Machine Learning, 330, Random matrices (algebraic aspects), [class=MSC] random matrix theory, Estimation in multivariate analysis, Probability (math.PR), 2010 MSC: Secondary 62M45, distances and divergences 2010 MSC: Primary 60B20

Couillet, Romain; Tiomoko, Malik; Zozor, Steeve; Moisan, Eric

Found an issue? Give us feedback

Journal of Multivari...arrow_drop_down

Journal of Multivariate Analysis

Article

Data sources: UnpayWall

arXiv.org e-Print Archive

Preprint . 2018

Data sources: arXiv.org e-Print Archive

Université Grenoble Alpes: HAL

Article . 2019

Data sources: Bielefeld Academic Search Engine (BASE)

Journal of Multivariate Analysis

Article . 2019 . Peer-reviewed

License: Elsevier Non-Commercial

Data sources: Crossref

zbMATH Open

Article . 2019

Data sources: zbMATH Open

https://dx.doi.org/10.48550/ar...

Article . 2018

License: arXiv Non-Exclusive Distribution

Data sources: Datacite

DBLP

Article

Data sources: DBLP

DBLP

Article

Data sources: DBLP

https://dx.doi.org/10.1016/j.j...

Article

Data sources: Microsoft Academic Graph

Random matrix-improved estimation of covariance matrix distances

descriptionPublicationkeyboard_double_arrow_right Article , Preprint 01 Nov 2019Embargo end date: 01 Jan 2018 France English Publisher:Elsevier BVJournal:Journal of Multivariate Analysis, volume 174, page 104,531 (issn: 0047-259X,

Copyright policy )Funded by:ANR | RMT4GRAPH

Authors: Couillet, Romain; Tiomoko, Malik; Zozor, Steeve; Moisan, Eric;

doi: 10.1016/j.jmva.2019.06.009 , 10.48550/arxiv.1810.04534

arXiv: 1810.04534

Random matrix-improved estimation of covariance matrix distances

- Summary
- Subjects
- Metrics

Abstract

Given two sets $x_1^{(1)},\ldots,x_{n_1}^{(1)}$ and $x_1^{(2)},\ldots,x_{n_2}^{(2)}\in\mathbb{R}^p$ (or $\mathbb{C}^p$) of random vectors with zero mean and positive definite covariance matrices $C_1$ and $C_2\in\mathbb{R}^{p\times p}$ (or $\mathbb{C}^{p\times p}$), respectively, this article provides novel estimators for a wide range of distances between $C_1$ and $C_2$ (along with divergences between some zero mean and covariance $C_1$ or $C_2$ probability measures) of the form $\frac1p\sum_{i=1}^n f(��_i(C_1^{-1}C_2))$ (with $��_i(X)$ the eigenvalues of matrix $X$). These estimators are derived using recent advances in the field of random matrix theory and are asymptotically consistent as $n_1,n_2,p\to\infty$ with non trivial ratios $p/n_1<1$ and $p/n_2<1$ (the case $p/n_2>1$ is also discussed). A first "generic" estimator, valid for a large set of $f$ functions, is provided under the form of a complex integral. Then, for a selected set of $f$'s of practical interest (namely, $f(t)=t$, $f(t)=\log(t)$, $f(t)=\log(1+st)$ and $f(t)=\log^2(t)$), a closed-form expression is provided. Beside theoretical findings, simulation results suggest an outstanding performance advantage for the proposed estimators when compared to the classical "plug-in" estimator $\frac1p\sum_{i=1}^n f(��_i(\hat C_1^{-1}\hat C_2))$ (with $\hat C_a=\frac1{n_a}\sum_{i=1}^{n_a}x_i^{(a)}x_i^{(a){\sf T}}$), and this even for very small values of $n_1,n_2,p$.

Country

France

Related Organizations

Keywords

FOS: Computer and information sciences, covariance estimation, Computer Science - Machine Learning, 330, Random matrices (algebraic aspects), [class=MSC] random matrix theory, Estimation in multivariate analysis, Probability (math.PR), 2010 MSC: Secondary 62M45, distances and divergences 2010 MSC: Primary 60B20, Mathematics - Statistics Theory, Statistics Theory (math.ST), random matrix theory, 510, Machine Learning (cs.LG), [STAT]Statistics [stat], Neural nets and related approaches to inference from stochastic processes, Random matrices (probabilistic aspects), FOS: Mathematics, distances and divergences, Mathematics - Probability

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	4
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

4

Average

Green

bronze

Fields of Science (4) View all

Fields of Science

Funded by