
arXiv: 1009.5165
We consider in this paper the multivariate regression problem, when the target regression matrix $A$ is close to a low rank matrix. Our primary interest in on the practical case where the variance of the noise is unknown. Our main contribution is to propose in this setting a criterion to select among a family of low rank estimators and prove a non-asymptotic oracle inequality for the resulting estimator. We also investigate the easier case where the variance of the noise is known and outline that the penalties appearing in our criterions are minimal (in some sense). These penalties involve the expected value of the Ky-Fan quasi-norm of some random matrices. These quantities can be evaluated easily in practice and upper-bounds can be derived from recent results in random matrix theory.
23 pages
60B20, 330, Linear regression; mixed models, estimator selection, Estimation in multivariate analysis, Ky-Fan norms, random matrix, Multivariate regression, Mathematics - Statistics Theory, [STAT.TH]Statistics [stat]/Statistics Theory [stat.TH], Statistics Theory (math.ST), 510, Ky-Fan norm, Random matrices (probabilistic aspects), Multivariate analysis, [MATH.MATH-ST]Mathematics [math]/Statistics [math.ST], 62J05, FOS: Mathematics, multivariate regression, 62H99
60B20, 330, Linear regression; mixed models, estimator selection, Estimation in multivariate analysis, Ky-Fan norms, random matrix, Multivariate regression, Mathematics - Statistics Theory, [STAT.TH]Statistics [stat]/Statistics Theory [stat.TH], Statistics Theory (math.ST), 510, Ky-Fan norm, Random matrices (probabilistic aspects), Multivariate analysis, [MATH.MATH-ST]Mathematics [math]/Statistics [math.ST], 62J05, FOS: Mathematics, multivariate regression, 62H99
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 16 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
