
arXiv: 0909.4061
Low-rank matrix approximations, such as the truncated singular value decomposition and the rank-revealing QR decomposition, play a central role in data analysis and scientific computing. This work surveys and extends recent research which demonstrates that randomization offers a powerful tool for performing low-rank matrix approximation. These techniques exploit modern computational architectures more fully than classical methods and open the possibility of dealing with truly massive data sets. This paper presents a modular framework for constructing randomized algorithms that compute partial matrix decompositions. These methods use random sampling to identify a subspace that captures most of the action of a matrix. The input matrix is then compressed---either explicitly or implicitly---to this subspace, and the reduced matrix is manipulated deterministically to obtain the desired low-rank factorization. In many cases, this approach beats its classical competitors in terms of accuracy, speed, and robustness. These claims are supported by extensive numerical experiments and a detailed error analysis.
Johnson– Lindenstrauss lemma, dimension reduction, principal component analysis, interpolative decomposition, Probability (math.PR), singular value decomposition, random matrix, Numerical Analysis (math.NA), eigenvalue decomposition, rank-revealing QR factorization, matrix approximation, randomized algorithm, 004, 510, pass-efficient algorithm, parallel algorithm, streaming algorithm, FOS: Mathematics, Mathematics - Numerical Analysis, Mathematics - Probability
Johnson– Lindenstrauss lemma, dimension reduction, principal component analysis, interpolative decomposition, Probability (math.PR), singular value decomposition, random matrix, Numerical Analysis (math.NA), eigenvalue decomposition, rank-revealing QR factorization, matrix approximation, randomized algorithm, 004, 510, pass-efficient algorithm, parallel algorithm, streaming algorithm, FOS: Mathematics, Mathematics - Numerical Analysis, Mathematics - Probability
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 3K | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 0.01% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 0.01% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 0.1% |
