
arXiv: 1606.09402
Randomized algorithms for low-rank matrix approximation are investigated, with emphasis on the fixed-precision problem and on computational efficiency for large matrices. The algorithms are based on the so-called QB factorization, where Q is an orthonormal matrix. First, a mechanism for calculating the approximation error in the Frobenius norm is proposed, which enables efficient adaptive rank determination for large and/or sparse matrices. It can be combined with any QB-form factorization algorithm in which the rows of B are generated incrementally. Applied to the blocked randQB algorithm of P.-G. Martinsson and S. Voronin, it yields an algorithm called randQB_EI. We then further revise the algorithm to obtain a pass-efficient algorithm, randQB_FP, which is mathematically equivalent to the existing randQB algorithms and is also suitable for the fixed-precision problem. In particular, randQB_FP can serve as a single-pass algorithm for calculating leading singular values under certain conditions. With large and/or sparse test matrices, we have empirically validated the merits of the proposed techniques, which exhibit remarkable speedup and memory savings over the blocked randQB algorithm. We have also demonstrated that the single-pass algorithm derived from randQB_FP is much more accurate than an existing single-pass algorithm. Finally, with data from a scenic image and an information retrieval application, we have shown the advantages of the proposed algorithms over the adaptive range finder algorithm for solving the fixed-precision problem.
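The error mechanism described in the abstract can be illustrated with a short sketch. It relies on a standard identity: if Q has orthonormal columns and B = Q^T A, then ||A - QB||_F^2 = ||A||_F^2 - ||B||_F^2, so the error can be updated from each new block of rows of B without forming A - QB. The code below is a minimal NumPy sketch in the spirit of a blocked QB scheme with such an error indicator, not the authors' implementation; the function name `rand_qb_ei_sketch`, its parameters, and the re-orthogonalization step are assumptions made for illustration.

```python
import numpy as np

def rand_qb_ei_sketch(A, rel_tol, block=10, max_rank=None, seed=0):
    """Illustrative blocked QB factorization with a Frobenius-norm error indicator.

    Hypothetical helper (not from the paper's code): grows Q and B block by
    block until ||A - Q B||_F <= rel_tol * ||A||_F or max_rank is reached.
    """
    rng = np.random.default_rng(seed)
    m, n = A.shape
    max_rank = min(m, n) if max_rank is None else max_rank

    Q = np.zeros((m, 0))
    B = np.zeros((0, n))

    norm_a_sq = np.linalg.norm(A, "fro") ** 2
    err_sq = norm_a_sq                      # squared error of the empty approximation
    threshold = (rel_tol ** 2) * norm_a_sq

    while Q.shape[1] < max_rank and err_sq > threshold:
        b = min(block, max_rank - Q.shape[1])
        Omega = rng.standard_normal((n, b))

        # Sample the range of the current residual A - Q B without forming it.
        Y = A @ Omega - Q @ (B @ Omega)
        Qi, _ = np.linalg.qr(Y)
        # Re-orthogonalize against the existing basis (assumed safeguard).
        Qi, _ = np.linalg.qr(Qi - Q @ (Q.T @ Qi))

        Bi = Qi.T @ A                       # new rows of B
        Q = np.hstack([Q, Qi])
        B = np.vstack([B, Bi])

        # Error indicator update: subtract the energy captured by the new rows.
        err_sq -= np.linalg.norm(Bi, "fro") ** 2

    return Q, B, float(np.sqrt(max(err_sq, 0.0)))
```

Because the indicator is obtained by subtracting nearly equal quantities, it can lose accuracy in floating point when the requested tolerance is very small; the sketch makes no attempt to guard against that.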
21 pages, 10 figures
Numerical computation of eigenvalues and eigenvectors of matrices, FOS: Computer and information sciences, Eigenvalues, singular values, and eigenvectors, Low-rank matrix approximation, Factorization of matrices, randomized algorithm, Computer Science - Data Structures and Algorithms, low-rank matrix approximation, FOS: Mathematics, Data Structures and Algorithms (cs.DS), Mathematics - Numerical Analysis, fixed-precision problem, Randomized algorithm, Computer Sciences, Theory and Algorithms, Pass-efficient algorithm, Randomized algorithms, Other matrix algorithms, Numerical Analysis (math.NA), Fixed-precision problem, 15A18, 65F30, 65F15, 68W20, 60B20, Random matrices (probabilistic aspects), Computer Science - Distributed, Parallel, and Cluster Computing, Adaptive rank determination, Distributed, Parallel, and Cluster Computing (cs.DC), adaptive rank determination
