
doi: 10.1137/19m1306166
In this work, we deal with the QR factorization of block-tridiagonal matrices, where the blocks are dense and rectangular. This work is motivated by a novel method for computing geodesics over Riemannian manifolds. If blocks are reduced sequentially along the diagonal, only limited parallelism is available. We propose a matrix permutation approach based on the Nested Dissection method which improves parallelism at the cost of additional computations and storage. We provide a detailed analysis of the approach showing that this extra cost is bounded. Finally, we present an implementation for shared memory systems relying on task parallelism and the use of a runtime system. Experimental results support the conclusions of our analysis and show that the proposed approach leads to good performance and scalability.
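The contrast between sequential reduction along the diagonal and a nested-dissection ordering can be illustrated with a toy sketch. It assumes a simplified model that is not taken from the paper: the block columns form a plain chain in which each one is coupled only to its immediate neighbours, and a single block column serves as separator. The function names `nested_dissection_order` and `critical_path` are hypothetical, and the sketch ignores the extra computation and storage that the abstract mentions.

```python
def nested_dissection_order(lo, hi):
    """Elimination order for chain nodes lo..hi-1 (toy model, not the paper's algorithm).

    The middle node is taken as a one-node separator and ordered last;
    the two halves it separates are ordered first, recursively.
    """
    n = hi - lo
    if n <= 0:
        return []
    if n <= 2:
        # Small pieces are simply processed in their natural order.
        return list(range(lo, hi))
    mid = lo + n // 2
    left = nested_dissection_order(lo, mid)
    right = nested_dissection_order(mid + 1, hi)
    return left + right + [mid]


def critical_path(n):
    """Number of dependent steps when the two halves are processed concurrently."""
    if n <= 0:
        return 0
    if n <= 2:
        return n
    left = n // 2
    right = n - left - 1
    # One step for the separator, after the longer of the two independent halves.
    return 1 + max(critical_path(left), critical_path(right))


if __name__ == "__main__":
    print(nested_dissection_order(0, 7))
    print("sequential steps:", 7, "nested-dissection critical path:", critical_path(7))
```

In this simplified model, the two halves on either side of a separator share no chain edge, so they can be handled as independent tasks; the sequential sweep needs one dependent step per block column, whereas the permuted ordering shortens the chain of dependent steps to the depth of the recursion.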
Keywords: QR factorization, nested dissection, task-based parallelism
AMS subject classifications: 65F05, 65F20, 68W10, 68W40
Subjects: [INFO.INFO-DC] Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC]; [INFO.INFO-MS] Computer Science [cs]/Mathematical Software [cs.MS]; 004
