
The Block Recursive Inversion (BRI) algorithm calculates the inversion of large k x k block matrices with limited memory during the entire processing because it calculates one block of the inverse at a time. However, the lower memory consumption is counterbalanced by higher computational complexity. We propose a parallel BRI implementation, which also calculates one block at a time, to reduce execution time and extend its applicability by exploiting modern multi-core architectures. The proposed parallel BRI was implemented for shared memory systems in OpenMP. The results of a performance and scalability analysis for different use cases reveals opposite trends in execution time, with the proposed parallel implementation being faster for larger k. Although not weakly scalable for a fixed k, scalability tends to increase with the increase of k or, equivalently, with the reduction of memory requirements.
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
