Downloads provided by UsageCounts
We investigate a parallelization strategy for dense matrix factorization (DMF) algorithms, using OpenMP, that departs from the legacy (or conventional) solution, which simply extracts concurrency from a multithreaded version of BLAS. This approach is also different from the more sophisticated runtime-assisted implementations, which decompose the operation into tasks and identify dependencies via directives and runtime support. Instead, our strategy attains high performance by explicitly embedding a static look-ahead technique into the DMF code, in order to overcome the performance bottleneck of the panel factorization, and realizing the trailing update via a cache-aware multi-threaded implementation of the BLAS. Although the parallel algorithms are specified with a highlevel of abstraction, the actual implementation can be easily derived from them, paving the road to deriving a high performance implementation of a considerable fraction of LAPACK functionality on any multicore platform with an OpenMP-like runtime.
28 pages
FOS: Computer and information sciences, Matrix factorizations, matrix factorizations, lightweight threads, OpenMP, multi-threading, openMP, look-ahead, high performance computing, ARQUITECTURA Y TECNOLOGIA DE COMPUTADORES, Multi-threading, Computer Science - Distributed, Parallel, and Cluster Computing, Lightweight threads, Look-ahead, Computer Science - Mathematical Software, High performance computing, Distributed, Parallel, and Cluster Computing (cs.DC), Mathematical Software (cs.MS)
FOS: Computer and information sciences, Matrix factorizations, matrix factorizations, lightweight threads, OpenMP, multi-threading, openMP, look-ahead, high performance computing, ARQUITECTURA Y TECNOLOGIA DE COMPUTADORES, Multi-threading, Computer Science - Distributed, Parallel, and Cluster Computing, Lightweight threads, Look-ahead, Computer Science - Mathematical Software, High performance computing, Distributed, Parallel, and Cluster Computing (cs.DC), Mathematical Software (cs.MS)
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 6 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
| views | 83 | |
| downloads | 140 |

Views provided by UsageCounts
Downloads provided by UsageCounts