
arXiv: 2303.02352
handle: 20.500.14243/456142 , 20.500.14243/520767 , 11572/401010
We present and release in open source format a sparse linear solver which efficiently exploits heterogeneous parallel computers. The solver can be easily integrated into scientific applications that need to solve large and sparse linear systems on modern parallel computers made of hybrid nodes hosting NVIDIA Graphics Processing Unit (GPU) accelerators. The work extends our previous efforts in the exploitation of a single GPU accelerator and proposes an implementation, based on the hybrid MPI-CUDA software environment, of a Krylov-type linear solver relying on an efficient Algebraic MultiGrid (AMG) preconditioner already available in the BootCMatchG library. Our design for the hybrid implementation has been driven by the best practices for minimizing data communication overhead when multiple GPUs are employed, yet preserving the efficiency of the single GPU kernels. Strong and weak scalability results on well-known benchmark test cases of the new version of the library are discussed. Comparisons with the Nvidia AmgX solution show an improvement of up to 2.0x in the solve phase.
FOS: Computer and information sciences, iterative sparse linear solvers, Computer Science - Distributed, Parallel, and Cluster Computing, parallel numerical algorithms, GPU accelerators, Computer Science - Mathematical Software, Distributed, Parallel, and Cluster Computing (cs.DC), GPU accelerators; heterogeneous computing; iterative sparse linear solvers; parallel numerical algorithms; scalability, heterogeneous computing, Mathematical Software (cs.MS), scalability
FOS: Computer and information sciences, iterative sparse linear solvers, Computer Science - Distributed, Parallel, and Cluster Computing, parallel numerical algorithms, GPU accelerators, Computer Science - Mathematical Software, Distributed, Parallel, and Cluster Computing (cs.DC), GPU accelerators; heterogeneous computing; iterative sparse linear solvers; parallel numerical algorithms; scalability, heterogeneous computing, Mathematical Software (cs.MS), scalability
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 1 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
