A highly optimized skeleton for unbalanced and deep divide‑and‑conquer algorithms on multi‑core clusters

Keywords: Template metaprogramming, Algorithmic skeletons, Multi-core clusters, Load balancing, Divide-and-conquer, Hybrid parallelism

Martínez, Millán A.; Fraguela, Basilio B.; Cabaleiro Domínguez, José Carlos


Minerva. Repositorio Institucional da Universidade de Santiago de Compostela

Article . 2022

License: CC BY

Full-Text: https://minerva.usc.gal/bitstreams/04010de2-0409-4c8a-afb0-e05c91a4c02c/download

Data sources: Minerva. Repositorio Institucional da Universidade de Santiago de Compostela

Recolector de Ciencia Abierta, RECOLECTA

Article . 2022

License: CC BY

Data sources: Recolector de Ciencia Abierta, RECOLECTA


Publication: Article, 01 Jan 2022, English. Publisher: Springer


handle: 10347/42112


Abstract

Efficiently implementing the divide-and-conquer pattern of parallelism in distributed-memory systems is both highly relevant, given its ubiquity, and difficult, given its recursive nature and the need to exchange tasks and data among processors. The task is further complicated on multi-core systems, where hybrid parallelism must be exploited to attain the best performance, and when workloads are unbalanced and deep, since additional measures are needed to balance the load and avoid the problems of deep recursion. This manuscript presents a parallel skeleton that fulfills all these requirements while providing a high level of usability. The evaluation shows that our proposal is on average 415.32% faster than MPI codes and 229.18% faster than MPI + OpenMP benchmarks, while offering an average improvement in the programmability metrics of 131.04% over MPI alternatives and 155.18% over MPI + OpenMP solutions.
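To make the pattern concrete, the following is a minimal shared-memory sketch of a generic divide-and-conquer skeleton in C++. It is not the skeleton described in the article: the names `dac`, `fib_dac`, and `par_depth` are hypothetical, threads are spawned with `std::async` only down to a fixed depth, and nothing here distributes tasks across cluster nodes or balances load dynamically. It only illustrates how a user might supply the four pieces of the pattern (base-case test, base-case solver, divide, combine) to a reusable template.

```cpp
#include <cassert>
#include <cstddef>
#include <future>
#include <numeric>
#include <vector>

// Hypothetical generic divide-and-conquer skeleton (illustration only).
// is_base : Problem -> bool             (is this a base case?)
// base    : Problem -> Result           (solve a base case directly)
// divide  : Problem -> vector<Problem>  (split into subproblems)
// combine : vector<Result> -> Result    (merge partial results)
// par_depth: number of recursion levels that may spawn threads.
template <typename Problem, typename IsBase, typename Base,
          typename Divide, typename Combine>
auto dac(const Problem& p, IsBase is_base, Base base, Divide divide,
         Combine combine, int par_depth) -> decltype(base(p)) {
    using Result = decltype(base(p));
    if (is_base(p)) return base(p);

    std::vector<Problem> subs = divide(p);
    assert(!subs.empty() && "divide must return at least one subproblem");

    std::vector<Result> results;
    results.reserve(subs.size());

    if (par_depth > 0) {
        // Run every subproblem except the first in its own task;
        // the current thread handles the first one itself.
        std::vector<std::future<Result>> futures;
        for (std::size_t i = 1; i < subs.size(); ++i) {
            futures.push_back(std::async(std::launch::async, [&, i] {
                return dac(subs[i], is_base, base, divide, combine,
                           par_depth - 1);
            }));
        }
        results.push_back(
            dac(subs[0], is_base, base, divide, combine, par_depth - 1));
        for (auto& f : futures) results.push_back(f.get());
    } else {
        // Below the parallel depth, recurse sequentially.
        for (const auto& s : subs)
            results.push_back(dac(s, is_base, base, divide, combine, 0));
    }
    return combine(results);
}

// Example of an unbalanced problem: naive Fibonacci, where the two
// subproblems (n-1 and n-2) have very different amounts of work.
long fib_dac(int n, int par_depth = 2) {
    return dac(
        n,
        [](int k) { return k < 2; },
        [](int k) { return static_cast<long>(k); },
        [](int k) { return std::vector<int>{k - 1, k - 2}; },
        [](const std::vector<long>& r) {
            return std::accumulate(r.begin(), r.end(), 0L);
        },
        par_depth);
}
```

Calling `fib_dac(20, 3)` evaluates the recursion with threads spawned on the top three levels only. The fixed depth cutoff is the simplest possible policy; with unbalanced workloads such as this one it leaves some threads idle long before others finish, which is the kind of problem that load balancing in a real skeleton is meant to address.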

Related Organizations

University of Santiago de Compostela
Spain


