
doi: 10.1145/3365663
The pursuit of computational efficiency has led to the proliferation of throughput-oriented hardware, from GPUs to increasingly wide vector units on commodity processors and accelerators. This hardware is designed to efficiently execute data-parallel computations in a vectorized manner. However, many algorithms are more naturally expressed as divide-and-conquer, recursive, task-parallel computations. In the absence of data parallelism, it seems that such algorithms are not well suited to throughput-oriented architectures. This article presents a set of novel code transformations that expose the data parallelism latent in recursive, task-parallel programs. These transformations facilitate straightforward vectorization of task-parallel programs on commodity hardware. We also present scheduling policies that maintain high utilization of vector resources while limiting space usage. Across several task-parallel benchmarks, we demonstrate both efficient vector resource utilization and substantial speedup on chips using Intel's SSE4.2 vector units as well as on accelerators using Intel's AVX-512 units. We then show through rigorous sampling that, in practice, our vectorization techniques are effective for a much larger class of programs.
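To make the core idea concrete, the sketch below shows one way a recursive, divide-and-conquer computation can be re-expressed so that its leaf work becomes a flat, SIMD-friendly loop. This is only an illustration of the general approach, not the article's actual transformations: the task structure, the breadth-first blocking scheme, the `LEAF` grain size, and the use of `#pragma omp simd` are all assumptions made for this example.

```cpp
// Minimal illustrative sketch (assumed names and blocking scheme, not the
// paper's exact transformations): a recursive task-parallel sum is
// re-expressed so that leaf work across many tasks runs as one flat,
// vectorizable loop.
#include <cstddef>
#include <vector>

static inline float f(float x) { return x * x + 1.0f; }

// Original form: depth-first divide-and-conquer. Each call exposes little
// data parallelism on its own.
float sum_recursive(const float* a, std::size_t lo, std::size_t hi) {
    if (hi - lo <= 4) {                          // small leaf
        float s = 0.0f;
        for (std::size_t i = lo; i < hi; ++i) s += f(a[i]);
        return s;
    }
    std::size_t mid = lo + (hi - lo) / 2;
    return sum_recursive(a, lo, mid)             // conceptually two tasks
         + sum_recursive(a, mid, hi);
}

// Transformed form: expand tasks breadth-first into a frontier, then run the
// leaf work of all collected tasks as contiguous loops that the compiler can
// vectorize (e.g., with SSE/AVX).
float sum_blocked(const float* a, std::size_t n) {
    struct Task { std::size_t lo, hi; };
    const std::size_t LEAF = 4096;               // illustrative grain size
    std::vector<Task> frontier{{0, n}}, next, leaves;

    while (!frontier.empty()) {                  // breadth-first task expansion
        next.clear();
        for (Task t : frontier) {
            if (t.hi - t.lo <= LEAF) { leaves.push_back(t); continue; }
            std::size_t mid = t.lo + (t.hi - t.lo) / 2;
            next.push_back({t.lo, mid});
            next.push_back({mid, t.hi});
        }
        frontier.swap(next);
    }

    float s = 0.0f;
    for (Task t : leaves) {
        // Contiguous, branch-free inner loop: amenable to vectorization.
        #pragma omp simd reduction(+:s)
        for (std::size_t i = t.lo; i < t.hi; ++i) s += f(a[i]);
    }
    return s;
}
```

The design point this sketch gestures at is that breadth-first expansion groups many similar pieces of leaf work together, turning them into regular loops a vectorizer can handle; the scheduling policies described in the article would additionally bound how large such a frontier is allowed to grow, which the fixed `LEAF` constant here only approximates.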
| Indicator | Value | Description |
| --- | --- | --- |
| Selected citations | 2 | Citations derived from selected sources; an alternative to the "Influence" indicator. |
| Popularity | Average | The "current" impact/attention (the "hype") of the article in the research community, based on the underlying citation network. |
| Influence | Average | The overall/total impact of the article in the research community, based on the underlying citation network (diachronically). |
| Impulse | Average | The initial momentum of the article directly after its publication, based on the underlying citation network. |
