
BLAS-level functions are the cornerstone of a large subset of applications. While a large body of work exists on efficient, large-scale implementations of routines such as gemv, interest in small-sized, highly optimized versions of these routines has emerged more recently. In this paper, we show how a modern C++ approach based on generative programming techniques, combining vectorization and loop unrolling within a meta-programming framework, can automatically generate efficient code for such routines that is competitive with existing hand-tuned library kernels, at a very low programming effort compared to writing assembly code. In particular, we analyze the performance of automatically generated small-sized gemv kernels on both Intel x86 and ARM processors. A performance comparison with the OpenBLAS gemv kernel on small matrices, with sizes ranging from 4 to 32, shows that our C++ kernels are highly efficient and can outperform OpenBLAS gemv by up to a factor of 3.
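The kind of meta-programming the abstract describes can be sketched as follows. This is an illustrative example only, not the paper's actual implementation: it uses C++17 fold expressions to generate a fully unrolled gemv (y = A·x) for matrix dimensions known at compile time, giving the compiler straight-line code it can vectorize.

```cpp
#include <array>
#include <cstddef>
#include <utility>

// Illustrative sketch (assumption, not the paper's code): a compile-time
// unrolled dot product. The fold expression expands to
// row[0]*x[0] + row[1]*x[1] + ... with no runtime loop, so the optimizer
// sees straight-line code it can map onto SIMD registers.
template <typename T, std::size_t... J>
T dot(const T* row, const T* x, std::index_sequence<J...>)
{
    return ((row[J] * x[J]) + ...);
}

// y = A * x for a row-major M x N matrix whose dimensions are template
// parameters, so each row's inner product is fully unrolled at compile time.
template <std::size_t M, std::size_t N, typename T>
void gemv(const std::array<T, M * N>& A,
          const std::array<T, N>& x,
          std::array<T, M>& y)
{
    for (std::size_t i = 0; i < M; ++i)
        y[i] = dot(A.data() + i * N, x.data(), std::make_index_sequence<N>{});
}
```

Because M and N are compile-time constants, instantiating `gemv<4, 4, double>` produces a specialized kernel for that one size, which is what makes such generated code competitive with hand-tuned kernels on small matrices.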
