DTM@GPU: Characterizing and evaluating trace redundancy in GPU

descriptionPublicationkeyboard_double_arrow_right Article 28 Feb 2018 English Publisher:WileyJournal:Concurrency and Computation: Practice and Experience, volume 31 (issn: 1532-0626, eissn: 1532-0634,

Copyright policy )

Authors: Leandro A. J. Marzulo; Alexandre da Costa Sena; Alexandre Solon Nery; Cristiana Bentes; Igor Machado Coelho; Maria Clicia Stelling de Castro; Saulo T. Oliveira; +2 Authors

doi: 10.1002/cpe.4450

DTM@GPU: Characterizing and evaluating trace redundancy in GPU

- Summary
- Metrics

Abstract

SummaryIn a program, there is usually a significant amount of instructions that are repeatedly executed with the same inputs during the execution. This redundancy allows the reuse of previous computations, potentially reducing the program execution time. The Dynamic Trace Memoization technique (DTM) was proposed to exploit the reuse of a dynamic sequence of redundant instructions for superscalar CPUs. This paper proposes the application of the DTM technique on a GPU architecture. We propose the DTM@GPU model that adapts the original DTM technique to the NVIDIA GPU architecture by introducing architectural modifications and the identification of different trace reuse styles in multithreaded environments. We investigate reuse opportunities in real‐world GPU applications and the potential performance gains. We also perform a detailed investigation on the characteristics of the reused traces. This characterization shows the number and size of the reused traces, the influence of the cache size on reuse rates, and the cycles that are saved when all threads in a warp reuse instructions or traces. The results show approximately up to 35.3% of reuse, yielding an estimated speedup gain of 10.7%.

Related Organizations

Federal University of Rio de Janeiro
Brazil
Universidade do Estado do Rio de Janeiro
Brazil

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	1
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average