Integration of a systolic array based hardware accelerator into a DNN operator auto-tuning framework

Publication: Article, Preprint
Date: 21 Sep 2023
Embargo end date: 01 Jan 2022
Publisher: ACM
Journal: Proceedings of the 2023 Workshop on Compilers, Deployment, and Tooling for Edge AI

Authors: Federico Nicolas Peccia; Oliver Bringmann

doi: 10.1145/3615338.3618130 , 10.48550/arxiv.2212.03034

arXiv: 2212.03034


Abstract

The deployment of neural networks on heterogeneous SoCs coupled with custom accelerators is a challenging task because of the lack of end-to-end software tools provided for these systems. Moreover, the low-level schedules and mapping strategies that accelerator developers already provide for typical tensor operations are not necessarily the best possible ones for each particular use case. This is why frameworks that automatically test the performance of the generated code on a specific hardware configuration are of special interest. In this work, the integration between the code generation framework TVM and the systolic array-based accelerator Gemmini is presented. A generic schedule to offload the GEneral Matrix Multiply (GEMM) tensor operation onto Gemmini is detailed, and its suitability is tested by executing the AutoTVM tuning process on it. Our generated code achieves a peak throughput of 46 giga-operations per second (GOP/s) under a 100 MHz clock on a Xilinx ZCU102 FPGA, outperforming previous work. Furthermore, the code generated by this integration was able to surpass the default hand-tuned schedules provided by the Gemmini developers in real-world workloads.
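
To make the idea of tuning a GEMM schedule concrete, here is a minimal pure-Python sketch. It is not the schedule from the paper and it does not use the TVM, AutoTVM, or Gemmini APIs. It assumes a loop-tiled matrix multiply whose three tile sizes act as tunable knobs, plus a hypothetical `tune` helper that times each candidate and keeps the fastest one, which is the basic measure-and-select loop an auto-tuner performs. All function names are illustrative. The last helper converts the throughput reported in the abstract (46 giga-operations per second at 100 MHz) into operations per clock cycle.

```python
import time


def gemm_reference(a, b):
    """Naive matrix multiply on lists of lists, used as a correctness check."""
    k, m = len(b), len(b[0])
    return [[sum(row[p] * b[p][j] for p in range(k)) for j in range(m)]
            for row in a]


def gemm_tiled(a, b, tile_i, tile_j, tile_k):
    """Loop-tiled matrix multiply; the tile sizes are the tunable knobs."""
    n, k, m = len(a), len(b), len(b[0])
    c = [[0] * m for _ in range(n)]
    for i0 in range(0, n, tile_i):
        for j0 in range(0, m, tile_j):
            for k0 in range(0, k, tile_k):
                # Inner loops cover one tile; min() handles ragged edges.
                for i in range(i0, min(i0 + tile_i, n)):
                    for p in range(k0, min(k0 + tile_k, k)):
                        a_ip = a[i][p]
                        for j in range(j0, min(j0 + tile_j, m)):
                            c[i][j] += a_ip * b[p][j]
    return c


def tune(a, b, candidates):
    """Hypothetical tuner: time each (tile_i, tile_j, tile_k), keep the fastest."""
    best = None
    for tiles in candidates:
        start = time.perf_counter()
        gemm_tiled(a, b, *tiles)
        elapsed = time.perf_counter() - start
        if best is None or elapsed < best[1]:
            best = (tiles, elapsed)
    return best


def ops_per_cycle(giga_ops_per_second, clock_mhz):
    """Convert a throughput in GOP/s at a given clock into operations per cycle."""
    return giga_ops_per_second * 1e9 / (clock_mhz * 1e6)


if __name__ == "__main__":
    a = [[(i + j) % 5 for j in range(32)] for i in range(32)]
    b = [[(i * j) % 3 for j in range(32)] for i in range(32)]
    tiles, seconds = tune(a, b, [(4, 4, 4), (8, 8, 8), (16, 16, 16)])
    print("fastest tile sizes on this host:", tiles)
    print("ops per cycle at 46 GOP/s, 100 MHz:", ops_per_cycle(46, 100))
```

With the figures from the abstract, `ops_per_cycle(46, 100)` evaluates to 460 operations per clock cycle. In a real flow the timing would be taken on the accelerator rather than on the host interpreter, and the search would typically be guided by a cost model instead of exhaustively timing every candidate.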

6 pages, 5 figures, submitted to the CODAI Workshop at the 2022 ESWEEK

Related Organizations

Research Center for Information Technology
Germany
University of Tübingen
Germany

Keywords

Performance (cs.PF), FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Science - Performance, Computer Science - Programming Languages, Machine Learning (cs.LG), Programming Languages (cs.PL)

Related research (4 research products)

- GeMMINi: Prototipado de interfaces de usuario sobre múltiples dispositivos. Una estrategia basada en Líneas de Producto y MDD (2012)
- Carry-Propagation-Adder-Factored Gemmini Systolic Array for Machine Learning Acceleration (2021)
- Deep Learning Accelerators’ Configuration Space Exploration Effect on Performance and Resource Utilization: A Gemmini Case Study (2023)
- Gemmini: Enabling Systematic Deep-Learning Architecture Evaluation via Full-Stack Integration (2021)

Impact by BIP!

- Selected citations: 5 (derived from selected sources)
- Popularity: Top 10% (current attention in the research community, based on the underlying citation network)
- Influence: Average (overall impact in the research community, based on the underlying citation network)
- Impulse: Top 10% (initial momentum directly after publication, based on the underlying citation network)


Open access route: Green

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering
