descriptionPublicationkeyboard_double_arrow_right Article , Preprint 01 Dec 2016Embargo end date: 01 Jan 2016Publisher:IEEEJournal:2016 International Conference on Field-Programmable Technology (FPT)Funded by:NSERC | unidentified

Authors: Graham W. Taylor; Jasmina Vasiljevic; Shawki Areibi; Griffin Lacey; Roberto DiCecco; Paul Chow;

doi: 10.1109/fpt.2016.7929549 , 10.48550/arxiv.1609.09671

arXiv: http://arxiv.org/abs/1609.09671

Caffeinated FPGAs: FPGA framework For Convolutional Neural Networks

- Summary
- Subjects
- Related research
  (1)
- Metrics

Abstract

Convolutional Neural Networks (CNNs) have gained significant traction in the field of machine learning, particularly due to their high accuracy in visual recognition. Recent works have pushed the performance of GPU implementations of CNNs to significantly improve their classification and training times. With these improvements, many frameworks have become available for implementing CNNs on both CPUs and GPUs, with no support for FPGA implementations. In this work we present a modified version of the popular CNN framework Caffe, with FPGA support. This allows for classification using CNN models and specialized FPGA implementations with the flexibility of reprogramming the device when necessary, seamless memory transactions between host and device, simple-to-use test benches, and the ability to create pipelined layer implementations. To validate the framework, we use the Xilinx SDAccel environment to implement an FPGA-based Winograd convolution engine and show that the FPGA layer can be used alongside other layers running on a host processor to run several popular CNNs (AlexNet, GoogleNet, VGG A, Overfeat). The results show that our framework achieves 50 GFLOPS across 3x3 convolutions in the benchmarks. This is achieved within a practical framework, which will aid in future development of FPGA-based CNNs.

Related Organizations

University of Guelph
Canada
University of Toronto
Canada

Keywords

FOS: Computer and information sciences, Computer Science - Distributed, Parallel, and Cluster Computing, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Distributed, Parallel, and Cluster Computing (cs.DC)

1 Research products, page 1 of 1

convnet-benchmarks software on GitHub
IsRelatedTo

Impact byBIP!

	citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	70
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%