
AbstractIn recent years, Convolutional Neural Network CNN have been incorporated in a large number of applications, including multimedia retrieval and image classification. However, CNN based algorithms are computationally and resource intensive and therefore difficult to be used in embedded systems. FPGA based accelerators are becoming more and more popular in research and industry due to their flexibility and energy efficiency. However, the available resources and the size of the on-chip memory can limit the performance of the FPGA accelerator for CNN. This work proposes an High-Level Synthesis HLS library for CNN algorithms. It contains seven different streaming-capable CNN (plus two conversion) functions for creating large neural networks with deep pipelines. The different functions have many parameter settings (e.g. for resolution, feature maps, data types, kernel size, parallelilization, accuracy, etc.), which also enable compile-time optimizations. Our functions are integrated into the HiFlipVX library, which is an open source HLS FPGA library for image processing and object detection. This offers the possibility to implement different types of computer vision applications with one library. Due to the various configuration and parallelization possibilities of the library functions, it is possible to implement a high-performance, scalable and resource-efficient system, as our evaluation of the MobileNets algorithm shows.
Artificial neural network, Artificial intelligence, Flexibility (engineering), Compiler, Convolutional neural network, Database, Engineering, Image Feature Retrieval and Recognition Techniques, Computer engineering, CMOS Image Sensor Technology, FOS: Electrical engineering, electronic engineering, information engineering, FOS: Mathematics, Neural Network Architectures, Computer architecture, Electrical and Electronic Engineering, Embedded system, Geography, Statistics, Scalability, Deep learning, Computer science, Field-programmable gate array, Operating system, High-level synthesis, Combinatorics, Computer Science, Physical Sciences, Kernel (algebra), Deep Learning in Computer Vision and Image Recognition, Pipeline (software), Computer Vision and Pattern Recognition, Benchmark (surveying), Mathematics, Geodesy
Artificial neural network, Artificial intelligence, Flexibility (engineering), Compiler, Convolutional neural network, Database, Engineering, Image Feature Retrieval and Recognition Techniques, Computer engineering, CMOS Image Sensor Technology, FOS: Electrical engineering, electronic engineering, information engineering, FOS: Mathematics, Neural Network Architectures, Computer architecture, Electrical and Electronic Engineering, Embedded system, Geography, Statistics, Scalability, Deep learning, Computer science, Field-programmable gate array, Operating system, High-level synthesis, Combinatorics, Computer Science, Physical Sciences, Kernel (algebra), Deep Learning in Computer Vision and Image Recognition, Pipeline (software), Computer Vision and Pattern Recognition, Benchmark (surveying), Mathematics, Geodesy
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 13 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
