Sparse Convolutional Neural Networks

Publication: Article, Conference object
Date: 01 Jun 2015
Country: United States
Publisher: IEEE
Journal: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Liu, Baoyuan; Wang, Min; Foroosh, Hassan; Tappen, Marshall; Pensky, Marianna

doi: 10.1109/cvpr.2015.7298681

Abstract

Deep neural networks have achieved remarkable performance in both image classification and object detection problems, at the cost of a large number of parameters and computational complexity. In this work, we show how to reduce the redundancy in these parameters using a sparse decomposition. Maximum sparsity is obtained by exploiting both inter-channel and intra-channel redundancy, with a fine-tuning step that minimizes the recognition loss caused by maximizing sparsity. This procedure zeros out more than 90% of parameters, with a drop in accuracy of less than 1% on the ILSVRC2012 dataset. We also propose an efficient sparse matrix multiplication algorithm on CPU for Sparse Convolutional Neural Networks (SCNN) models. Our CPU implementation demonstrates much higher efficiency than the off-the-shelf sparse matrix libraries, with a significant speedup realized over the original dense network. In addition, we apply the SCNN model to the object detection problem, in conjunction with a cascade model and sparse fully connected layers, to achieve significant speedups.
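
To make the abstract's core idea concrete, the following is a minimal illustrative sketch and not the authors' method. It assumes simple magnitude pruning as a stand-in for the paper's sparse decomposition, and a textbook compressed sparse row (CSR) matrix-vector product as a stand-in for the paper's CPU multiplication algorithm. All function names (`prune_by_magnitude`, `to_csr`, `csr_matvec`, `dense_matvec`) and the matrix sizes are hypothetical. It shows only why zeroing about 90% of the weights reduces the number of multiply-adds when the zeros are skipped.

```python
import random


def prune_by_magnitude(dense, keep_fraction):
    """Zero all but the largest-magnitude weights.

    Assumption: plain magnitude pruning, used here only as a stand-in
    for the sparse decomposition described in the abstract.
    """
    magnitudes = sorted((abs(v) for row in dense for v in row), reverse=True)
    keep = max(1, int(len(magnitudes) * keep_fraction))
    threshold = magnitudes[keep - 1]
    return [[v if abs(v) >= threshold else 0.0 for v in row] for row in dense]


def to_csr(dense):
    """Convert a dense row-major matrix to CSR (values, column indices, row pointers)."""
    values, col_idx, row_ptr = [], [], [0]
    for row in dense:
        for j, v in enumerate(row):
            if v != 0.0:
                values.append(v)
                col_idx.append(j)
        row_ptr.append(len(values))
    return values, col_idx, row_ptr


def csr_matvec(csr, x):
    """Multiply a CSR matrix by a vector, touching only stored nonzeros."""
    values, col_idx, row_ptr = csr
    y = []
    for i in range(len(row_ptr) - 1):
        acc = 0.0
        for k in range(row_ptr[i], row_ptr[i + 1]):
            acc += values[k] * x[col_idx[k]]
        y.append(acc)
    return y


def dense_matvec(dense, x):
    """Reference dense product, used to check the sparse result."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in dense]


random.seed(0)
rows, cols = 8, 50  # hypothetical layer size, chosen only for illustration
weights = [[random.gauss(0.0, 1.0) for _ in range(cols)] for _ in range(rows)]
x = [random.gauss(0.0, 1.0) for _ in range(cols)]

pruned = prune_by_magnitude(weights, keep_fraction=0.1)
csr = to_csr(pruned)
sparsity = 1.0 - len(csr[0]) / (rows * cols)

y_sparse = csr_matvec(csr, x)
y_dense = dense_matvec(pruned, x)
```

In this sketch the sparse product performs one multiply-add per stored nonzero, so the arithmetic work falls roughly in proportion to the sparsity. Whether that translates into wall-clock speedup depends on memory access patterns, which is the problem the paper's dedicated CPU algorithm addresses and which this sketch does not attempt to reproduce.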

Country

United States

Related Organizations

Amazon.com, United States
Amazon (United States), United States
University of Central Florida, United States

Impact by BIP!

selected citations: 135. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
popularity: Top 1%. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
influence: Top 1%. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
impulse: Top 10%. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.