Bimodal-Distributed Binarized Neural Networks

descriptionPublicationkeyboard_double_arrow_right Article , Preprint , Other literature type 03 Nov 2022Embargo end date: 01 Jan 2022 English Publisher:MDPI AGJournal:Mathematics, volume 10, page 4,107 (eissn: 2227-7390,

Copyright policy )

Authors: Tal Rozen; Moshe Kimhi; Brian Chmiel; Avi Mendelson; Chaim Baskin;

doi: 10.3390/math10214107 , 10.48550/arxiv.2204.02004

arXiv: 2204.02004

Bimodal-Distributed Binarized Neural Networks

- Summary
- Subjects
- Related research
  (1)
- Metrics

Abstract

Binary neural networks (BNNs) are an extremely promising method for reducing deep neural networks’ complexity and power consumption significantly. Binarization techniques, however, suffer from ineligible performance degradation compared to their full-precision counterparts. Prior work mainly focused on strategies for sign function approximation during the forward and backward phases to reduce the quantization error during the binarization process. In this work, we propose a bimodal-distributed binarization method (BD-BNN). The newly proposed technique aims to impose a bimodal distribution of the network weights by kurtosis regularization. The proposed method consists of a teacher–trainer training scheme termed weight distribution mimicking (WDM), which efficiently imitates the full-precision network weight distribution to their binary counterpart. Preserving this distribution during binarization-aware training creates robust and informative binary feature maps and thus it can significantly reduce the generalization error of the BNN. Extensive evaluations on CIFAR-10 and ImageNet demonstrate that our newly proposed BD-BNN outperforms current state-of-the-art schemes.

Related Organizations

View all View all

Keywords

FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, convolutional neural networks; binarization; quantization; efficient inference deployment, Machine Learning (cs.LG), efficient inference deployment, convolutional neural networks, QA1-939, quantization, binarization, Mathematics

1 Research products, page 1 of 1

BD-BNN software on GitHub
IsRelatedTo

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	3
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average