Implications of Deep Compression with Complex Neural Networks

descriptionPublicationkeyboard_double_arrow_right Article 30 Jul 2023Publisher:Blue Eyes Intelligence Engineering and Sciences Engineering and Sciences Publication - BEIESPJournal:International Journal of Soft Computing and Engineering, volume 13, pages 1-6 (eissn: 2231-2307,

Copyright policy )

Authors: Lily Young; James Richrdson York; Byeong Kil Lee;

doi: 10.35940/ijsce.c3613.0713323

Implications of Deep Compression with Complex Neural Networks

- Summary
- Subjects
- Metrics

Abstract

Deep learning and neural networks have become increasingly popular in the area of artificial intelligence. These models have the capability to solve complex problems, such as image recognition or language processing. However, the memory utilization and power consumption of these networks can be very large for many applications. This has led to research into techniques to compress the size of these models while retaining accuracy and performance. One of the compression techniques is the deep compression three-stage pipeline, including pruning, trained quantization, and Huffman coding. In this paper, we apply the principles of deep compression to multiple complex networks in order to compare the effectiveness of deep compression in terms of compression ratio and the quality of the compressed network. While the deep compression pipeline is effectively working for CNN and RNN models to reduce the network size with small performance degradation, it is not properly working for more complicated networks such as GAN. In our GAN experiments, performance degradation is too much from the compression. For complex neural networks, careful analysis should be done for discovering which parameters allow a GAN to be compressed without loss in output quality.

Related Organizations

University of Colorado Colorado Springs
United States

Keywords

Neural Network, Network Compression, Pruning, Quantization, CNN, RNN, GAN.

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	6
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%