
In this paper, we introduce an approach called coupled filters decomposition, which builds on the key observation that redundancy exists among filters within a convolutional layer, meaning that similar filters can produce partially overlapping outputs. Leveraging this insight, we propose a joint decomposition of filters using coupled tensor decompositions, specifically coupled canonical polyadic decomposition, which enables the sharing of a common factor matrix across similar filters. This joint factorization not only reduces the number of parameters but also lowers computational complexity by eliminating redundant computations. To further improve efficiency, we first cluster the filters before decomposition. The grouping relies on a custom metric based on the subspace spanned by the shared-mode factor. Within each group, the coupling constraint is less restrictive. Extensive experiments across various architectures, datasets, and tasks validate the effectiveness of our method, demonstrating its competitive performance compared to state-of-the-art model compression techniques. Our code is available for research purposes at codec-ai.github.io.
coupled tensor decomposition, canonical polyadic decomposition, low-rank approximations, neural network compression, [STAT.ML] Statistics [stat]/Machine Learning [stat.ML], [SPI.SIGNAL] Engineering Sciences [physics]/Signal and Image processing
coupled tensor decomposition, canonical polyadic decomposition, low-rank approximations, neural network compression, [STAT.ML] Statistics [stat]/Machine Learning [stat.ML], [SPI.SIGNAL] Engineering Sciences [physics]/Signal and Image processing
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
