doi: 10.1609/aaai.v36i7.20763 , 10.48550/arxiv.2201.03624 , 10.5281/zenodo.6000363 , 10.5281/zenodo.6000362
arXiv: 2201.03624
handle: 20.500.14279/29868
This work addresses the long-established problem of learning diversified representations. To this end, we combine information-theoretic arguments with stochastic competition-based activations, namely Stochastic Local Winner-Takes-All (LWTA) units. We depart from the conventional deep architectures commonly used in representation learning, which rely on non-linear activations, and replace them with sets of locally and stochastically competing linear units. In this setting, the units of each layer are organized into blocks of competitors, and each layer yields sparse outputs determined by the outcome of the competition within each block. The competition mechanism is stochastic: the winner of each block is determined by posterior sampling. We further endow the considered networks with the ability to infer the sub-part of the network that is essential for modeling the data at hand; to this end, we impose appropriate stick-breaking priors. To further enrich the information of the emerging representations, we resort to information-theoretic principles, namely the Information Competing Process (ICP). All components are then tied together under the stochastic Variational Bayes framework for inference. We perform a thorough experimental investigation of our approach using benchmark image classification datasets. As we show experimentally, the resulting networks exhibit strong discriminative representation learning abilities. In addition, the introduced paradigm allows for a principled mechanism for investigating the emerging intermediate network representations.
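The block-wise competition described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `stochastic_lwta`, the softmax parameterization of the winner probabilities, and the plain categorical sampling are assumptions made for illustration only, and the stick-breaking priors, the ICP objective, and the variational training procedure are all omitted.

```python
import numpy as np

def stochastic_lwta(x, weights, num_blocks, units_per_block, rng):
    """Hypothetical sketch of one stochastic LWTA layer (not the authors' code)."""
    # Linear responses of all units, grouped into blocks of competitors.
    h = x @ weights                                   # (batch, blocks * units)
    h = h.reshape(-1, num_blocks, units_per_block)    # (batch, blocks, units)

    # Assumption: winner probabilities are a softmax over each block's responses.
    logits = h - h.max(axis=-1, keepdims=True)        # subtract max for stability
    probs = np.exp(logits)
    probs /= probs.sum(axis=-1, keepdims=True)

    # Sample one winner per block by inverse-CDF sampling.
    u = rng.random(size=probs.shape[:-1] + (1,))
    winners = (probs.cumsum(axis=-1) < u).sum(axis=-1)
    winners = np.minimum(winners, units_per_block - 1)  # guard against rounding

    # Only the winner passes its linear response; the losers output zero.
    mask = np.eye(units_per_block)[winners]           # one-hot, (batch, blocks, units)
    return (h * mask).reshape(x.shape[0], -1), winners

# Usage with arbitrary illustrative sizes: 3 blocks of 2 competing units each.
rng = np.random.default_rng(0)
x = rng.normal(size=(4, 8))
W = rng.normal(size=(8, 6))
out, winners = stochastic_lwta(x, W, num_blocks=3, units_per_block=2, rng=rng)
```

Each row of `out` has at most one non-zero entry per block, which is the source of the sparsity mentioned above; because the winner is sampled rather than chosen by an argmax, repeated forward passes on the same input can activate different units.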
FOS: Computer and information sciences, Artificial intelligence, Computer Science - Machine Learning, Information theory, Stochastic systems, Classification (of information), Machine Learning (stat.ML), Network layers, Electrical Engineering - Electronic Engineering - Information Engineering, Machine Learning (cs.LG), Statistics - Machine Learning, Engineering and Technology
| Indicator | Description | Value |
| --- | --- | --- |
| selected citations | These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 3 |
| popularity | This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% |
| influence | This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% |
| impulse | This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | | 6 |
| downloads | | 7 |

Views provided by UsageCounts
Downloads provided by UsageCounts