
arXiv: 1508.03606
We investigate the representation of hierarchical models in terms of marginals of other hierarchical models with smaller interactions. We focus on binary variables and marginals of pairwise interaction models whose hidden variables are conditionally independent given the visible variables. In this case the problem is equivalent to the representation of linear subspaces of polynomials by feedforward neural networks with soft-plus computational units. We show that every hidden variable can freely model multiple interactions among the visible variables, which allows us to generalize and improve previous results. In particular, we show that a restricted Boltzmann machine with less than $[ 2(\log(v)+1) / (v+1) ] 2^v-1$ hidden binary variables can approximate every distribution of $v$ visible binary variables arbitrarily well, compared to $2^{v-1}-1$ from the best previously known result.
18 pages, 4 figures, 2 tables, WUPES'15
FOS: Computer and information sciences, Artificial intelligence, Computer Science - Machine Learning, Artificial Intelligence and Image Processing, soft-plus unit, cs.LG, Learning and adaptive systems in artificial intelligence, Mathematics - Statistics Theory, Statistics Theory (math.ST), math.PR, Machine Learning (cs.LG), Machine Learning, Information and Computing Sciences, FOS: Mathematics, stat.TH, Interaction model, Artificial Intelligence & Image Processing, hierarchical model, cs.NE, Neural and Evolutionary Computing (cs.NE), Rectified linear unit, Numerical and Computational Mathematics, rectified linear unit, Statistics, Probability (math.PR), Computer Science - Neural and Evolutionary Computing, graphical model, Soft-plus unit, math.ST, interaction model, Hierarchical model, Restricted Boltzmann machine, Graphical model, restricted Boltzmann machine, Mathematics - Probability
FOS: Computer and information sciences, Artificial intelligence, Computer Science - Machine Learning, Artificial Intelligence and Image Processing, soft-plus unit, cs.LG, Learning and adaptive systems in artificial intelligence, Mathematics - Statistics Theory, Statistics Theory (math.ST), math.PR, Machine Learning (cs.LG), Machine Learning, Information and Computing Sciences, FOS: Mathematics, stat.TH, Interaction model, Artificial Intelligence & Image Processing, hierarchical model, cs.NE, Neural and Evolutionary Computing (cs.NE), Rectified linear unit, Numerical and Computational Mathematics, rectified linear unit, Statistics, Probability (math.PR), Computer Science - Neural and Evolutionary Computing, graphical model, Soft-plus unit, math.ST, interaction model, Hierarchical model, Restricted Boltzmann machine, Graphical model, restricted Boltzmann machine, Mathematics - Probability
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 14 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
