
The information bottleneck (IB) problem addresses how to obtain relevant compressed representations $T$ of a random variable $X$ for the task of predicting $Y$. It is defined as a constrained optimization problem that maximizes the information the representation has about the task, $I(T;Y)$, while ensuring that a certain level of compression $r$ is achieved (i.e., $I(X;T) \leq r$). For practical reasons, the problem is usually solved by maximizing the IB Lagrangian, $L_{\text{IB}}(T;\beta) = I(T;Y) - \beta I(X;T)$, for many values of $\beta \in [0,1]$. Then, the curve of maximal $I(T;Y)$ for a given $I(X;T)$ is drawn and a representation with the desired predictability and compression is selected. It is known that when $Y$ is a deterministic function of $X$, the IB curve cannot be explored with this Lagrangian, and another Lagrangian has been proposed to tackle this problem, the squared IB Lagrangian: $L_{\text{sq-IB}}(T;\beta_{\text{sq}}) = I(T;Y) - \beta_{\text{sq}} I(X;T)^2$. In this paper, we (i) present a general family of Lagrangians that allow for the exploration of the IB curve in all scenarios; (ii) provide the exact one-to-one mapping between the Lagrange multiplier and the desired compression rate $r$ for known IB curve shapes; and (iii) show that we can approximately obtain a specific compression level with the convex IB Lagrangian for both known and unknown IB curve shapes. This eliminates the burden of solving the optimization problem for many values of the Lagrange multiplier. That is, we prove that we can solve the original constrained problem with a single optimization.
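To make the two objectives named above concrete, the following is a minimal sketch that evaluates $I(X;T)$, $I(T;Y)$, the IB Lagrangian, and the squared IB Lagrangian for small discrete distributions. It is an illustration only, not the paper's method: the function names, the toy distribution (with $Y$ a deterministic function of $X$), and the two hand-picked encoders are all assumptions introduced here, and the code only evaluates the objectives for a fixed encoder rather than optimizing them.

```python
import math


def mutual_information(joint):
    """I(A;B) in nats from a joint probability table joint[a][b]."""
    p_a = [sum(row) for row in joint]
    p_b = [sum(col) for col in zip(*joint)]
    mi = 0.0
    for a, row in enumerate(joint):
        for b, p in enumerate(row):
            if p > 0:  # terms with zero probability contribute nothing
                mi += p * math.log(p / (p_a[a] * p_b[b]))
    return mi


def ib_quantities(p_xy, p_t_given_x):
    """Return (I(X;T), I(T;Y)), assuming T depends on Y only through X."""
    n_x, n_y, n_t = len(p_xy), len(p_xy[0]), len(p_t_given_x[0])
    p_x = [sum(row) for row in p_xy]
    # Joint p(x, t) = p(x) p(t|x)
    p_xt = [[p_x[x] * p_t_given_x[x][t] for t in range(n_t)]
            for x in range(n_x)]
    # Joint p(t, y) = sum_x p(t|x) p(x, y)
    p_ty = [[sum(p_t_given_x[x][t] * p_xy[x][y] for x in range(n_x))
             for y in range(n_y)]
            for t in range(n_t)]
    return mutual_information(p_xt), mutual_information(p_ty)


def ib_lagrangian(i_xt, i_ty, beta):
    """L_IB = I(T;Y) - beta * I(X;T)."""
    return i_ty - beta * i_xt


def squared_ib_lagrangian(i_xt, i_ty, beta_sq):
    """L_sq-IB = I(T;Y) - beta_sq * I(X;T)^2."""
    return i_ty - beta_sq * i_xt ** 2


# Hypothetical toy problem: X uniform on {0, 1, 2, 3} and Y = X mod 2.
p_xy = [[0.25 if y == x % 2 else 0.0 for y in range(2)] for x in range(4)]

# Two hand-picked deterministic encoders p(t|x): T = X and T = X mod 2.
identity_encoder = [[1.0 if t == x else 0.0 for t in range(4)]
                    for x in range(4)]
parity_encoder = [[1.0 if t == x % 2 else 0.0 for t in range(2)]
                  for x in range(4)]

i_xt_id, i_ty_id = ib_quantities(p_xy, identity_encoder)
i_xt_par, i_ty_par = ib_quantities(p_xy, parity_encoder)

for name, i_xt, i_ty in [("T = X", i_xt_id, i_ty_id),
                         ("T = X mod 2", i_xt_par, i_ty_par)]:
    print(name,
          "I(X;T) =", round(i_xt, 4),
          "I(T;Y) =", round(i_ty, 4),
          "L_IB(beta=0.5) =", round(ib_lagrangian(i_xt, i_ty, 0.5), 4),
          "L_sq-IB(beta_sq=0.5) =",
          round(squared_ib_lagrangian(i_xt, i_ty, 0.5), 4))
```

In this toy case both encoders retain the same information about $Y$, but the parity encoder does so with a smaller $I(X;T)$, so it scores higher under either Lagrangian for any positive multiplier.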
FOS: Computer and information sciences, Computer Science - Machine Learning, information bottleneck; representation learning; mutual information; optimization, Science, QC1-999, Computer Science - Information Theory, Machine Learning (stat.ML), Astrophysics, Article, Machine Learning (cs.LG), Statistics - Machine Learning, Physics, Information Theory (cs.IT), Q, QB460-466, Engineering and Technology
