
arXiv: 1503.08060
SummaryExpectation propagation (EP) is a widely successful algorithm for variational inference. EP is an iterative algorithm used to approximate complicated distributions, typically to find a Gaussian approximation of posterior distributions. In many applications of this type, EP performs extremely well. Surprisingly, despite its widespread use, there are very few theoretical guarantees on Gaussian EP, and it is quite poorly understood. To analyse EP, we first introduce a variant of EP: averaged EP, which operates on a smaller parameter space. We then consider averaged EP and EP in the limit of infinite data, where the overall contribution of each likelihood term is small and where posteriors are almost Gaussian. In this limit, we prove that the iterations of both averaged EP and EP are simple: they behave like iterations of Newton’s algorithm for finding the mode of a function. We use this limit behaviour to prove that EP is asymptotically exact, and to obtain other insights into the dynamic behaviour of EP, e.g. that it may diverge under poor initialization exactly like Newton’s method. EP is a simple algorithm to state, but a difficult one to study. Our results should facilitate further research into the theoretical properties of this important method.
Statistics and Probability, FOS: Computer and information sciences, Bayesian inference, Mathematics - Statistics Theory, Statistics Theory (math.ST), algorithms, Statistics - Computation, 004, 510, models, [STAT.ML]Statistics [stat]/Machine Learning [stat.ML], statistics, equations, Characterization and structure theory of statistical distributions, expectation propagation, FOS: Mathematics, Statistics, Probability and Uncertainty, [STAT.CO]Statistics [stat]/Computation [stat.CO], Computation (stat.CO), variational inference
Statistics and Probability, FOS: Computer and information sciences, Bayesian inference, Mathematics - Statistics Theory, Statistics Theory (math.ST), algorithms, Statistics - Computation, 004, 510, models, [STAT.ML]Statistics [stat]/Machine Learning [stat.ML], statistics, equations, Characterization and structure theory of statistical distributions, expectation propagation, FOS: Mathematics, Statistics, Probability and Uncertainty, [STAT.CO]Statistics [stat]/Computation [stat.CO], Computation (stat.CO), variational inference
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 11 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
