Ergodic mirror descent

Publication: Article, Preprint, Conference object
Date: 01 Sep 2011
Embargo end date: 01 Jan 2011
Publisher: IEEE
Journal: 2011 49th Annual Allerton Conference on Communication, Control, and Computing (Allerton)

Authors: John C. Duchi; Alekh Agarwal; Mikael Johansson; Michael I. Jordan

DOI: 10.1109/allerton.2011.6120236, 10.1137/110836043, 10.48550/arxiv.1105.4681

arXiv: 1105.4681

Abstract

We generalize stochastic subgradient descent methods to situations in which we do not receive independent samples from the distribution over which we optimize, but instead receive samples that are coupled over time. We show that as long as the source of randomness is suitably ergodic (that is, it converges quickly enough to a stationary distribution), the method enjoys strong convergence guarantees, both in expectation and with high probability. This result has implications for stochastic optimization in high-dimensional spaces, peer-to-peer distributed optimization schemes, decision problems with dependent data, and stochastic optimization problems over combinatorial spaces.

35 pages, 2 figures
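
The setting described in the abstract can be illustrated with a minimal sketch. It is not the paper's algorithm or analysis, only the Euclidean special case of mirror descent (projected stochastic subgradient descent) run on dependent samples. Everything named here is an assumption made for illustration: the two-state Markov chain, the quadratic loss 0.5 * (x - xi)^2, the interval [lo, hi], the 1/sqrt(t) step size, and the function names `markov_samples` and `ergodic_subgradient_descent`.

```python
import math
import random


def markov_samples(p01, p10, steps, rng):
    """Yield a dependent 0/1 sequence from a two-state Markov chain.

    p01 is the probability of moving 0 -> 1, p10 of moving 1 -> 0.
    The stationary probability of state 1 is p01 / (p01 + p10).
    """
    state = 0
    for _ in range(steps):
        flip = p01 if state == 0 else p10
        if rng.random() < flip:
            state = 1 - state
        yield float(state)


def ergodic_subgradient_descent(samples, x0=0.0, lo=-1.0, hi=2.0, step_scale=1.0):
    """Projected subgradient descent on the loss 0.5 * (x - xi)^2.

    Samples arrive in order from a dependent source (no i.i.d. assumption).
    Returns the average of the iterates, the usual output of such methods.
    """
    x = x0
    running_sum = 0.0
    count = 0
    for t, xi in enumerate(samples, start=1):
        running_sum += x
        count += 1
        grad = x - xi                      # gradient of 0.5 * (x - xi)^2 at x
        x -= (step_scale / math.sqrt(t)) * grad
        x = min(hi, max(lo, x))            # Euclidean projection onto [lo, hi]
    return running_sum / count


rng = random.Random(0)
x_bar = ergodic_subgradient_descent(markov_samples(0.1, 0.3, 50_000, rng))
print(x_bar)
```

Under these assumptions the stationary distribution puts mass 0.1 / (0.1 + 0.3) = 0.25 on state 1, so the minimizer of the expected loss under the stationary distribution is 0.25. The averaged iterate should land near that value even though consecutive samples are strongly correlated.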

Related Organizations

Royal Institute of Technology, Sweden
University of California, Los Angeles, United States
University of California, United States

Keywords

FOS: Computer and information sciences, Statistics - Machine Learning, Optimization and Control (math.OC), FOS: Mathematics, Machine Learning (stat.ML), Mathematics - Optimization and Control

Impact by BIP!

Selected citations: 51. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
Popularity: Top 10%. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
Influence: Top 10%. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
Impulse: Average. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.