
arXiv: 2312.04928
This paper investigates the influence of directed networks on decentralized stochastic non-convex optimization associated with column-stochastic mixing matrices. Surprisingly, we find that the canonical spectral gap, a widely used metric in undirected networks, is insufficient to characterize the impact of directed topology on decentralized algorithms. To overcome this limitation, we introduce a novel metric termed equilibrium skewness. This metric, together with the spectral gap, accurately and comprehensively captures the influence of column-stochastic mixing matrices on decentralized stochastic algorithms. With these two metrics, we clarify, for the first time, how the directed network topology influences the performance of prevalent algorithms such as Push-Sum and Push-Diging. Furthermore, we establish the first lower bound of the convergence rate for decentralized stochastic non-convex algorithms over directed networks. Since existing algorithms cannot match our lower bound, we further propose the MG-Push-Diging algorithm, which integrates Push-Diging with a multi-round gossip technique. MG-Push-Diging attains our lower bound up to logarithmic factors, demonstrating its near-optimal performance and the tightness of the lower bound. Numerical experiments verify our theoretical results.
directed networks, Optimization and Control (math.OC), FOS: Mathematics, Stochastic programming, push-sum, lower bound, Nonconvex programming, global optimization, decentralized optimization, Mathematics - Optimization and Control
directed networks, Optimization and Control (math.OC), FOS: Mathematics, Stochastic programming, push-sum, lower bound, Nonconvex programming, global optimization, decentralized optimization, Mathematics - Optimization and Control
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
