
We study a stochastic and distributed algorithm for nonconvex problems whose objective consists of a sum of $N$ nonconvex $L_i/N$-smooth functions, plus a nonsmooth regularizer. The proposed NonconvEx primal-dual SpliTTing (NESTT) algorithm splits the problem into $N$ subproblems, and utilizes an augmented Lagrangian based primal-dual scheme to solve it in a distributed and stochastic manner. With a special non-uniform sampling, a version of NESTT achieves $��$-stationary solution using $\mathcal{O}((\sum_{i=1}^N\sqrt{L_i/N})^2/��)$ gradient evaluations, which can be up to $\mathcal{O}(N)$ times better than the (proximal) gradient descent methods. It also achieves Q-linear convergence rate for nonconvex $\ell_1$ penalized quadratic problems with polyhedral constraints. Further, we reveal a fundamental connection between primal-dual based methods and a few primal only methods such as IAG/SAG/SAGA.
35 pages, 2 figures
FOS: Computer and information sciences, Computer Science - Machine Learning, Non-linear Dynamics, Systems Engineering, Machine Learning (stat.ML), 510, Machine Learning (cs.LG), Statistics - Machine Learning, Optimization and Control (math.OC), Industrial Engineering, FOS: Mathematics, Mathematics - Optimization and Control
FOS: Computer and information sciences, Computer Science - Machine Learning, Non-linear Dynamics, Systems Engineering, Machine Learning (stat.ML), 510, Machine Learning (cs.LG), Statistics - Machine Learning, Optimization and Control (math.OC), Industrial Engineering, FOS: Mathematics, Mathematics - Optimization and Control
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
