
arXiv: 2311.05381
We propose a novel generalization of the conditional gradient (CG / Frank-Wolfe) algorithm for minimizing a smooth function $f$ under an intersection of compact convex sets, using a first-order oracle for $\nabla f$ and linear minimization oracles (LMOs) for the individual sets. Although this computational framework presents many advantages, there are only a small number of algorithms which require one LMO evaluation per set per iteration; furthermore, these algorithms require $f$ to be convex. Our algorithm appears to be the first in this class which is proven to also converge in the nonconvex setting. Our approach combines a penalty method and a product-space relaxation. We show that one conditional gradient step is a sufficient subroutine for our penalty method to converge, and we provide several analytical results on the product-space relaxation's properties and connections to other problems in optimization. We prove that our average Frank-Wolfe gap converges at a rate of $\mathcal{O}(\ln t/\sqrt{t})$, -- only a log factor worse than the vanilla CG algorithm with one set.
22 pages, 3 figures
Numerical optimization and variational techniques, Convex programming, splitting, Applications of functional analysis in optimization, convex analysis, mathematical programming, economics, Nonconvex programming, global optimization, conditional gradient, Frank-Wolfe algorithm, Nonlinear programming, Optimization and Control (math.OC), FOS: Mathematics, 46N10, 65K10, 90C25, 90C26, 90C30, nonconvex function, Mathematics - Optimization and Control, projection free
Numerical optimization and variational techniques, Convex programming, splitting, Applications of functional analysis in optimization, convex analysis, mathematical programming, economics, Nonconvex programming, global optimization, conditional gradient, Frank-Wolfe algorithm, Nonlinear programming, Optimization and Control (math.OC), FOS: Mathematics, 46N10, 65K10, 90C25, 90C26, 90C30, nonconvex function, Mathematics - Optimization and Control, projection free
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
