2016
DOI: 10.1137/15m1009937

On the von Neumann and Frank--Wolfe Algorithms with Away Steps

Abstract: The von Neumann algorithm is a simple coordinate-descent algorithm to determine whether the origin belongs to a polytope generated by a finite set of points. When the origin is in the interior of the polytope, the algorithm generates a sequence of points in the polytope that converges linearly to zero. The algorithm's rate of convergence depends on the radius of the largest ball around the origin contained in the polytope. We show that under the weaker condition that the origin is in the polytope, possibly on its boundary, …
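
To make the setting concrete, here is a minimal sketch of the basic von Neumann algorithm (without away steps), assuming the generating points are normalized to unit length as is standard; the function name, tolerance parameter, and NumPy-based implementation are illustrative choices, not the paper's code.

```python
import numpy as np

def von_neumann(A, max_iter=1000, tol=1e-8):
    """Illustrative sketch of the basic von Neumann algorithm (no away steps).

    A: (n, d) array whose rows a_1, ..., a_n generate the polytope
       conv{a_1, ..., a_n}; rows are assumed to have unit norm.
    Returns (x, status): the current iterate and whether ||x|| reached tol
    or a certificate that the origin lies outside the polytope was found.
    """
    x = A[0].copy()                      # start at a vertex of the polytope
    for _ in range(max_iter):
        if np.linalg.norm(x) <= tol:     # origin (approximately) reached
            return x, "origin in polytope (up to tol)"
        j = np.argmin(A @ x)             # generator most opposed to x
        if A[j] @ x > 0:                 # then <x, a_i> > 0 for all i:
            return x, "origin not in polytope"  # x separates 0 from conv(A)
        d = A[j] - x                     # move toward the chosen generator
        # exact line search: minimize ||x + lam * d|| over lam in [0, 1]
        lam = np.clip(-(x @ d) / (d @ d), 0.0, 1.0)
        x = x + lam * d
    return x, "max_iter reached"
```

Each update keeps the iterate a convex combination of the generators, and the line search has the closed form shown because the objective is a one-dimensional quadratic in the step size.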

Cited by 11 publications (7 citation statements)
References 22 publications
“…Now note that since Algorithm 1 is a descent method (i.e., the function value never increases from one iteration to the next), all iterates as well as the optimal set X* are contained within the initial level set L, and thus for any t ≥ 1 we can bound dist(x_t, X*) by D_L. Thus, using (13) we have that…”
Section: A Proof of Theorem (mentioning; confidence: 99%)
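
The step this excerpt takes can be stated compactly; the display below is a reconstruction in generic notation, assuming (consistently with the excerpt) that D_L denotes the diameter of the initial level set L:

```latex
\[
f(x_t) \le f(x_0)
\;\Longrightarrow\;
x_t \in L := \{x : f(x) \le f(x_0)\}
\;\Longrightarrow\;
\operatorname{dist}(x_t, X^\ast) \le D_L := \operatorname{diam}(L),
\qquad t \ge 1 .
\]
```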
“…It is well known that this rate does not improve even if the objective function is strongly convex (see for instance [4]), a property that allows for faster convergence rates, and in particular linear rates, for projected/proximal gradient methods [5,6]. Indeed, in recent years there has been a significant research effort to design Frank-Wolfe variants with linear convergence rates under strong convexity, or under the weaker assumption of quadratic growth (see Definition 1 in the sequel), with most efforts focused on the case in which the feasible set is a convex and compact polytope [7,8,9,10,11,12,13,14,15,16].…”
Section: Introduction (mentioning; confidence: 99%)
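
For reference, the quadratic-growth condition mentioned in this excerpt is, in its standard form (stated here in generic notation, not necessarily the cited paper's Definition 1):

```latex
\[
f(x) - f^\ast \;\ge\; \frac{\mu}{2}\,\operatorname{dist}(x, X^\ast)^2
\qquad \text{for all feasible } x .
\]
```

Strong convexity with modulus μ implies quadratic growth with the same μ, but not conversely, which is why quadratic growth is the weaker assumption.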
“…Later on, Guélat and Marcotte [24] showed that a modified version of the away-step Frank-Wolfe method has a linear convergence guarantee when the objective is strongly convex and the feasible region is a polytope. The linear convergence guarantee of the away-step Frank-Wolfe method has recently been improved with simpler algorithms and a more straightforward analysis; see, for example, [31,6,44,43].…”
Section: Related Literature (mentioning; confidence: 99%)
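
A minimal sketch of one common form of Frank-Wolfe with away steps over conv{a_1, ..., a_n}, in the spirit of the variants discussed above; the active-set bookkeeping, the crude default step size, and the generic line-search callback are illustrative simplifications, not any cited paper's exact algorithm.

```python
import numpy as np

def away_step_fw(A, grad, w0, num_iter=100, tol=1e-10, line_search=None):
    """Sketch of Frank-Wolfe with away steps over conv(rows of A).

    A:    (n, d) array whose rows are the polytope's vertices.
    grad: callable mapping a point x of shape (d,) to its gradient.
    w0:   (n,) initial convex-combination weights (nonnegative, sum to 1).
    line_search: optional callable (x, d, gamma_max) -> step in [0, gamma_max];
                 if None, a crude fixed step is used (illustrative only).
    """
    w = np.asarray(w0, dtype=float).copy()
    for _ in range(num_iter):
        x = A.T @ w
        g = grad(x)
        scores = A @ g
        s = np.argmin(scores)                  # toward (Frank-Wolfe) vertex
        active = np.flatnonzero(w > 0)         # support of the current iterate
        v = active[np.argmax(scores[active])]  # away vertex
        fw_gap = g @ (x - A[s])                # predicted toward-step progress
        away_gap = g @ (A[v] - x)              # predicted away-step progress
        if max(fw_gap, away_gap) <= tol:       # approximately stationary
            break
        if fw_gap >= away_gap:                 # toward step: x + gamma*(a_s - x)
            d, toward, away, gamma_max = A[s] - x, s, None, 1.0
        else:                                  # away step: x + gamma*(x - a_v)
            d, toward, away = x - A[v], None, v
            # largest step keeping w[v] nonnegative after the update
            gamma_max = w[v] / (1.0 - w[v]) if w[v] < 1.0 else np.inf
        gamma = line_search(x, d, gamma_max) if line_search else min(0.1, gamma_max)
        if toward is not None:                 # shrink all weights, grow w[s]
            w *= 1.0 - gamma
            w[toward] += gamma
        else:                                  # grow all weights, shrink w[v]
            w *= 1.0 + gamma
            w[away] -= gamma
        w = np.clip(w, 0.0, None)              # guard against round-off
    return A.T @ w, w
```

The away step shifts weight off the active vertex that is worst aligned with the descent direction; capping it at gamma_max is what lets the method drop vertices from the active set entirely, which is the mechanism behind the linear rates cited above.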
“…The iterate can land on a certain face of the constraint set, which avoids this zigzagging phenomenon. Moreover, with away steps, the Frank-Wolfe method enjoys a linear convergence rate when the objective is additionally strongly convex and the constraint set is a polytope [31,44,43]. In addition to faster convergence, the away-step Frank-Wolfe method usually leads to a sparser solution, which is helpful in many applications where sparsity and interpretability are desirable properties of the solution [22].…”
Section: Unbounded Frank-Wolfe Algorithms (mentioning; confidence: 99%)
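
Continuing the sketch above, the sparsity effect this excerpt mentions can be observed directly from the returned weights; the quadratic objective and random instance below are a made-up test problem, not taken from any cited paper.

```python
# Hypothetical usage of the away_step_fw sketch above: minimize ||x - b||^2
# over the convex hull of random unit vectors, then count how many vertices
# still carry nonzero weight.
rng = np.random.default_rng(0)
A = rng.standard_normal((50, 10))
A /= np.linalg.norm(A, axis=1, keepdims=True)
b = 0.1 * rng.standard_normal(10)
w0 = np.full(50, 1.0 / 50)                       # start from the barycenter
x, w = away_step_fw(A, grad=lambda x: 2 * (x - b), w0=w0, num_iter=500)
print("active vertices:", np.count_nonzero(w > 1e-9))
```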
“…In recent years, Garber and Hazan [10,12] and then Lacoste-Julien and Jaggi [20] presented variants of the Frank-Wolfe method that utilize away steps alongside new analyses, which resulted in provable and explicit linear rates without requiring strict complementarity conditions and without dependence on the location of the optimal solution. These results have encouraged much follow-up theoretical and empirical work, e.g., [2,24,23,14,25,13,26,16,5,15,1,4,21,7], to name a few. However, the linear convergence rates in [10,12,20] and follow-up works depend explicitly on the dimension of the problem (at least linearly, i.e., the convergence rate is of the form exp(−Θ(t/d)), where d is the dimension).…”
Section: Introduction (mentioning; confidence: 97%)
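
To unpack the dimension dependence in this excerpt: a rate of the form exp(−Θ(t/d)) means the iteration count scales linearly with d, since with h_t denoting the optimality gap and c > 0 an absolute constant,

```latex
\[
h_t \le h_0 \, e^{-c\,t/d}
\quad\Longrightarrow\quad
h_t \le \varepsilon
\ \text{ once } \
t \ge \frac{d}{c}\,\log\frac{h_0}{\varepsilon}.
\]
```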