We show that any memory-constrained, first-order algorithm which minimizes d-dimensional, 1-Lipschitz convex functions over the unit ball to 1/poly(d) accuracy using at most d^(1.25−δ) bits of memory must make at least Ω(d^(1+(4/3)δ)) first-order queries (for any constant δ ∈ [0, 1/4]). Consequently, the performance of such memory-constrained algorithms is a polynomial factor worse than the optimal Õ(d) query bound for this problem obtained by cutting-plane methods that use Õ(d^2) memory. This resolves a COLT 2019 open problem of Woodworth and Srebro. (Cutting-plane methods require at least Ω(d log(1/ε)) iterations, each of which involves working in some change of basis or solving a linear system and thus naively takes at least Ω(d^2) time.)
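To make the tradeoff concrete, here is a direct instantiation of the stated bounds at the two endpoints of δ (a worked substitution, nothing beyond what the bound above already states):

```latex
% Instantiating: memory at most d^{1.25-\delta} bits forces \Omega(d^{1+(4/3)\delta}) queries.
\delta = 0:\qquad\;\;\; \text{memory} \le d^{1.25}\ \text{bits} \;\Longrightarrow\; \Omega(d)\ \text{queries};\\
\delta = \tfrac{1}{4}:\qquad \text{memory} \le d\ \text{bits} \;\Longrightarrow\; \Omega(d^{4/3})\ \text{queries}.
```

In particular, an algorithm restricted to roughly linear memory (the regime of gradient descent) must make about a d^(1/3) factor more queries than the Õ(d) achieved by cutting-plane methods.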
We provide new gradient-based methods for efficiently solving a broad class of ill-conditioned optimization problems. We consider the problem of minimizing a function f : ℝ^d → ℝ which is implicitly decomposable as the sum of m unknown non-interacting smooth, strongly convex functions, and provide a method which solves this problem with a number of gradient evaluations that scales (up to logarithmic factors) as the product of the square roots of the condition numbers of the components. This complexity bound (which we prove is nearly optimal) can improve almost exponentially on that of accelerated gradient methods, which grows as the square root of the condition number of f. Additionally, we provide efficient methods for solving stochastic, quadratic variants of this multiscale optimization problem. Rather than learn the decomposition of f (which would be prohibitively expensive), our methods apply a clean recursive "Big-Step-Little-Step" interleaving of standard methods. The resulting algorithms use Õ(dm) space, are numerically stable, and open the door to a more fine-grained understanding of the complexity of convex optimization beyond condition number.
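To convey the flavor of the big-step/little-step interleaving, the toy sketch below runs it on a separable two-scale quadratic. It illustrates only the interleaving idea, not the paper's actual algorithm (which handles unknown decompositions, uses acceleration, and recurses over m scales); the variable names, step counts, and constants are made up for the example.

```python
import numpy as np

# Toy two-scale objective: f(x) = 0.5*(L_stiff*x[0]**2 + L_flat*x[1]**2), L_stiff >> L_flat.
# Only full gradients of f are used, mimicking the first-order oracle setting.
L_stiff, L_flat = 1e6, 1.0

def grad(x):
    return np.array([L_stiff * x[0], L_flat * x[1]])

x = np.array([1.0, 1.0])
for outer in range(5):
    # "Little" steps: small step size 1/L_stiff is safe for the stiff component
    # and drives the high-curvature direction toward zero.
    for _ in range(10):
        x = x - (1.0 / L_stiff) * grad(x)
    # "Big" step: large step size 1/L_flat makes real progress on the flat
    # component; it is only stable because the little steps already shrank
    # the stiff component that it would otherwise amplify.
    x = x - (1.0 / L_flat) * grad(x)
    print(outer, 0.5 * (L_stiff * x[0]**2 + L_flat * x[1]**2))
```

The big step would diverge on the stiff coordinate if applied alone, and plain gradient descent with the safe step size 1/L_stiff would need on the order of L_stiff/L_flat iterations to make progress on the flat coordinate; interleaving the two step sizes avoids both problems.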
Minimizing a convex function with access to a first-order oracle, which returns the function value and a (sub)gradient at a query point, is a canonical optimization problem and a fundamental primitive in machine learning. Gradient-based methods are the most popular approaches for solving this problem, owing to their simplicity and computational efficiency. These methods, however, do not achieve the information-theoretically optimal query complexity for minimizing the underlying function to small error, which is achieved by more expensive techniques based on cutting-plane methods. Is it possible to achieve the information-theoretically optimal query complexity without using these more complex and computationally expensive methods? In this work, we use memory as a lens to understand this question, and show that it is not possible to achieve the optimal query complexity without using significantly more memory than that used by gradient descent.
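As a rough illustration of the memory gap in question (a back-of-the-envelope sketch, not an implementation of either method): gradient descent needs to store essentially only the current iterate, while a cutting-plane method such as the ellipsoid method maintains a d × d matrix describing its current localization set.

```python
import numpy as np

d = 1000  # dimension of the optimization problem

# Gradient descent: the persistent state is essentially the current iterate,
# i.e., d numbers (plus the latest gradient while it is being applied).
x_gd = np.zeros(d)

# Ellipsoid-style cutting-plane method: the state is a center plus a d x d
# positive-definite matrix describing the current ellipsoid, i.e., ~d^2 numbers.
center = np.zeros(d)
shape_matrix = np.eye(d)

print(x_gd.nbytes)                          # ~8 KB for d = 1000
print(center.nbytes + shape_matrix.nbytes)  # ~8 MB for d = 1000
```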