A derivative-free method for structured optimization problems

Cristofari, Andrea; Rinaldi, Francesco

doi:10.48550/arxiv.2005.05224

Cited by 2 publications

(2 citation statements)

References 30 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Large-Scale DFO There have been several alternative approaches considered for improving the scalability of DFO. These often consider problems with specific structure which enable efficient model construction, such as partial separability [24,60], sparse Hessians [4], and minimization over the convex hull of finitely many points [27]. On the other hand, there is a growing body of literature on 'gradient sampling' techniques for machine learning problems.…”

Section: Existing Literaturementioning

confidence: 99%

Scalable Subspace Methods for Derivative-Free Nonlinear Least-Squares Optimization

Cartis¹,

Roberts²

2021

Preprint

View full text Add to dashboard Cite

We introduce a general framework for large-scale model-based derivative-free optimization based on iterative minimization within random subspaces. We present a probabilistic worst-case complexity analysis for our method, where in particular we prove high-probability bounds on the number of iterations before a given optimality is achieved. This framework is specialized to nonlinear least-squares problems, with a model-based framework based on the Gauss-Newton method. This method achieves scalability by constructing local linear interpolation models to approximate the Jacobian, and computes new steps at each iteration in a subspace with user-determined dimension. We then describe a practical implementation of this framework, which we call DFBGN. We outline efficient techniques for selecting the interpolation points and search subspace, yielding an implementation that has a low per-iteration linear algebra cost (linear in the problem dimension) while also achieving fast objective decrease as measured by evaluations. Extensive numerical results demonstrate that DFBGN has improved scalability, yielding strong performance on large-scale nonlinear least-squares problems.

show abstract

Section: Existing Literaturementioning

confidence: 99%

Scalable Subspace Methods for Derivative-Free Nonlinear Least-Squares Optimization

Cartis¹,

Roberts²

2021

Preprint

View full text Add to dashboard Cite

show abstract

“…This type of problem arises across a broad range of application areas (Conn et al, 2009;Audet & Hare, 2017), but has attracted particular recent attention in the learning community for problems such as black-box attacks (Chen et al, 2017;Ughi et al, 2019), hyperparameter tuning (Ghanbari & Scheinberg, 2017;Lakhmiri et al, 2020) and reinforcement learning (Mania et al, 2018;Choromanski et al, 2019). A cur-rent deficiency of DFO methods is their performance on large-scale problems, which is critical to their utility in machine learning; there have been several recent works aimed at improving the scalability of DFO (Bergou et al, 2019;Roberts, 2019;Porcelli & Toint, 2020;Cristofari & Rinaldi, 2020).…”

Section: Introductionmentioning

confidence: 99%

Scalable Derivative-Free Optimization for Nonlinear Least-Squares Problems

Cartis¹,

Ferguson²,

Roberts³

2020

Preprint

View full text Add to dashboard Cite

Derivative-free-or zeroth-order-optimization (DFO) has gained recent attention for its ability to solve problems in a variety of application areas, including machine learning, particularly involving objectives which are stochastic and/or expensive to compute. In this work, we develop a novel model-based DFO method for solving nonlinear least-squares problems. We improve on state-of-the-art DFO by performing dimensionality reduction in the observational space using sketching methods, avoiding the construction of a full local model. Our approach has a per-iteration computational cost which is linear in problem dimension in a big data regime, and numerical evidence demonstrates that, compared to existing software, it has dramatically improved runtime performance on overdetermined least-squares problems.

show abstract

A derivative-free method for structured optimization problems

Cited by 2 publications

References 30 publications

Scalable Subspace Methods for Derivative-Free Nonlinear Least-Squares Optimization

Scalable Subspace Methods for Derivative-Free Nonlinear Least-Squares Optimization

Scalable Derivative-Free Optimization for Nonlinear Least-Squares Problems

Contact Info

Product

Resources

About