Optimization in High Dimensions via Accelerated, Parallel, and Proximal Coordinate Descent

Fercoq, Olivier; Richtárik, Peter

doi:10.1137/16m1085905

Cited by 18 publications

(11 citation statements)

References 33 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Note that, as for the iteration (4), in each step p m subproblems have to be solved but storage and update work increase. Remedies are available, see [6,23] for discussions on implementation issues. We have the following convergence result:…”

Section: Theoretical Resultsmentioning

confidence: 99%

Stochastic subspace correction methods and fault tolerance

Griebel

Oswald

2019

Math. Comp.

View full text Add to dashboard Cite

show abstract

Section: Theoretical Resultsmentioning

confidence: 99%

Stochastic subspace correction methods and fault tolerance

Griebel

Oswald

2019

Math. Comp.

View full text Add to dashboard Cite

show abstract

“…Besides a deterministic or greedy pick, we may also choose the next subproblem in a random fashion according to a probability distribution ρ on the set of subspaces, see [8] and the references cited therein. The analysis of such stochastic iterations has been a very active research topic in large-scale convex optimization, see [6] for a recent survey, but also in the area of machine learning and compressed sensing. Compared to the greedy approach, the cost for determining the next subspace is dramatically reduced to the cost of sampling the underlying probability distribution ρ.…”

Section: Introductionmentioning

confidence: 99%

Stochastic Subspace Correction in Hilbert Space

Griebel

Oswald

2018

Constr Approx

View full text Add to dashboard Cite

We consider an incremental approximation method for solving variational problems in infinite-dimensional separable Hilbert spaces, where in each step a randomly and independently selected subproblem from an infinite collection of subproblems is solved. We show that convergence rates for the expectation of the squared error can be guaranteed under weaker conditions than previously established in [9].

show abstract

“…Distributed optimization: In recent years, a lot of effort has been devoted to designing distributed first-order methods (Mahajan et al, 2013;Shamir and Srebro, 2014;Lee et al, 2017;Fercoq and Richtárik, 2016;Liu et al, 2014;Necoara and Clipici, 2016;Richtárik and Takáč, 2016;Liu et al, 2020), which only rely on gradient information of the objective function. However, first-order methods suffer from: (i) a dependence on a suitably defined condition number; (ii) spending more time on communication than on computation.…”

Section: Related Workmentioning

confidence: 99%

Learning Linear Models Using Distributed Iterative Hessian Sketching

Wang¹,

Anderson²

2021

Preprint

View full text Add to dashboard Cite

This work considers the problem of learning the Markov parameters of a linear system from observed data. Recent non-asymptotic system identification results have characterized the sample complexity of this problem in the single and multi-rollout setting. In both instances, the number of samples required in order to obtain acceptable estimates can produce optimization problems with an intractably large number of decision variables for a second-order algorithm. We show that a randomized and distributed Newton algorithm based on Hessian-sketching can produceoptimal solutions and converges geometrically. Moreover, the algorithm is trivially parallelizable. Our results hold for a variety of sketching matrices and we illustrate the theory with numerical examples.

show abstract

Optimization in High Dimensions via Accelerated, Parallel, and Proximal Coordinate Descent

Cited by 18 publications

References 33 publications

Stochastic subspace correction methods and fault tolerance

Stochastic subspace correction methods and fault tolerance

Stochastic Subspace Correction in Hilbert Space

Learning Linear Models Using Distributed Iterative Hessian Sketching

Contact Info

Product

Resources

About