Abstract:We present new adaptive sampling rules for the sketch-and-project method for solving linear systems. To deduce our new sampling rules, we first show how the progress of one step of the sketch-and-project method depends directly on a sketched residual. Based on this insight, we derive a 1) max-distance sampling rule, by sampling the sketch with the largest sketched residual 2) a proportional sampling rule, by sampling proportional to the sketched residual, and finally 3) a capped sampling rule. The capped sampl… Show more
“…Related Results. This result is related to recent work of Gower, Molitor, Moorman and Needell [11] who discuss non-uniform selection probabilities in the more general framework of Sketch-and-Project methods. In particular, their §7.4.…”
Section: 3supporting
confidence: 65%
“…discusses the convergence rate of the algorithm considered here for p = 1. The case p = ∞ has been studied by a large number of people; we emphasize the results of Nutini, Sepehey, Laradji, Schmidt, Koepke, Virani [29] and to references in [11].…”
The Kaczmarz method for solving a linear system Ax = b interprets such a system as a collection of equations a i , x = b i , where a i is the i−th row of A, then picks such an equation and corrects x k+1 = x k + λa i where λ is chosen so that the i−th equation is satisfied. Convergence rates are difficult to establish. Assuming the rows to be normalized, a i 2 = 1, Strohmer & Vershynin established that if the order of equations is chosen at random, E x k − x 2 converges exponentially. We prove that if the i−th row is selected with likelihood proportional to | a i , x k − b i | p , where 0 < p < ∞, then E x k −x 2 converges faster than the purely random method. As p → ∞, the method de-randomizes and explains, among other things, why the maximal correction method works well. We empirically observe that the method computes approximations of small singular vectors of A as a byproduct.
“…Related Results. This result is related to recent work of Gower, Molitor, Moorman and Needell [11] who discuss non-uniform selection probabilities in the more general framework of Sketch-and-Project methods. In particular, their §7.4.…”
Section: 3supporting
confidence: 65%
“…discusses the convergence rate of the algorithm considered here for p = 1. The case p = ∞ has been studied by a large number of people; we emphasize the results of Nutini, Sepehey, Laradji, Schmidt, Koepke, Virani [29] and to references in [11].…”
The Kaczmarz method for solving a linear system Ax = b interprets such a system as a collection of equations a i , x = b i , where a i is the i−th row of A, then picks such an equation and corrects x k+1 = x k + λa i where λ is chosen so that the i−th equation is satisfied. Convergence rates are difficult to establish. Assuming the rows to be normalized, a i 2 = 1, Strohmer & Vershynin established that if the order of equations is chosen at random, E x k − x 2 converges exponentially. We prove that if the i−th row is selected with likelihood proportional to | a i , x k − b i | p , where 0 < p < ∞, then E x k −x 2 converges faster than the purely random method. As p → ∞, the method de-randomizes and explains, among other things, why the maximal correction method works well. We empirically observe that the method computes approximations of small singular vectors of A as a byproduct.
“…, the Projection onto Convex Sets Method[2,6,7,11,36] and the Randomized Kaczmarz method[8,9,13,15,14,22,23,24,26,27,29,30,31,32,33,34,35,37,38,39,40,41,42].Strohmer & Vershynin …”
Suppose A ∈ R n×n is invertible and we are looking for the solution of Ax = b. Given an initial guess x 1 ∈ R, we show that by reflecting through hyperplanes generated by the rows of A, we can generate an infinite sequence (x k ) ∞ k=1 such that all elements have the same distance to the solution, i.e.If the hyperplanes are chosen at random, averages over the sequence converge andThe bound does not depend on the dimension of the matrix. This introduces a purely geometric way of attacking the problem: are there fast ways of estimating the location of the center of a sphere from knowing many points on the sphere? Our convergence rate (coinciding with that of the Random Kaczmarz method) comes from averaging, can one do better?
“…They showed that as the number of threads increases, the rate of convergence improves and the convergence horizon for inconsistent systems decreases. Another more general class of block methods are sketch-and-project methods [4]. For a linear system Ax = b, sketch-and-project methods iteratively project the current iterate onto the solution space of a sketched subsystem S T Ax = S T b.…”
The standard randomized sparse Kaczmarz (RSK) method is and algorithm to compute sparse solutions of linear systems of equations and uses sequential updates, and thus, does not take advantage of parallel computations. In this work, we introduce a parallel (mini batch) version of RSK based on averaging several Kaczmarz steps. Naturally, this method allows for parallelization and we show that it can also leverage large over-relaxation. We prove linear expected convergence and show that, given that parallel computations can be exploited, the method provably provides faster convergence than the standard method. This method can also be viewed as a variant of the linearized Bregman algorithm, a randomized dual block coordinate descent update, a stochastic mirror descent update, or a relaxed version of RSK and we recover the standard RSK method when the batch size is equal to one. We also provide estimates for inconsistent systems and show that the iterates convergence to an error in the order of the noise level. Finally, numerical examples illustrate the benefits of the new algorithm.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.