2019
DOI: 10.1007/s00498-019-00249-4

Non-asymptotic error bounds for constant stepsize stochastic approximation for tracking mobile agents

Abstract: This work revisits the constant stepsize stochastic approximation algorithm for tracking a slowly moving target and, using the Alekseev non-linear variation of constants formula, obtains a bound on the tracking error that is valid over the entire time axis. It is the first non-asymptotic bound of this kind: unlike prior works, it does not rely on the vanishing-stepsize limit and the associated limit theorems, and it captures explicitly the dependence on the problem parameters and the dimension.
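For orientation, constant stepsize stochastic approximation for tracking is usually written in the following generic form (the notation here is illustrative and is not taken from the paper):

\[
x_{n+1} = x_n + a\bigl[h(x_n,\theta_n) + M_{n+1}\bigr], \qquad n \ge 0,
\]

where $a>0$ is the fixed stepsize, $\{M_{n+1}\}$ is a martingale-difference noise sequence, and $\theta_n$ is the slowly moving target parameter, changing by at most $O(\epsilon)$ per step with $\epsilon$ small relative to $a$. In this standard formulation, the tracking error is the distance from $x_n$ to the slowly varying equilibrium $x^*(\theta_n)$ of $h(\cdot,\theta_n)$, and a bound "valid for the entire time axis" is one that holds uniformly in $n$ rather than only asymptotically.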

Cited by 6 publications (2 citation statements). References: 48 publications.

Citation statements:
“…Constant stepsize RSAs often converge much faster to a neighborhood of the desired solution. This phenomenon has been observed in off-policy temporal difference learning [52], temporal difference learning with function approximation [33], tracking problems [31], and gradient descent [4], among others. Furthermore, the size of this neighborhood is usually small if the stepsize is small (so too large a stepsize may not be beneficial) [13,7].…” (mentioning; confidence: 83%)
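The neighborhood-of-convergence behavior described in the statement above is easy to see numerically. Below is a minimal illustrative sketch (not code from the paper or any of the cited works), assuming the simplest setting of constant-stepsize stochastic approximation on a quadratic objective with additive Gaussian gradient noise: the iterates quickly settle into a neighborhood of the minimizer whose size shrinks with the stepsize.

```python
# Minimal illustration (not from the cited works): constant-stepsize stochastic
# approximation x_{n+1} = x_n - a*(grad f(x_n) + noise) on f(x) = 0.5*||x||^2.
# The iterates reach a neighborhood of the minimizer x* = 0 quickly; the size
# of that neighborhood shrinks as the stepsize a shrinks.
import numpy as np

def run_sa(stepsize, n_steps=20000, dim=10, noise_std=1.0, seed=0):
    rng = np.random.default_rng(seed)
    x = np.ones(dim)  # start away from the minimizer
    errs = []
    for _ in range(n_steps):
        noisy_grad = x + noise_std * rng.standard_normal(dim)
        x = x - stepsize * noisy_grad
        errs.append(np.linalg.norm(x))
    # average error over the last quarter of iterations ~ size of the neighborhood
    return np.mean(errs[-n_steps // 4:])

for a in (0.5, 0.1, 0.01):
    print(f"stepsize {a:5.2f} -> asymptotic error ~ {run_sa(a):.3f}")
```

In this toy setting the residual error scales roughly like the square root of the stepsize, consistent with the observation that a smaller constant stepsize yields a smaller neighborhood at the cost of slower initial progress.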
“…To the best of our knowledge, this paper provides the first finite-time convergence analysis for DSA scheme with biased updates relying on Markov samples. In addition, this work is related to the recent works on non-asymptotic analysis of SA schemes [14][15][16].…” (Section: Introduction; mentioning; confidence: 99%)