2021
DOI: 10.48550/arxiv.2109.13055
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Minimax Mixing Time of the Metropolis-Adjusted Langevin Algorithm for Log-Concave Sampling

Abstract: We study the mixing time of the Metropolis-adjusted Langevin algorithm (MALA) for sampling from a log-smooth and strongly log-concave distribution. We establish its optimal minimax mixing time under a warm start. Our main contribution is two-fold. First, for a d-dimensional log-concave density with condition number κ, we show that MALA with a warm start mixes in Õ(κ √ d) iterations up to logarithmic factors. This improves upon the previous work on the dependency of either the condition number κ or the dimensio… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
9
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(10 citation statements)
references
References 44 publications
1
9
0
Order By: Relevance
“…In other words, sampling is strictly easier in the presence of an initialization with bounded density ratio to the target (i.e., a warm start) than an initialization with bounded KL divergence. This is consistent with intuition from prior work on the complexity of the Metropolis-adjusted Langevin algorithm (see Chewi et al, 2021;Lee et al, 2021;Wu et al, 2021).…”
Section: Bump Construction and The Second Lower Boundsupporting
confidence: 89%
“…In other words, sampling is strictly easier in the presence of an initialization with bounded density ratio to the target (i.e., a warm start) than an initialization with bounded KL divergence. This is consistent with intuition from prior work on the complexity of the Metropolis-adjusted Langevin algorithm (see Chewi et al, 2021;Lee et al, 2021;Wu et al, 2021).…”
Section: Bump Construction and The Second Lower Boundsupporting
confidence: 89%
“…These algorithms can be combined with Metropolis-Hastings filters to correct the bias of the algorithm's stationary distribution; this typically improves mixing times to π from polynomial to polylogarithmic in the inverse accuracy. A notable example is the Metropolis-Adjusted Langevin Algorithm [78], whose mixing time has been the focus of much recent work [22,36] and was recently characterized in [91]. While there have been many more exciting developments, this literature is far too large to attempt to comprehensively cite here, and we refer the reader to textbooks and surveys such as [4,20,61,76,77] for further background.…”
Section: Related Workmentioning
confidence: 99%
“…The behaviour of the mixing time of the Metropolis-Adjusted Langevin Algorithm (MALA) and the Unadjusted Langevin Algorithm (ULA) has been extensively studied in the last few years, with particular attention being given to its dimensional dependence. See Durmus and Moulines (2019); Li et al (2021); Chewi et al (2021); Wu et al (2021) for recent theoretical contributions to this area. In contrast to theoretical bounds, which may only be informative in the very high-dimensional regime, our empirical bounds are informative in the low-to-moderate dimensional setting as well.…”
Section: Dimensional Scaling Of Ula and Malamentioning
confidence: 99%