2020
DOI: 10.48550/arxiv.2006.02509
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

SVGD as a kernelized Wasserstein gradient flow of the chi-squared divergence

Abstract: Stein Variational Gradient Descent (SVGD), a popular sampling algorithm, is often described as the kernelized gradient flow for the Kullback-Leibler divergence in the geometry of optimal transport. We introduce a new perspective on SVGD that instead views SVGD as the (kernelized) gradient flow of the chi-squared divergence which, we show, exhibits a strong form of uniform exponential ergodicity under conditions as weak as a Poincaré inequality. This perspective leads us to propose an alternative to SVGD, calle… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(8 citation statements)
references
References 16 publications
0
8
0
Order By: Relevance
“…In the above display, F stands for an appropriate free energy functional, |•| ρt and |•| ρ * t for suitable dual norms, and ε > 0 is assumed to be small. Moreover, Onsager and Machlup demonstrated that these constituents define a corresponding gradient flow structure 13 . Note that the exponent in (53) has the dimensions of a free energy (ignoring the Boltzmann constant and the constant temperature), which is consistent with the Boltzmann-Gibbs-Helmholtz free energy as described above.…”
Section: Connecting Gradient Flows To Large Deviationsmentioning
confidence: 99%
See 4 more Smart Citations
“…In the above display, F stands for an appropriate free energy functional, |•| ρt and |•| ρ * t for suitable dual norms, and ε > 0 is assumed to be small. Moreover, Onsager and Machlup demonstrated that these constituents define a corresponding gradient flow structure 13 . Note that the exponent in (53) has the dimensions of a free energy (ignoring the Boltzmann constant and the constant temperature), which is consistent with the Boltzmann-Gibbs-Helmholtz free energy as described above.…”
Section: Connecting Gradient Flows To Large Deviationsmentioning
confidence: 99%
“…As stated in the Introduction, any evolution equation of gradient flow type in fact admits many other non-equivalent gradient flow structures [20]. In the case of the Stein PDE (8) this phenomenon is exemplified by the structures proposed in [44,23] and [13]. However, each gradient flow structure is related to a particular form of the noise in the corresponding interacting particle system.…”
Section: Connecting Gradient Flows To Large Deviationsmentioning
confidence: 99%
See 3 more Smart Citations