2019
DOI: 10.48550/arxiv.1905.07769
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Mean-Field Langevin Dynamics and Energy Landscape of Neural Networks

Abstract: We present a probabilistic analysis of the long-time behaviour of the nonlocal, diffusive equations with a gradient flow structure in 2-Wasserstein metric, namely, the Mean-Field Langevin Dynamics (MFLD). Our work is motivated by a desire to provide a theoretical underpinning for the convergence of stochastic gradient type algorithms widely used for non-convex learning tasks such as training of deep neural networks. The key insight is that the certain class of the finite dimensional non-convex problems becomes… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
55
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 28 publications
(55 citation statements)
references
References 57 publications
0
55
0
Order By: Relevance
“…• By analyzing the proximal Gibbs distribution p q , we establish linear convergence in continuous time with respect to the KL-regularized objective. This convergence result holds for any regularization parameters, in contrast to existing analyses (e.g., Hu et al (2019)) that require strong regularization.…”
Section: Contributionsmentioning
confidence: 57%
See 3 more Smart Citations
“…• By analyzing the proximal Gibbs distribution p q , we establish linear convergence in continuous time with respect to the KL-regularized objective. This convergence result holds for any regularization parameters, in contrast to existing analyses (e.g., Hu et al (2019)) that require strong regularization.…”
Section: Contributionsmentioning
confidence: 57%
“…This fact was shown in Hu et al (2019). Hence, we may interpret the divergence between probability density functions q and p q as an optimization gap.…”
Section: Basic Properties and Convexitymentioning
confidence: 67%
See 2 more Smart Citations
“…The Langevin equation is the cornerstone of modern nonequilibrium statistical mechanics [1,2], with profound implications for numerical computations [3,4]. It is the basis upon which one builds the diffusion equations [5][6][7][8], fluctuation-dissipation theorems [1,9], Brownian motion with state-dependent diffusion coefficient and its path-integral formulation [10], and instrumental in areas overlapping with other fundamental theories of physics, such as the question of quantum decoherence [11,12].…”
Section: Introductionmentioning
confidence: 99%