2022
DOI: 10.48550/arxiv.2205.01608
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Local Stochastic Bilevel Optimization with Momentum-Based Variance Reduction

Abstract: Bilevel Optimization has witnessed notable progress recently with new emerging efficient algorithms and has been applied to many machine learning tasks such as data cleaning, few-shot learning, and neural architecture search. However, little attention has been paid to solve the bilevel problems under distributed setting. Federated learning (FL) is an emerging paradigm which solves machine learning tasks over distributed-located data. FL problems are challenging to solve due to the heterogeneity and communicati… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(11 citation statements)
references
References 49 publications
0
11
0
Order By: Relevance
“…We are ready to prove the main results in Theorem A. 16. We first summarize main results in Lemmas A.19, A.8 and A.7:…”
Section: A11 Case 1: Assumption 24 Holdsmentioning
confidence: 94%
See 2 more Smart Citations
“…We are ready to prove the main results in Theorem A. 16. We first summarize main results in Lemmas A.19, A.8 and A.7:…”
Section: A11 Case 1: Assumption 24 Holdsmentioning
confidence: 94%
“…In this section we will prove the following convergence result of the DBO algorithm: Theorem A. 16. In Algorithm 3, suppose Assumptions 2.1, 2.2, and 2.3 hold.…”
Section: A1 Proof Of the Dbo Convergencementioning
confidence: 99%
See 1 more Smart Citation
“…the inner problem has multiple minimizers [25,49]. A recent work [27] applied momentum-based acceleration to solve federated bilevel optimization problems.…”
Section: Related Workmentioning
confidence: 99%
“…There are much fewer studies on BO in a distributed setting. The recent works [25,34] consider a BO where both the outer and inner problems are defined with the expectations over data distributed across nodes. They analyze the communication complexity of their methods in a non-convex setting.…”
Section: Related Workmentioning
confidence: 99%