Sample Complexity of Sample Average Approximation for Conditional Stochastic Optimization

Hu, Yifan; Chen, Xin; He, Niao

doi:10.48550/arxiv.1905.11957

Cited by 1 publication

(6 citation statements)

References 35 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…This improves to O( −2 ) if both conditions hold, where we use O(•) to represent the rate hiding the logarithmic factors. In contrast to the SAA results in Hu et al [2019], these sample complexities are independent of the problem's dimensions. Furthermore, we show that, for weakly convex CSO problems (which are not necessarily smooth nor convex), BSGD requires a total sample complexity of O( −8 ) to achieve an -stationary point.…”

Section: Our Contributionsmentioning

confidence: 65%

“…The above result indicates that smoothness conditions make a difference in the total sample complexity of BSGD when solving CSO. It is worth pointing out that the sample complexity of BSGD matches with the that of ERM for strongly convex objectives esatblished in Hu et al [2019].…”

Section: Convergence For Strongly Convex Objectivesmentioning

confidence: 66%

“…However, this approach requires convexity of f and linearity of g, which are not satisfied in general applications especially when neural networks are used. SAA [Hu et al, 2019]…”

Section: Introductionmentioning

confidence: 99%

“…In this paper, we focus on the general CSO problem where multiple samples from the conditional distribution η|ξ are available, and the objective is not necessarily in the compositional form of a convex loss f ξ (•) and a linear mapping g η (•, ξ). A closely related work is Hu et al [2019], in which the authors study the generalization error bound and sample complexity of empirical risk minimization (ERM), a.k.a., sample average approximation (SAA) for general CSO. Differently, here we aim at developing efficient stochastic gradient-based methods that directly solve the CSO problem (1), for both convex and nonconvex settings.…”

Section: Introductionmentioning

confidence: 99%

“…For convex problems, this refers to the sample complexity required to find an -optimal solution in expectation; for nonconvex problems, this refers to the sample complexity required to find an -stationary point in expectation. Some of our results are summarized in Table 1 with a comparison to the sample complexity of SAA (or ERM) established in Hu et al [2019].…”

Section: Introductionmentioning

confidence: 99%

See 4 more Smart Citations

Biased Stochastic Gradient Descent for Conditional Stochastic Optimization

Hu,

Zhang,

Chen

et al. 2020

Preprint

Self Cite

View full text Add to dashboard Cite

Conditional Stochastic Optimization (CSO) covers a variety of applications ranging from metalearning and causal inference to invariant learning. However, constructing unbiased gradient estimates in CSO is challenging due to the composition structure. As an alternative, we propose a biased stochastic gradient descent (BSGD) algorithm and study the bias-variance tradeoff under different structural assumptions. We establish the sample complexities of BSGD for strongly convex, convex, and weakly convex objectives, under smooth and non-smooth conditions. We also provide matching lower bounds of BSGD for convex CSO objectives. Extensive numerical experiments are conducted to illustrate the performance of BSGD on robust logistic regression, model-agnostic meta-learning (MAML), and instrumental variable regression (IV).

show abstract

Section: Our Contributionsmentioning

confidence: 65%

Section: Convergence For Strongly Convex Objectivesmentioning

confidence: 66%

“…However, this approach requires convexity of f and linearity of g, which are not satisfied in general applications especially when neural networks are used. SAA [Hu et al, 2019]…”

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Biased Stochastic Gradient Descent for Conditional Stochastic Optimization

Hu,

Zhang,

Chen

et al. 2020

Preprint

Self Cite

View full text Add to dashboard Cite

show abstract

Sample Complexity of Sample Average Approximation for Conditional Stochastic Optimization

Cited by 1 publication

References 35 publications

Biased Stochastic Gradient Descent for Conditional Stochastic Optimization

Biased Stochastic Gradient Descent for Conditional Stochastic Optimization

Contact Info

Product

Resources

About