2017
DOI: 10.48550/arxiv.1703.01256
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

The Global Optimization Geometry of Low-Rank Matrix Optimization

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
34
0

Year Published

2017
2017
2019
2019

Publication Types

Select...
5
2

Relationship

5
2

Authors

Journals

citations
Cited by 21 publications
(35 citation statements)
references
References 24 publications
1
34
0
Order By: Relevance
“…The proof of Lemma 3.2 is given in Appendix D. As is shown in existing literature [21,22,23], the Gaussian linear operator A : R N ×N → R M introduced at the beginning of Section 3.1 satisfies the RIP condition (3.4) with high probability if M ≥ C(r + k)N 1 δ 2 r+k for some numerical constant C. Therefore, we can conclude that the three statements in Theorem 2.1 hold for the empirical risk (3.2) and population risk (3.3) as long as M is large enough. Some similar bounds for the sample complexity M under different settings can also be found in papers [7,4]. Note that the particular choice of l can guarantee that grad f (U) F is large outside of B(l), which is also proved in Appendix D. Together with Theorem 2.1, we prove a globally benign landscape for the empirical risk.…”
Section: Verifying Assumptions 21 22 and 23supporting
confidence: 70%
See 3 more Smart Citations
“…The proof of Lemma 3.2 is given in Appendix D. As is shown in existing literature [21,22,23], the Gaussian linear operator A : R N ×N → R M introduced at the beginning of Section 3.1 satisfies the RIP condition (3.4) with high probability if M ≥ C(r + k)N 1 δ 2 r+k for some numerical constant C. Therefore, we can conclude that the three statements in Theorem 2.1 hold for the empirical risk (3.2) and population risk (3.3) as long as M is large enough. Some similar bounds for the sample complexity M under different settings can also be found in papers [7,4]. Note that the particular choice of l can guarantee that grad f (U) F is large outside of B(l), which is also proved in Appendix D. Together with Theorem 2.1, we prove a globally benign landscape for the empirical risk.…”
Section: Verifying Assumptions 21 22 and 23supporting
confidence: 70%
“…The landscape of the above population risk has been studied in the general R N ×k space with k = r in [7]. The landscape of its variants, such as the asymmetric version with or without a balanced term, has also been studied in [4,18]. It is well known that there exists an ambiguity in the solution of (3.2) due to the fact that UU = UQQ U holds for any orthogonal matrix Q ∈ R k×k .…”
Section: Matrix Sensingmentioning
confidence: 99%
See 2 more Smart Citations
“…On the other hand, first-order method such as projected gradient descent (PGD) for solving (2) suffers from very slow convergence. As a proof of concept, we show in Figure 1 the convergence of PGD for solving symmetric NMF and as a comparison, the convergence of gradient descent (GD) for solving a matrix factorization (MF) (i.e., (2) without the nonnegative constraint) which is proved to admit linear convergence [13,14]. This phenomenon also appears in nonsymmetric NMF and is the main motivation to have many efficient algorithms such as ANLS and HALS.…”
Section: Introductionmentioning
confidence: 96%