Optimization-Based AMP for Phase Retrieval: The Impact of Initialization and $\ell_{2}$  Regularization

Ma, Junjie; Ji, Xu; Maleki, Arian

doi:10.1109/tit.2019.2893254

Cited by 51 publications

(30 citation statements)

References 66 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Recall that E[Y ] = 2x x + x 2 2 I n has a bounded spectral norm, then (136) implies that, to keep the deviation between Y and E[Y ] well-controlled, we must at least have (n log m)/m 1.…”

Section: Truncated Spectral Methods For Sample Efficiencymentioning

confidence: 99%

“…A few other nonconvex matrix factorization algorithms have been left out due to space, including but not limited to normalized iterative hard thresholding (NIHT) [129], atomic decomposition for minimum rank approximation (Admira) [130], composite optimization (e.g. prox-linear algorithm) [131][132][133][134], approximate message passing [135][136][137], block coordinate descent [19], coordinate descent [138], and conjugate gradient [139]. The readers are referred to these papers for detailed descriptions.…”

Section: Further Pointers To Other Algorithmsmentioning

confidence: 99%

“…Leaving out this boundedness issue, a more severe matter is that these parameters often depend on the problem size. For example, recall that in [27,91] vanilla GD [18, 26, 58-60, 75, 90, 201, 202] [ 46,48,49,95,165,166,203,204] Phase retrieval regularized GD [20,24,79,80,87,87,205] and quadratic sensing alternating minimization [118][119][120]206] approximate message passing [136,137] matrix completion [29,159,160,207] vanilla gradient descent [26] [1, 11, 13, 208-211] regularized Grassmanian GD [16,65] projected / regularized GD [19,21,63] alternating minimization [17,123,124] blind deconvolution / demixing vanilla GD [26,62] [5, 6] (subspace model) regularized GD [22,66,212] robust PCA [159] low-rank projection + thresholding [25] [7, 8, 213, 214] GD + thresholding…”

Section: Cautionmentioning

confidence: 99%

See 2 more Smart Citations

Nonconvex Optimization Meets Low-Rank Matrix Factorization: An Overview

Chi

Chen

2019

IEEE Trans. Signal Process.

336

284

View full text Add to dashboard Cite

Substantial progress has been made recently on developing provably accurate and efficient algorithms for low-rank matrix factorization via nonconvex optimization. While conventional wisdom often takes a dim view of nonconvex optimization algorithms due to their susceptibility to spurious local minima, simple iterative methods such as gradient descent have been remarkably successful in practice. The theoretical footings, however, had been largely lacking until recently.In this tutorial-style overview, we highlight the important role of statistical models in enabling efficient nonconvex optimization with performance guarantees. We review two contrasting approaches: (1) two-stage algorithms, which consist of a tailored initialization step followed by successive refinement; and (2) global landscape analysis and initialization-free algorithms. Several canonical matrix factorization problems are discussed, including but not limited to matrix sensing, phase retrieval, matrix completion, blind deconvolution, robust principal component analysis, phase synchronization, and joint alignment. Special care is taken to illustrate the key technical insights underlying their analyses. This article serves as a testament that the integrated consideration of optimization and statistics leads to fruitful research findings.

show abstract

“…Recall that E[Y ] = 2x x + x 2 2 I n has a bounded spectral norm, then (136) implies that, to keep the deviation between Y and E[Y ] well-controlled, we must at least have (n log m)/m 1.…”

Section: Truncated Spectral Methods For Sample Efficiencymentioning

confidence: 99%

Section: Further Pointers To Other Algorithmsmentioning

confidence: 99%

Section: Cautionmentioning

confidence: 99%

See 1 more Smart Citation

Nonconvex Optimization Meets Low-Rank Matrix Factorization: An Overview

Chi

Chen

2019

IEEE Trans. Signal Process.

336

284

View full text Add to dashboard Cite

show abstract

“…Almost all of these nonconvex methods require carefully-designed initialization to guarantee a sufficiently accurate initial point. One exception is the approximate message passing algorithm proposed in [MXM18], which works as long as the correlation between the truth and the initial signal is bounded away from zero. This, however, does not accommodate the case when the initial signal strength is vanishingly small (like random initialization).…”

Section: Related Workmentioning

confidence: 99%

Gradient descent with random initialization: fast global convergence for nonconvex phase retrieval

et al. 2019

View full text Add to dashboard Cite

This paper considers the problem of solving systems of quadratic equations, namely, recovering an object of interest x ∈ R n from m quadratic equations / samples yi = (a i x ) 2 , 1 ≤ i ≤ m. This problem, also dubbed as phase retrieval, spans multiple domains including physical sciences and machine learning.We investigate the efficacy of gradient descent (or Wirtinger flow) designed for the nonconvex least squares problem. We prove that under Gaussian designs, gradient descent -when randomly initialized -yields an -accurate solution in O log n + log(1/ ) iterations given nearly minimal samples, thus achieving near-optimal computational and sample complexities at once. This provides the first global convergence guarantee concerning vanilla gradient descent for phase retrieval, without the need of (i) carefully-designed initialization, (ii) sample splitting, or (iii) sophisticated saddle-point escaping schemes. All of these are achieved by exploiting the statistical models in analyzing optimization algorithms, via a leave-one-out approach that enables the decoupling of certain statistical dependency between the gradient descent iterates and the data.

show abstract

“…An active line of recent work studies nonconvex optimization algorithms for solving the classical phase retrieval problem (see, e.g., [1]- [8]). Compared to methods using convex relaxation [9]- [12], the nonconvex approaches tend to require much lower computational complexity and memory footprints.…”

Section: Introductionmentioning

confidence: 99%

Optimal Spectral Initialization for Signal Recovery With Applications to Phase Retrieval

Luo

Alghamdi

2019

IEEE Trans. Signal Process.

View full text Add to dashboard Cite

We present the optimal design of a spectral method widely used to initialize nonconvex optimization algorithms for solving phase retrieval and other signal recovery problems. Our work leverages recent results that provide an exact characterization of the performance of the spectral method in the highdimensional limit. This characterization allows us to map the task of optimal design to a constrained optimization problem in a weighted L 2 function space. The latter has a closed-form solution.Interestingly, under a mild technical condition, our results show that there exists a fixed design that is uniformly optimal over all sampling ratios. Numerical simulations demonstrate the performance improvement brought by the proposed optimal design over existing constructions in the literature. In a recent work, Mondelli and Montanari have shown the existence of a weak reconstruction threshold below which the spectral method cannot provide useful estimates. Our results serve to complement that work by deriving the fundamental limit of the spectral method beyond the aforementioned threshold.

show abstract

Optimization-Based AMP for Phase Retrieval: The Impact of Initialization and $\ell_{2}$ Regularization

Cited by 51 publications

References 66 publications

Nonconvex Optimization Meets Low-Rank Matrix Factorization: An Overview

Nonconvex Optimization Meets Low-Rank Matrix Factorization: An Overview

Gradient descent with random initialization: fast global convergence for nonconvex phase retrieval

Optimal Spectral Initialization for Signal Recovery With Applications to Phase Retrieval

Contact Info

Product

Resources

About