“…Leaving out this boundedness issue, a more severe matter is that these parameters often depend on the problem size. For example, recall that in [27,91]

Phase retrieval and quadratic sensing: vanilla GD [18, 26, 58-60, 75, 90, 201, 202]; [46, 48, 49, 95, 165, 166, 203, 204]; regularized GD [20, 24, 79, 80, 87, 87, 205]; alternating minimization [118-120, 206]; approximate message passing [136, 137]
Matrix completion: [29, 159, 160, 207]; vanilla gradient descent [26]; [1, 11, 13, 208-211]; regularized Grassmannian GD [16, 65]; projected / regularized GD [19, 21, 63]; alternating minimization [17, 123, 124]
Blind deconvolution / demixing (subspace model): vanilla GD [26, 62]; [5, 6]; regularized GD [22, 66, 212]
Robust PCA: [159]; low-rank projection + thresholding [25]; [7, 8, 213, 214]; GD + thresholding…”