“…Letting x-axis as the iteration counter, we plot the 50 sample paths for each algorithm in Figure 3. The step sizes for S-OGDA, SMD and SMP are selected as in [11], [28] and [18], respectively. Specifically, other than SAPD, both the primal and dual step sizes are set equal, and their value is a function of L = max{µ x , µ y , L xy , L yx }; indeed, S-OGDA uses 1 8L , SMP uses 1 √ 3L , and SMD uses 2 √ 5GN , where N denotes the total iteration budget for SMD, and G > 0 is a fixed constant such that E[2 ∇L(x, y; ω x , ω y ) 2 ] ≤ G uniformly for all (x, y) ∈ X × Y .…”