“…Distributed optimization: In recent years, a lot of effort has been devoted to designing distributed first-order methods (Mahajan et al, 2013;Shamir and Srebro, 2014;Lee et al, 2017;Fercoq and Richtárik, 2016;Liu et al, 2014;Necoara and Clipici, 2016;Richtárik and Takáč, 2016;Liu et al, 2020), which only rely on gradient information of the objective function. However, first-order methods suffer from: (i) a dependence on a suitably defined condition number; (ii) spending more time on communication than on computation.…”