“…Due to the empirical success of neural networks, there has been much effort to understand under what assumptions neural networks may be learned efficiently. This effort includes making assumptions on the input distribution [Li and Yuan, 2017, Brutzkus and Globerson, 2017, Du et al, 2017a,b, Du and Goel, 2018, the network's weights [Arora et al, 2014, Das et al, 2019, Agarwal et al, 2020, Goel and Klivans, 2017, or both [Janzamin et al, 2015, Tian, 2017, Bakshi et al, 2019. Hence, distribution-specific learning of neural networks is a central problem.…”