“…While they offered important insights into the benign overfitting phenomenon, most of them are limited to the settings of linear models (Belkin et al, 2019b;Bartlett et al, 2020;Hastie et al, 2019;Wu and Xu, 2020;Chatterji and Long, 2020;Zou et al, 2021b;Cao et al, 2021) and kernel/random features models (Belkin et al, 2018;Liang and Rakhlin, 2020;Montanari and Zhong, 2020), and cannot be applied to neural network models that are of greater interest. The only notable exceptions are (Adlam and Pennington, 2020;Li et al, 2021), which attempted to understand benign overfitting in neural network models. However, they are still limited to the "neural tagent kernel regime" (Jacot et al, 2018) where the neural network learning problem is essentially equivalent to kernel regression.…”