“…When the number of parameters in the models is excessively large, there are multiple techniques to precisely measure generalization errors. To name a few, the spectrum-based analysis [45,46,47,48,49,50,51,52], and the utilization of loss functions whose shapes are almost convex or approaches zero due to the excess parameters [53,54,55]. A disadvantage of this approach is that until now it can only deal with linear or two-layer neural network models.…”