“…See also (Holden, 1994;Wang et al, 1994;Amari and Murata, 1993;Wang et al, 1994;Guyon et al, 1992;Vapnik, 1992;Wolpert, 1994). Similar priors (or biases towards simplicity) are implicit in constructive and pruning algorithms, e.g., layer-by-layer sequential network construction (e.g., Ivakhnenko, 1968Ivakhnenko, , 1971Ash, 1989;Moody, 1989;Gallant, 1988;Honavar and Uhr, 1988;Ring, 1991;Fahlman, 1991;Weng et al, 1992;Honavar and Uhr, 1993;Burgess, 1994;Fritzke, 1994;Parekh et al, 2000;Utgoff and Stracuzzi, 2002) (see also Sec. 5.3, 5.11), input pruning (Moody, 1992;Refenes et al, 1994), unit pruning (e.g., Ivakhnenko, 1968Ivakhnenko, , 1971White, 1989;Mozer and Smolensky, 1989;Levin et al, 1994), weight pruning, e.g., optimal brain damage (LeCun et al, 1990b), and optimal brain surgeon (Hassibi and Stork, 1993).…”