“…This is an improvement over (2.4), which can be seen by comparing the infimum over γ of the expression in the bound with the value of the expression for γ = 1 and noting that d(f ; 1) = 0. For example, if the weights decrease polynomially |λ j | ∼ j −β , β > 1, or exponentially |λ j | ∼ e −βj , β > 0, then explicit minimization over γ shows that in these cases (2.8) can be a substantial improvement over (2.4) (see examples in [21]). Our first result in this paper also deals with bounding the generalization error of a classifier f = T j=1 λ j h j ∈ F = conv(H) in terms of complexity measures taking into account the sparsity of the weights λ j .…”