On-line and batch learning of a perceptron in a discrete weight space, where each weight can take 2L + 1 different values, are examined analytically and numerically. The learning algorithm is based on training a continuous perceptron while predicting with its clipped weights. The learning is described by a new set of order parameters, composed of the overlaps between the teacher and the continuous/clipped students. Several scenarios are examined, among them on-line learning with discrete/continuous transfer functions and off-line Hebb learning. The generalization error of the clipped weights decays asymptotically as exp(−Kα²)/exp(−e^{|λ|α}) in the case of on-line learning with binary/continuous activation functions, respectively, where α is the number of examples divided by N, the size of the input vector, and K is a positive constant that decays linearly with 1/L. For finite N and L, perfect agreement between the discrete student and the teacher is obtained for α ∝ L ln(NL). A crossover to a generalization error ∝ 1/α, characteristic of continuous weights with binary output, occurs for synaptic depth L > O(√N).
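The following is a minimal numerical sketch of the clip-and-predict idea described above: a student perceptron is trained on-line on its continuous weights (here with a simple Hebb-like update), while predictions and the generalization error are computed from the clipped, 2L + 1-level copy of those weights, together with the overlap order parameters between the teacher and the continuous/clipped students. The teacher distribution, the clipping rescaling, and the learning rate are illustrative assumptions, not the paper's exact prescription.

```python
import numpy as np

rng = np.random.default_rng(0)

N = 500          # input dimension
L = 2            # synaptic depth: clipped weights take the 2L+1 values -L, ..., L
alpha_max = 40   # number of examples in units of N
P = alpha_max * N

def clip_weights(w, L):
    """Map continuous weights onto the 2L+1 discrete levels -L,...,L (illustrative rule)."""
    # rescale so the typical weight magnitude spans the discrete range, then round
    scale = L / (np.sqrt(np.mean(w**2)) + 1e-12)
    return np.clip(np.round(w * scale), -L, L)

# teacher with discrete weights (assumption: uniform over the 2L+1 levels)
w_teacher = rng.integers(-L, L + 1, size=N).astype(float)

# continuous student, trained on-line; predictions use the clipped copy
w_student = np.zeros(N)
eta = 1.0  # learning rate (illustrative)

for mu in range(P):
    x = rng.standard_normal(N)        # random input example
    y = np.sign(w_teacher @ x)        # teacher's binary label
    # Hebb-like on-line update of the *continuous* weights
    w_student += (eta / np.sqrt(N)) * y * x

    if (mu + 1) % (5 * N) == 0:
        w_clip = clip_weights(w_student, L)
        # overlap order parameters: teacher vs. continuous and clipped students
        R_cont = (w_teacher @ w_student) / (np.linalg.norm(w_teacher) * np.linalg.norm(w_student))
        R_clip = (w_teacher @ w_clip) / (np.linalg.norm(w_teacher) * (np.linalg.norm(w_clip) + 1e-12))
        # generalization error of a binary perceptron with overlap R: eg = arccos(R)/pi
        eg_clip = np.arccos(np.clip(R_clip, -1.0, 1.0)) / np.pi
        print(f"alpha={(mu + 1) / N:5.1f}  R_cont={R_cont:.4f}  R_clip={R_clip:.4f}  eg_clip={eg_clip:.5f}")
```

In this sketch the generalization error of the clipped student is obtained from its overlap R with the teacher via eg = arccos(R)/π, the standard relation for a binary-output perceptron with random Gaussian inputs.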