1996
DOI: 10.1109/9.489276
|View full text |Cite
|
Sign up to set email alerts
|

Learning by canonical smooth estimation. II. Learning and choice of model complexity

Abstract: In this paper, we analyze the properties of a procedure for learning from examples. This \canonical learner" is based on a canonical error estimator developed in a companion paper. In learning problems, we observe data that consists of labeled sample points, and the goal is to nd a model, or \hypothesis," from a set of candidates that will accurately predict the labels of new sample points. The expected mismatch between a hypothesis' prediction and the actual label of a new sample point is called the hypothesi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
3
0

Year Published

1997
1997
2006
2006

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 26 publications
(3 citation statements)
references
References 46 publications
0
3
0
Order By: Relevance
“…In this paper, we focus on random penalty functions. This subject has been tackled by Buescher and Kumar (1996), Lugosi and Nobel (1999) and Boucheron et al (2000) but the most interesting works for our approach are the ones due to Koltchinskii (2001) and Bartlett et al (2002). Let ξ denote the sample (X 1 , Y 1 ), .…”
Section: Theorem 1 (Mcdiarmid)mentioning
confidence: 99%
See 1 more Smart Citation
“…In this paper, we focus on random penalty functions. This subject has been tackled by Buescher and Kumar (1996), Lugosi and Nobel (1999) and Boucheron et al (2000) but the most interesting works for our approach are the ones due to Koltchinskii (2001) and Bartlett et al (2002). Let ξ denote the sample (X 1 , Y 1 ), .…”
Section: Theorem 1 (Mcdiarmid)mentioning
confidence: 99%
“…This remark naturally leads to the idea of data-driven penalization. Buescher and Kumar (1996), Lugosi and Nobel (1999), Boucheron et al (2000) introduce some penalties involving the related empirical coverings or empirical Shatter coefficients. Inspired by the method of Rademacher symmetrization commonly used in the empirical processes theory (see for instance Van der Vaart and Wellner, 1996), Koltchinskii (2001) and Bartlett et al (2002) independently propose the so-called Rademacher penalties which are based on random variables of the form:…”
mentioning
confidence: 99%
“…Since jXj = m, there exists an x 2 X such that this property holds for at least 72k pairs. For j 2 f 1 : : : b g, de ne By the pigeonhole principle, there are at least 72k= 9 2 = 2 k pairs (h 1 h 2 ) for which the set f (h 1 (x) h 2 (x))g is the same. Then it follows that there are two subclasses H 1 H 2 H and indeces i j 2 f 1 : : : 9g with jH 1 j = jH 2 j = 2 k such that for each h 1 2 H 1 , (h 1 (x)) = i, f o r each h 2 2 H 2 , (h 2 (x)) = j, a n d i j + 4.…”
Section: Proof Of Theoremmentioning
confidence: 99%