10.1162/1532443041827952

Hutter, Marcus

doi:10.1162/1532443041827952

Cited by 7 publications

References 42 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

Sample Complexity for Computational Classification Problems

Ryabko

2007

Algorithmica

View full text Add to dashboard Cite

In a statistical setting of the classification (pattern recognition) problem the number of examples required to approximate an unknown labelling function is linear in the VC dimension of the target learning class. In this work we consider the question of whether such bounds exist if we restrict our attention to computable classification methods, assuming that the unknown labelling function is also computable. We find that in this case the number of examples required for a computable method to approximate the labelling function not only is not linear, but grows faster (in the VC dimension of the class) than any computable function. No time or space constraints are put on the predictors or target functions; the only resource we consider is the training examples.The task of classification is considered in conjunction with another learning problem-data compression. An impossibility result for the task of data compression allows us to estimate the sample complexity for pattern recognition.

show abstract

Sample Complexity for Computational Classification Problems

Ryabko

2007

Algorithmica

View full text Add to dashboard Cite

show abstract

MDL convergence speed for Bernoulli sequences

Poland

Hutter

2006

Stat Comput

View full text Add to dashboard Cite

The Minimum Description Length principle for online sequence estimation/prediction in a proper learning setup is studied. If the underlying model class is discrete, then the total expected square loss is a particularly interesting performance measure: (a) this quantity is finitely bounded, implying convergence with probability one, and (b) it additionally specifies the convergence speed. For MDL, in general one can only have loss bounds which are finite but exponentially larger than those for Bayes mixtures. We show that this is even the case if the model class contains only Bernoulli distributions. We derive a new upper bound on the prediction error for countable Bernoulli classes. This implies a small bound (comparable to the one for Bayes mixtures) for certain important model classes. We discuss the application to Machine Learning tasks such as classification and hypothesis testing, and generalization to countable classes of i.i.d. models

show abstract

Algorithmic complexity bounds on future prediction errors

Chernov

Hutter

Schmidhuber

2007

Information and Computation

View full text Add to dashboard Cite

We bound the future loss when predicting any (computably) stochastic sequence online. Solomonoff finitely bounded the total deviation of his universal predictor M from the true distribution μ by the algorithmic complexity of μ . Here we assume that we are at a time t>1 and have already observed x = x 1...xt . We bound the future prediction performance on xt+1xt+2... by a new variant of algorithmic complexity of μ given x, plus the complexity of the randomness deficiency of x. The new complexity is monotone in its condition in the sense that this complexity can only decrease if the condition is prolonged. We also briefly discuss potential generalizations to Bayesian model classes and to classification problems

show abstract

10.1162/1532443041827952

Cited by 7 publications

References 42 publications

Sample Complexity for Computational Classification Problems

Sample Complexity for Computational Classification Problems

MDL convergence speed for Bernoulli sequences

Algorithmic complexity bounds on future prediction errors

Contact Info

Product

Resources

About