1968
DOI: 10.1093/comjnl/11.2.185
|View full text |Cite
|
Sign up to set email alerts
|

An Information Measure for Classification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
527
0
2

Year Published

1975
1975
2008
2008

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 817 publications
(530 citation statements)
references
References 5 publications
1
527
0
2
Order By: Relevance
“…The Minimum Message Length (MML) Principle (Wallace & Boulton, 1968;Comley & Dowe, 2005;Wallace, 2005) is a method for inductive inference that is both Bayesian and compression-based. The similarities and differences with MDL are subtle; see, for example, Section 10.2 of Wallace (2005) or Section 17.4 of Grünwald (2007), or (Wallace & Dowe, 1999a,b).…”
Section: What About Mml?mentioning
confidence: 99%
“…The Minimum Message Length (MML) Principle (Wallace & Boulton, 1968;Comley & Dowe, 2005;Wallace, 2005) is a method for inductive inference that is both Bayesian and compression-based. The similarities and differences with MDL are subtle; see, for example, Section 10.2 of Wallace (2005) or Section 17.4 of Grünwald (2007), or (Wallace & Dowe, 1999a,b).…”
Section: What About Mml?mentioning
confidence: 99%
“…The fundamental idea is that compact coding theory provides the right framework in which to think about inference and prediction (Wallace and Boulton [1968]; Wallace and Freeman [1987]; Wallace and Dowe [1999a]; Wallace [2005] (Wallace and Freeman [1987]) is derived from a quadratic Taylor series approximation of Strict MML, and shares many of the desirable features of Strict MML.…”
Section: The Minimum Message Length (Mml) Principlementioning
confidence: 99%
“…The clustering is performed in advance of the prediction process by the clustering program Snob [3], using the content lemmas (unigrams) in the answers as features. The predictive model is a Decision Graph [4] trained on (1) input features: lemma unigrams and bigrams in the request, 3 and (2) target feature -the identifier of the answer cluster that contains the actual answer for the request.…”
Section: Predict a Complete Answermentioning
confidence: 99%
“…The predictive model is a Decision Graph [4] trained on (1) input features: lemma unigrams and bigrams in the request, 3 and (2) target feature -the identifier of the answer cluster that contains the actual answer for the request. The model provides a prediction of which response cluster is most suitable for a given request, as well as a level of confidence in this prediction.…”
Section: Predict a Complete Answermentioning
confidence: 99%