2006
DOI: 10.1007/s10994-006-8364-x
|View full text |Cite
|
Sign up to set email alerts
|

MODL: A Bayes optimal discretization method for continuous attributes

Abstract: While real data often comes in mixed format, discrete and continuous, many supervised induction algorithms require discrete data. Efficient discretization of continuous attributes is an important problem that has effects on speed, accuracy and understandability of the induction models. In this paper, we propose a new discretization method MODL 1 , founded on a Bayesian approach. We introduce a space of discretization models and a prior distribution defined on this model space. This results in the definition of… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
108
0
1

Year Published

2011
2011
2024
2024

Publication Types

Select...
5
4
1

Relationship

4
6

Authors

Journals

citations
Cited by 131 publications
(109 citation statements)
references
References 17 publications
0
108
0
1
Order By: Relevance
“…In the MODL approach [4], the discretization is turned into a model selection problem and solved in a Bayesian way. Using a hierarchical prior distribution on the discretization parameters, the Bayes formula is applicable to derive an exact analytical criterion to evaluate the posterior probability of a discretization model.…”
Section: A Optimal Preprocessingmentioning
confidence: 99%
“…In the MODL approach [4], the discretization is turned into a model selection problem and solved in a Bayesian way. Using a hierarchical prior distribution on the discretization parameters, the Bayes formula is applicable to derive an exact analytical criterion to evaluate the posterior probability of a discretization model.…”
Section: A Optimal Preprocessingmentioning
confidence: 99%
“…Figure 1 shows an example of discretization into three intervals, for the Sepal width input variable of the Iris dataset [8]. Total 57 57 36 Versicolor 34 15 1 Virginica 21 24 5 Setosa 2 18 In the MODL approach [9], the discretization is turned into a model selection problem and solved in a Bayesian way. First, a space of discretization models is defined.…”
Section: A Optimal Discretizationmentioning
confidence: 99%
“…Experiments demonstrate that even a simple equal width discretization brings superior performance compared to the assumption using a Gaussian distribution per class. In the MODL approach [7], the discretization is turned into a model selection problem and solved in a Bayesian way. First, a space of discretization models is defined.…”
Section: A Optimal Discretizationmentioning
confidence: 99%