2004
DOI: 10.1023/b:amai.0000018580.96245.c6
|View full text |Cite
|
Sign up to set email alerts
|

Theoretical Comparison between the Gini Index and Information Gain Criteria

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

5
245
0
3

Year Published

2011
2011
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 549 publications
(253 citation statements)
references
References 15 publications
5
245
0
3
Order By: Relevance
“…These values were accompanied by a label for the correct methylation state reported by the upstream marker on the same DNA strand. The Gini importance was calculated for each feature in a specific context using a forest of extremely randomized trees (21,23).…”
Section: Methodsmentioning
confidence: 99%
“…These values were accompanied by a label for the correct methylation state reported by the upstream marker on the same DNA strand. The Gini importance was calculated for each feature in a specific context using a forest of extremely randomized trees (21,23).…”
Section: Methodsmentioning
confidence: 99%
“…Gini(S )'s minimum is 0, all of the members in the set belong to the same class, indicating that the maximum useful information can be obtained. When all of the samples in the set distribute equally for each class, Gini(S ) is at its maximum, indicating that the minimum useful information can be obtained [6], [12], [13]. However, most studies of Gini-Index have been used only for splitting attributes in a decision tree.…”
Section: Gini-index Theory For Feature Selectionmentioning
confidence: 99%
“…Chandra et al 2009), among still others [1], [6], [8], [10], [15]. And several researchers have indicated that feature selection was biased towards attributes with a large number of possible values, having more values, a larger number of categories, multiple-valued attributes, a large number of missing values, etc, and many studies on unbiased split selection have been introduced [6]. Recently, Carolin Strobl et al (2007) introduced unbiased split selection for classification trees based on the Gini-Index and a new split selection criterion that avoids variable selection bias on standard impurity measures, and Marco Sandri (2008) presented a simple and effective method for bias correction focused on the easily generalizable case of the Gini-Index [3], [7].…”
Section: Introductionmentioning
confidence: 98%
See 1 more Smart Citation
“…The advantage of such an approach, while it can be computationally more expensive, is that it captures more information about the cluster. This is akin to classification with the Gini index or information gain heuristics [16], which summarize a set of examples by means of its class distribution instead of the majority class. We plan to incorporate different types of constraints in our models.…”
Section: Future Workmentioning
confidence: 99%