2021
DOI: 10.1109/tifs.2021.3076932
|View full text |Cite
|
Sign up to set email alerts
|

Gini-Impurity Index Analysis

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
21
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 54 publications
(21 citation statements)
references
References 33 publications
0
21
0
Order By: Relevance
“…The winning feature is chosen in a deterministic manner to maximize the homogeneity in each child node. We use Gini impurity 30 criteria to choose the winner feature and also as a measure of homogeneity in our design. The homogeneity of a node is calculated based on the distribution of data with different class labels in that node.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The winning feature is chosen in a deterministic manner to maximize the homogeneity in each child node. We use Gini impurity 30 criteria to choose the winner feature and also as a measure of homogeneity in our design. The homogeneity of a node is calculated based on the distribution of data with different class labels in that node.…”
Section: Methodsmentioning
confidence: 99%
“…A feature that causes more reduction of information across the trees of a random forest is assigned more importance in our computation. This is called Gini importance 30 . Based on the importance values, we take the top N most important features for the subsequent classification task.…”
Section: Methodsmentioning
confidence: 99%
“…The Gini index is selected as the splitting attribute, and finally the binary tree is generated. 18 Gini coefficient is used to represent the impurity of data set. To represent sample set H , its Gini coefficient can be expressed as follows: where Pi represents the probability that the data in the sample set H belongs to class n .…”
Section: The Review Of Research Methodsmentioning
confidence: 99%
“…In a general way, the best separation is the one that divides the data into groups such that there is a dominant class. To measure that, the algorithm in our experiments uses the Gini diversity index, which is one of the possible impurity measures [40]. The Gini diversity index is a measure of how often a randomly chosen item from a set would be incorrectly labelled if it were randomly labelled according to the distribution of labels in the subset.…”
Section: Machine Learning Modelmentioning
confidence: 99%