2016
DOI: 10.1016/j.datak.2015.11.002
|View full text |Cite
|
Sign up to set email alerts
|

An instance selection method for large datasets based on Markov Geometric Diffusion

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 13 publications
(4 citation statements)
references
References 29 publications
0
4
0
Order By: Relevance
“…Due to this problem, minority class samples are less attended to, affecting incorrect classification results [ 18 ]. The classification error of an unbalanced data set is exacerbated by the limited number of samples and a large number of features [ 20 , 21 ]. Therefore, it is necessary to consider selecting an appropriate analysis model based on such unbalanced data in computer modeling.…”
Section: Discussionmentioning
confidence: 99%
“…Due to this problem, minority class samples are less attended to, affecting incorrect classification results [ 18 ]. The classification error of an unbalanced data set is exacerbated by the limited number of samples and a large number of features [ 20 , 21 ]. Therefore, it is necessary to consider selecting an appropriate analysis model based on such unbalanced data in computer modeling.…”
Section: Discussionmentioning
confidence: 99%
“…Data reduction may be in terms of the number of rows (instances) or in terms of the number of columns (features) ( Aggarwal, 2015 ). In this sense, three main approaches have been proposed: (1) feature selection ( Cheng, Cai, Zhang, Xu, & Su, 2015;Ganapathi & Duraivelu, 2015;Xia, Fang, & Zhang, 2014 ), (2) instance selection ( García, Luengo, & Herrera, 2015;de Oliveira Moura, de Freitas, Cardoso, & Cavalcanti, 2014;Silva, Souza, & Motta, 2016 ) and (3) hybrid, where feature selection and instance selection are combined ( Chen, Zhang, Jin, & Kim, 2014 ).…”
Section: Related Workmentioning
confidence: 99%
“…This problem causes underestimation of the minority class examples and produces bias and inaccurate classification results toward the majority class examples [ 1 ]. Classification of an imbalanced data set becomes more severe with limited number samples and a huge number of features [ 3 , 4 ].…”
Section: Introductionmentioning
confidence: 99%