2014
DOI: 10.1177/0272989x14560647
|View full text |Cite
|
Sign up to set email alerts
|

The Impact of Oversampling with SMOTE on the Performance of 3 Classifiers in Prediction of Type 2 Diabetes

Abstract: To determine a classifier with a machine learning algorithm like the PNN and DT, class skew in data should be considered. The NB and DT were optimal classifiers in a prediction task in an imbalanced medical database.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
53
0
1

Year Published

2016
2016
2021
2021

Publication Types

Select...
6
3

Relationship

2
7

Authors

Journals

citations
Cited by 68 publications
(55 citation statements)
references
References 27 publications
1
53
0
1
Order By: Relevance
“…The second category deals with disease prediction and diagnosis [40], [41], [42], [43], [44], [45], [46], [47], [48], [49], [50], [51], [52], [53], [54], [55], [56], [57], [58], [59], [60], [61], [62], [63], [64], [65], [66], [67], [68], [69], [70], [71], [72], [73], [74], [75], [76]. Numerous algorithms and different approaches have been applied, such as traditional machine learning algorithms, ensemble learning approaches and association rule learning in order to achieve the best classification accuracy.…”
Section: Dm Through Machine Learning and Data Miningmentioning
confidence: 99%
“…The second category deals with disease prediction and diagnosis [40], [41], [42], [43], [44], [45], [46], [47], [48], [49], [50], [51], [52], [53], [54], [55], [56], [57], [58], [59], [60], [61], [62], [63], [64], [65], [66], [67], [68], [69], [70], [71], [72], [73], [74], [75], [76]. Numerous algorithms and different approaches have been applied, such as traditional machine learning algorithms, ensemble learning approaches and association rule learning in order to achieve the best classification accuracy.…”
Section: Dm Through Machine Learning and Data Miningmentioning
confidence: 99%
“…Assim, para trabalhos futuros pretende-se testar outras técnicas voltadas para dados desbalanceados, tanto no pré-processamento como na codificação do algoritmo classificador (ou associação de mais de um classificador), com o intuito de encontrar adaptações capazes de evitar viés para as classes majoritárias. Dentre as técnicas citadas na literatura utilizada como referência, conforme Batista, Prati e Monard (2004, 2008; e ainda Ramezankhani et al (2016) …”
Section: Resultsunclassified
“…There are a finite number of solutions to handle imbalanced data sets 21–23. In our previous work, we showed the effectiveness of Synthetic Minority Oversampling Technique (SMOTE) for handling imbalanced data sets 24. In this study, we balanced two training data sets of men and women using SMOTE as previously reported (figure 2).…”
Section: Methodsmentioning
confidence: 93%
“…In this study, we balanced two training data sets of men and women using SMOTE as previously reported (figure 2). 24…”
Section: Methodsmentioning
confidence: 99%