2018
DOI: 10.1007/978-981-13-2206-8_1
|View full text |Cite
|
Sign up to set email alerts
|

Classifying DNA Methylation Imbalance Data in Cancer Risk Prediction Using SMOTE and Tomek Link Methods

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
9

Relationship

0
9

Authors

Journals

citations
Cited by 18 publications
(11 citation statements)
references
References 10 publications
0
9
0
Order By: Relevance
“…8 For example, effective prediction of common diseases and cancer risks by using SMOTE + TLTE for imbalanced medical data was observed in recent studies. 23,39 Similarly, here the drinking water quality prediction performance of all four traditional and three ensemble learning models was improved by training by the data set optimized by SMOTE + TLTE and SMOTE + ENNTE. One important reason is that SMOTE can form some new minority samples by interpolating between multiple minority samples, thereby radically increasing the proportion of minority samples.…”
Section: Discussionmentioning
confidence: 99%
“…8 For example, effective prediction of common diseases and cancer risks by using SMOTE + TLTE for imbalanced medical data was observed in recent studies. 23,39 Similarly, here the drinking water quality prediction performance of all four traditional and three ensemble learning models was improved by training by the data set optimized by SMOTE + TLTE and SMOTE + ENNTE. One important reason is that SMOTE can form some new minority samples by interpolating between multiple minority samples, thereby radically increasing the proportion of minority samples.…”
Section: Discussionmentioning
confidence: 99%
“…The previous studies showed that the performance of statistical models fitted with balanced data showed better results than the imbalanced data. When statistical models are fitted with imbalanced data, they are often biased towards the majority class and show poor performance in predicting the minority class [27][28][29]. Nevertheless, it is impossible to say how much imbalance in the distribution of the classes affects classification performance because other elements like sample size, relevance of predictor variables, etc.…”
Section: Imbalance Datamentioning
confidence: 99%
“…Kamarulzalis et al [14] apply SMOTE to gender analysis, in which J48 is used as the classifier. Liu et al [15] apply SMOTE-TL to cancer risk prediction. Nakamura et al [16] propose a novel SMOTEbased method using codebooks obtained by the learning vector quantization, and then apply the proposed SMOTE in biomedical data.…”
Section: Related Workmentioning
confidence: 99%
“…Hybrid techniques, such as S-SulfPred [12] and SSOMaj-SMOTE-SSOMin [13], are developed and combine oversampling techniques with undersampling techniques. Among oversampling techniques, the Synthetic Minority Over-sampling Technique (SMOTE) [5] is the most successful due to a lot of admiration and extensive practice, such as gender analysis [14], bioengineering [15], medical examination [16], Fraud identification [17].…”
Section: Introductionmentioning
confidence: 99%