2021
DOI: 10.14569/ijacsa.2021.0120110
|View full text |Cite
|
Sign up to set email alerts
|

Predicting the Depression of the South Korean Elderly using SMOTE and an Imbalanced Binary Dataset

Abstract: Since the number of healthy people is much more than that of ill people, it is highly likely that the problem of imbalanced data will occur when predicting the depression of the elderly living in the community using big data. When raw data are directly analyzed without using supplementary techniques such as a sample algorithm for datasets, which have imbalanced class ratios, it can decrease the performance of machine learning by causing prediction errors in the analysis process. Therefore, it is necessary to u… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 8 publications
(10 citation statements)
references
References 35 publications
0
10
0
Order By: Relevance
“…If there is an imbalance of classes, the group with a larger number of data is treated as more important, and the predictive performance decreases. Undersampling, oversampling, and synthetic minority over-sampling technique (SMOTE) methods are mainly used to deal with data imbalance[ 17 ], and it has been reported that the performance of SMOTE is generally better than that of undersampling and oversampling[ 18 ].…”
Section: Types Of Machine Learningmentioning
confidence: 99%
“…If there is an imbalance of classes, the group with a larger number of data is treated as more important, and the predictive performance decreases. Undersampling, oversampling, and synthetic minority over-sampling technique (SMOTE) methods are mainly used to deal with data imbalance[ 17 ], and it has been reported that the performance of SMOTE is generally better than that of undersampling and oversampling[ 18 ].…”
Section: Types Of Machine Learningmentioning
confidence: 99%
“…The third domain believed to affect subjective health is economic activity. Economic activity makes earning and life, in general, more energetic, as the daily routine encourages people to become more diligent [ 39 , 40 , 41 , 42 ]. Previous studies have demonstrated a link between economic activity and subjective health.…”
Section: Review Of Literature and Hypotheses Developmentmentioning
confidence: 99%
“…The imbalanced data problem exists in many datasets; as a result, classifiers models are biased against the minority class and are unable to predict it accurately [13]. In contrast, most machine learning models perform better when applied with balanced datasets [14,15,16,17].…”
Section: A Data-level Approach and Imbalanced Datamentioning
confidence: 99%
“…Since the sample size grows, the oversampling technique takes longer to construct a model and can cause overfitting because it duplicates samples from a minor class. [23,24]. b) SMOTE: SMOTE is similar to random oversampling.…”
Section: ) Over-sampling Techniquementioning
confidence: 99%