2021
DOI: 10.31449/inf.v45i2.3479
|View full text |Cite
|
Sign up to set email alerts
|

Impact of Data Balancing During Training for Best Predictions

Abstract: To protect the middle class from over-indebtedness, banking institutions need to implement a flexible analytic-based evaluation method to improve the banking process by detecting customers who are likely to have difficulty in managing their debt. In this paper, we test and evaluate a large variety of data balancing methods on selected machine learning algorithms (MLAs) to overcome the effects of imbalanced data and show their impact on the training step to predict credit risk. Our objective is to deal with dat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 21 publications
0
4
0
Order By: Relevance
“…It was, however, noted that these conventional methods for combating fraud in MMS are ineffective due to the problems of cybercrime [28]. There are many difficulties with the procedures, rules, and measures for MMS to offer tools for curbing cybercrime threats because no practical solution has been offered, particularly in the context of developing countries, as demonstrated by the survey conducted in [29].…”
Section: Related Workmentioning
confidence: 99%
“…It was, however, noted that these conventional methods for combating fraud in MMS are ineffective due to the problems of cybercrime [28]. There are many difficulties with the procedures, rules, and measures for MMS to offer tools for curbing cybercrime threats because no practical solution has been offered, particularly in the context of developing countries, as demonstrated by the survey conducted in [29].…”
Section: Related Workmentioning
confidence: 99%
“…Synthetic Minority Over-sampling Technique (SMOTE) algorithm is one of the well-known augmentation techniques that are used in imbalanced datasets to solve minority class problems. In the imbalanced dataset, there are too few instances of minority classes that affect model decisions [25].…”
Section: Using Data Augmentation Techniquementioning
confidence: 99%
“…Given any dataset on financial transactions such as MMTs, the number of fraudulent transactions (positive class) compared to the legitimate (negative class) ones, constitutes a very small percentage of the dataset. This makes the datasets highly imbalanced [6] and predictions from such data using machine learning algorithms are skewed towards the legitimate transactions with the long term effect that predictions made with such data can be misleading. We select Logistic Regression as the machine learning algorithm for this work as it has proven its potency [7] in a multitude of fields for classification and prediction.…”
Section: Introductionmentioning
confidence: 99%