2023
DOI: 10.3390/data8110169
|View full text |Cite
|
Sign up to set email alerts
|

Machine Learning for Credit Risk Prediction: A Systematic Literature Review

Jomark Pablo Noriega,
Luis Antonio Rivera,
José Alfredo Herrera

Abstract: In this systematic review of the literature on using Machine Learning (ML) for credit risk prediction, we raise the need for financial institutions to use Artificial Intelligence (AI) and ML to assess credit risk, analyzing large volumes of information. We posed research questions about algorithms, metrics, results, datasets, variables, and related limitations in predicting credit risk. In addition, we searched renowned databases responding to them and identified 52 relevant studies within the credit industry … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(3 citation statements)
references
References 65 publications
0
3
0
Order By: Relevance
“…Therefore, for a solid experimental conclusion, another 4-fold cross-validation experiment was also studied. No sampling (original data) Logistic regression 0.9882 (12) 0.9920 (10) 0.9506 (12) 0.9709 (12) 0.9639 (12) Random forest 0.9979 (4) 0.9999 (3) 0.9902 (6) 0.9951 (4) 0.9938 (4) Gradient boosting 0.9961 (9) 0.9999 (3) 0.9812 (10) 0.9905 (9) 0.9882 (9) Over-sampling Logistic regression 0.9914 (11) 0.9895 (12) 0.9685 (11) 0.9789 (11) 0.9736 (11) Random forest 0.9999 (2) 1.0000 (1) 0.9999 (2) 0.9999 (2) 0.9999 (2) Gradient boosting 1.0000 (1) 1.0000 (1) 1.0000 (1) 1.0000 (1) 1.0000 (1) Under-sampling Logistic regression 0.9950 (10) 0.9900 (11) 0.9856 (9) 0.9878 (10) 0.9847 (10) Random forest 0.9986 (3) 0.9989 (8) 0.9940 (3) 0.9965 (3) 0.9956 (3) Gradient boosting 0.9979 (4) 0.9992 (7) 0.9908 (4) 0.9950 (5) 0.9937 (5) Combined sampling Logistic regression 0.9966…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…Therefore, for a solid experimental conclusion, another 4-fold cross-validation experiment was also studied. No sampling (original data) Logistic regression 0.9882 (12) 0.9920 (10) 0.9506 (12) 0.9709 (12) 0.9639 (12) Random forest 0.9979 (4) 0.9999 (3) 0.9902 (6) 0.9951 (4) 0.9938 (4) Gradient boosting 0.9961 (9) 0.9999 (3) 0.9812 (10) 0.9905 (9) 0.9882 (9) Over-sampling Logistic regression 0.9914 (11) 0.9895 (12) 0.9685 (11) 0.9789 (11) 0.9736 (11) Random forest 0.9999 (2) 1.0000 (1) 0.9999 (2) 0.9999 (2) 0.9999 (2) Gradient boosting 1.0000 (1) 1.0000 (1) 1.0000 (1) 1.0000 (1) 1.0000 (1) Under-sampling Logistic regression 0.9950 (10) 0.9900 (11) 0.9856 (9) 0.9878 (10) 0.9847 (10) Random forest 0.9986 (3) 0.9989 (8) 0.9940 (3) 0.9965 (3) 0.9956 (3) Gradient boosting 0.9979 (4) 0.9992 (7) 0.9908 (4) 0.9950 (5) 0.9937 (5) Combined sampling Logistic regression 0.9966…”
Section: Resultsmentioning
confidence: 99%
“…Machine learning techniques have several benefits in various applications, especially in the form of predicting a trend or outcome. Hence, machine learning models can accurately assess credit default probabilities and improve credit risk prediction [1]. Focusing on financial services like personal loans, accurately predicting the risk of non-performing loans (NPLs) in peer-to-peer (P2P) lending is one crucial thing for lenders such as P2P lending platforms.…”
Section: Introductionmentioning
confidence: 99%
“…As a result, some of the loan received by entrepreneurs does not bear fruit. Many researchers in the existing literature have tried to predict whether or not a person can be granted loan based on certain criteria and characteristics of the person using machine learning (ML) (Guan et al, 2023;Noriega et al, 2023;. Unfortunately, the literature review revealed that there is a lack of research based on the prediction of loan fructification by entrepreneurs.…”
Section: Introductionmentioning
confidence: 99%