“…Most research focuses on supervised learning [3, 15, 19, 29, 45-47, 49, 58] or utilises unsupervised learning to label the data and then applies a supervised learner to predict the label [2,33]. Moreover, the classification usually falls within two paradigms: account [3,15,19,33,46,47,58] or transaction level [26,49] [3,19,26,30,45,49,58]. However, the most common models are DT ensembles, more particularly RF [2, 3, 19, 26, 45-47, 47, 49, 58] and gradient boosting (e.g., XGBoost and LGBM) [15,19,30,45,46,58].…”