2023
DOI: 10.1111/acfi.13044
|View full text |Cite
|
Sign up to set email alerts
|

Predicting accounting fraud using imbalanced ensemble learning classifiers – evidence from China

Abstract: The current research aims to launch effective accounting fraud detection models using imbalanced ensemble learning algorithms for China A‐Share listed firms. Based on a sample of 33,544 Chinese firm‐year instances from 1998 to 2017, this research respectively established one logistic regression and four ensemble learning classifiers (AdaBoost, XGBoost, CUSBoost, and RUSBoost) by 12 financial ratios and 28 raw financial data. Additionally, we divided the sample into the train and test observations to evaluate t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 11 publications
(7 citation statements)
references
References 116 publications
0
7
0
Order By: Relevance
“…We further identify a list of 28 raw financial data items from corporation financial statements which are used to calculate above selected 12 financial indicators, following Sun et al (2021) and Rahman & Zhu (2023). As shown in Table 2, there are 21 items from the Balance Sheet (i.e., Current assets; Cash; Short‐term investments; Current liabilities; Debt in Current Liabilities; Total assets; Available‐for‐sale securities; Held‐to‐maturity securities; Long term equity securities; Total liabilities; Long‐term debt; Debt in current liabilities; Accounts receivables; Inventory; Property, plant and equipment; Construction in progress; Project material; Accounts payable; Surplus reserve; Retained earnings; Income taxes payable ), five items from the Income Statement (i.e., Sales; Cost of goods sold; Net income; Financial expenses; Income taxes ), and two items from the Cash flow statement (i.e., Cash received from absorbing investments; Proceeds from issuance of bonds ).…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…We further identify a list of 28 raw financial data items from corporation financial statements which are used to calculate above selected 12 financial indicators, following Sun et al (2021) and Rahman & Zhu (2023). As shown in Table 2, there are 21 items from the Balance Sheet (i.e., Current assets; Cash; Short‐term investments; Current liabilities; Debt in Current Liabilities; Total assets; Available‐for‐sale securities; Held‐to‐maturity securities; Long term equity securities; Total liabilities; Long‐term debt; Debt in current liabilities; Accounts receivables; Inventory; Property, plant and equipment; Construction in progress; Project material; Accounts payable; Surplus reserve; Retained earnings; Income taxes payable ), five items from the Income Statement (i.e., Sales; Cost of goods sold; Net income; Financial expenses; Income taxes ), and two items from the Cash flow statement (i.e., Cash received from absorbing investments; Proceeds from issuance of bonds ).…”
Section: Methodsmentioning
confidence: 99%
“…We apply two metrics, AUC and NDCG@k, to evaluate different models' fraud detection performance. These two metrics are widely applied to assess the performance of classifiers (Sun et al, 2021; Rahman & Zhu, 2023; Li et al, 2023).…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…ML and DL models can be used parallelly to build a model with improved capabilities, as in Hybrid Learning (HL). They can also be combined, and the output of one model is the input for the second, and that is called Ensembled Learning (EL) as in [49] [29]. Accordingly, the training process of CADM uses an HL approach for enhanced performance.…”
Section: Big Data In Accounting and Auditingmentioning
confidence: 99%
“…Other N-FIN features can be included according to their availability in Saudi sources, such as board meetings, meeting minutes, and meeting content, as tested in [26]. As shown in TABLE IV., FIN and N-FIN features selection will follow the criteria in [50], [49], and [51] accordingly. For any AI model to be accurate, variables should be limited [52], though this study proposes to test extended obtainable variables and exclude some in the exploration process if needed.…”
Section: B Variablesmentioning
confidence: 99%