2020
DOI: 10.1111/1475-679x.12292
|View full text |Cite|
|
Sign up to set email alerts
|

Detecting Accounting Fraud in Publicly Traded U.S. Firms Using a Machine Learning Approach

Abstract: We develop a state-of-the-art fraud prediction model using a machine learning approach. We demonstrate the value of combining domain knowledge and machine learning methods in model building. We select our model input based on existing accounting theories, but we differ from prior accounting research by using raw accounting numbers rather than financial ratios.We employ one of the most powerful machine learning methods, ensemble learning, rather than the commonly used method of logistic regression. To assess th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

6
230
3
4

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
3

Relationship

0
6

Authors

Journals

citations
Cited by 277 publications
(243 citation statements)
references
References 54 publications
(106 reference statements)
6
230
3
4
Order By: Relevance
“…This evidence is consistent with Bao et al. [], who document that the AAER sample is more stable prior to 2005, due to the long time lag in SEC investigations and possible changes in the SEC's enforcement priorities after the 2008 financial crisis. We observe a similar post‐2005 decline in the AA sample.…”
Section: Data and Empirical Measuressupporting
confidence: 91%
See 4 more Smart Citations
“…This evidence is consistent with Bao et al. [], who document that the AAER sample is more stable prior to 2005, due to the long time lag in SEC investigations and possible changes in the SEC's enforcement priorities after the 2008 financial crisis. We observe a similar post‐2005 decline in the AA sample.…”
Section: Data and Empirical Measuressupporting
confidence: 91%
“…We follow prior research and use the area under the receiver operating characteristics (ROC) curve to evaluate the out‐of‐sample classification performance of each detection model (Hobson, Mayew, and Venkatachalam [], Larcker and Zakolyukina [], Bao et al. []). The ROC curve is a two‐dimensional plot across different cutoff thresholds of the true positive rate (sensitivity) on the y ‐axis against the false positive rate (specificity) on the x ‐axis.…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations