Proceedings of the 35th Annual ACM Symposium on Applied Computing 2020
DOI: 10.1145/3341105.3374084
|View full text |Cite
|
Sign up to set email alerts
|

Large-scale machine learning for business sector prediction

Abstract: In this study we use machine learning to perform explainable business sector prediction from financial statements. Financial statements are a valuable source of information on the financial state and performance of firms. Recently, large-scale data on financial statements has become available in the form of open data sets. Previous work on such data mainly focused on predicting fraud and bankruptcy. In this paper we devise a model for business sector prediction, which has several valuable applications, includi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(3 citation statements)
references
References 19 publications
(29 reference statements)
0
3
0
Order By: Relevance
“…Random undersampling was used in previous studies by Angenent et al (2020), Kou et al (2021), Le, Vo, Vo, et al (2019), Liu et al (2022), Wang et al (2018), and Zhou et al (2016), and random oversampling was applied in the studies by Angenent et al (2020), Kou et al (2021), Le, Vo, Vo, et al (2019), and Liu et al (2022), whereas SMOTE was applied in the studies by Farooq and Qamar (2019), Kou et al (2021), Le, Vo, Vo, et al (2019), Liu et al (2022), Roumani et al (2020), Wang et al (2019), and Tumpach et al (2020). The Ovun.sample function from the “ROSE” version 0.0‑3 package in RStudio was used for all three resampling approaches applied in this study.…”
Section: Methodsmentioning
confidence: 99%
“…Random undersampling was used in previous studies by Angenent et al (2020), Kou et al (2021), Le, Vo, Vo, et al (2019), Liu et al (2022), Wang et al (2018), and Zhou et al (2016), and random oversampling was applied in the studies by Angenent et al (2020), Kou et al (2021), Le, Vo, Vo, et al (2019), and Liu et al (2022), whereas SMOTE was applied in the studies by Farooq and Qamar (2019), Kou et al (2021), Le, Vo, Vo, et al (2019), Liu et al (2022), Roumani et al (2020), Wang et al (2019), and Tumpach et al (2020). The Ovun.sample function from the “ROSE” version 0.0‑3 package in RStudio was used for all three resampling approaches applied in this study.…”
Section: Methodsmentioning
confidence: 99%
“…Information extracted from other areas, such as clinical registers, that may be combined with police reports of sexual or domestic violence [42,67,68,87] or from managerial and financial reports to support the detection of fraud [49,[105][106][107] are necessary for the enrichment of the analytical process. Combining clinical and police reports can help health professionals understand the causes of trauma that generate psychological illness in victims, supporting a more adjusted treatment, and the police to understand sexual or domestic violence patterns to prevent the occurrence or recurrence of related situations.…”
Section: Scenario Changes and Extensionsmentioning
confidence: 99%
“…DT, SVM, ANN, k-NN, etc. [43,120,129,136] Misclassification costs, exceptionally putting more focus on the minority class.…”
Section: Thresholdmentioning
confidence: 99%