Proceedings of the First ACM International Conference on AI in Finance 2020
DOI: 10.1145/3383455.3422554
|View full text |Cite
|
Sign up to set email alerts
|

Generating synthetic data in finance

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
41
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 99 publications
(41 citation statements)
references
References 37 publications
0
41
0
Order By: Relevance
“…Transaction. This dataset is collected from the anti-money laundering (AML) financial system [3,6] which provides transaction records between users over time. At each timestamp, we construct a transaction network to represent the transactions occurring inside this timestamp.…”
Section: A Analysismentioning
confidence: 99%
“…Transaction. This dataset is collected from the anti-money laundering (AML) financial system [3,6] which provides transaction records between users over time. At each timestamp, we construct a transaction network to represent the transactions occurring inside this timestamp.…”
Section: A Analysismentioning
confidence: 99%
“…The highly-sensitive nature of the financial data and the resulting access restrictions are major obstacles in this process. Latest proposals attempt to tackle these issues by improving the quality of synthetic data [9] and building legal-technical frameworks in order to balance the opposing openness and privacy forces [135]. (iii) Non-Profits and AI Consortia: Lately, a number of non-profits and AI consortia has been formed to primarily focus on responsible and trustworthy AI development (e.g.…”
Section: Ai Ethics Organizationsmentioning
confidence: 99%
“…In this context, the term 'synthetic data' denotes data of a generative process grounded in the inherent properties of real data. Modeling these processes provides a nuanced understanding of underlying patterns, unlocking insights, especially in high-stake domains (Assefa et al 2020;Schreyer et al 2019). This idea stands apart from conventional data obfuscation methods such as anonymization or eliminating sensitive attributes.…”
Section: Introductionmentioning
confidence: 99%
“…To effectively train these models, characterized by extensive parameters, substantial compute resources, and extensive training data are crucial (de Goede 2023). However, challenges arise when sensitive data is distributed across various institutions, such as hospitals, municipalities, and financial authorities, and cannot be shared due to privacy concerns (Assefa et al 2020;Schreyer, Sattarov, and Borth 2022). Recently, the concept of Federated Learning (FL) was proposed in which multiple devices, such as smartphones or servers, collaboratively train AI models under the orchestration of a central server (Kairouz et al 2019).…”
Section: Introductionmentioning
confidence: 99%