2021 22nd International Arab Conference on Information Technology (ACIT) 2021
DOI: 10.1109/acit53391.2021.9677302
|View full text |Cite
|
Sign up to set email alerts
|

A Survey of Synthetic Data Generation for Machine Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 20 publications
(9 citation statements)
references
References 29 publications
0
5
0
Order By: Relevance
“…This data set can be used with any machine‐learning module, thus allowing other researchers to utilize it in their research. Synthetic data generation for machine‐learning is an active area of research, especially in the domain of medicine and health care, where the privacy of patient data is a big concern [3, 14, 22, 29, 40, 46]. To the best of our knowledge, no existing open‐source contribution automate the diagnosis of the three hearing loss attributes: type, degree, and configuration.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…This data set can be used with any machine‐learning module, thus allowing other researchers to utilize it in their research. Synthetic data generation for machine‐learning is an active area of research, especially in the domain of medicine and health care, where the privacy of patient data is a big concern [3, 14, 22, 29, 40, 46]. To the best of our knowledge, no existing open‐source contribution automate the diagnosis of the three hearing loss attributes: type, degree, and configuration.…”
Section: Related Workmentioning
confidence: 99%
“…This data set can be used with any machine‐learning module, thus allowing other researchers to utilize it in their research. Synthetic data generation for machine‐learning is an active area of research, especially in the domain of medicine and health care, where the privacy of patient data is a big concern [3, 14, 22, 29, 40, 46].…”
Section: Related Workmentioning
confidence: 99%
“…However, improving data sources to feed artificial intelligence (AI) algorithms is not limited to DA exclusively. Therefore, some studies have decided to take the path of building synthetic traffic generators to build their datasets almost from scratch; some examples focusing on this aspect are [17][18][19]. In this way, they are able to abstract from the dataset itself, which is only necessary to understand the distribution of the data.…”
Section: Related Workmentioning
confidence: 99%
“…There are four main techniques that allow deep models to be trained on small datasets. First is synthetic data generation [35][36][37]. If meaningful data can be artificially computer generated, this can reduce the need for timely manual collection and labeling.…”
Section: Introductionmentioning
confidence: 99%