2022
DOI: 10.1109/tits.2020.3009186
|View full text |Cite
|
Sign up to set email alerts
|

Analysis of Classifier Training on Synthetic Data for Cross-Domain Datasets

Abstract: A major challenges of deep learning (DL) is the necessity to collect huge amounts of training data. Often, the lack of a sufficiently large dataset discourages the use of DL in certain applications. Typically, acquiring the required amounts of data costs considerable time, material and effort. To mitigate this problem, the use of synthetic images combined with real data is a popular approach, widely adopted in the scientific community to effectively train various detectors. In this study, we examined the poten… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(5 citation statements)
references
References 44 publications
0
5
0
Order By: Relevance
“…Some other works apply image augmentations to the generated samples instead of focusing on the 3D scene generation and rendering step. [32] generate new augmented synthetic samples using a semisupervised errors-guide method to improve the DNN accuracy on cross-domain datasets.…”
Section: Dnn Training With Synthetic and Real Datamentioning
confidence: 99%
“…Some other works apply image augmentations to the generated samples instead of focusing on the 3D scene generation and rendering step. [32] generate new augmented synthetic samples using a semisupervised errors-guide method to improve the DNN accuracy on cross-domain datasets.…”
Section: Dnn Training With Synthetic and Real Datamentioning
confidence: 99%
“…Synthetic data has demonstrated robust results in overcoming data limitations for various tasks such as dataset balancing (Xiao, Wu, and Lin 2021), data analysis (Cortés et al 2022;Zhang et al 2019;Lou et al 2022) and privacy preservation (Faisal et al 2022;Liu et al 2022). Nowadays, the main popular methods to synthesize tabular data by using deep learning are based on GANs.…”
Section: Related Work Tabular Data Synthesismentioning
confidence: 99%
“…Nevertheless, owing to the ever-present irreconcilable conflict between data leakage risk and availability, such approaches still have a loss of raw information, which results in poorly trained AI models (Cheng et al 2022). The adoption of synthetic data for machine learning has gained significant traction in recent years (Cortés et al 2022). Researchers have been exploring a well-secured approach to generate synthetic data that closely resembles real data, often referred to as "almost-butnot-quite replica data" (Ganev, Oprisanu, and De Cristofaro 2022).…”
Section: Introductionmentioning
confidence: 99%
“…Moreover, comparing metrics between other algorithms is next to impossible. Synthetic data generation has proven to be a good approach to tackle the lack of data (Cortes et al, 2022). Data Augmentation is the first step that most of the current algorithms use in order to increment the amount of data with few effort (Zoph et al, 2020;Shin et al, 2020).…”
Section: Related Workmentioning
confidence: 99%