2016
DOI: 10.1007/978-3-319-42291-6_84
|View full text |Cite
|
Sign up to set email alerts
|

Partially Synthesised Dataset to Improve Prediction Accuracy

Abstract: Abstract. The real world data sources, such as statistical agencies, library databanks and research institutes are the major data sources for researchers. Using this type of data involves several advantages including, the improvement of credibility and validity of the experiment and more importantly, it is related to a real world problems and typically unbiased. However, this type of data is most likely unavailable or inaccessible for everyone due to the following reasons. First, privacy and confidentiality co… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 15 publications
0
4
0
Order By: Relevance
“…Simulation and prediction research requires a large number of datasets to precisely predict behaviors and outcomes [31]. Real-world sources (e.g., from statistical agencies) have a significant advantage but are also most likely to be inaccessible to most researchers [32].…”
Section: Simulation Studies and Predictive Analyticsmentioning
confidence: 99%
See 1 more Smart Citation
“…Simulation and prediction research requires a large number of datasets to precisely predict behaviors and outcomes [31]. Real-world sources (e.g., from statistical agencies) have a significant advantage but are also most likely to be inaccessible to most researchers [32].…”
Section: Simulation Studies and Predictive Analyticsmentioning
confidence: 99%
“…Synthetic data has been used in disease-specific hybrid simulation [33] and microsimulation for testing policy options [24,34] and health care financing strategies evaluation [35]. Studies also used synthetic data to validate simulation and prediction models [36] and to improve prediction accuracy [32].…”
Section: Simulation Studies and Predictive Analyticsmentioning
confidence: 99%
“…Synthetic data has the potential to estimate the benefit of screening and healthcare policies, treatments, or clinical interventions, augment machine learning algorithms (e.g., image classification pipelines), pre-train machine learning models that can then be fine-tuned for specific patient populations, and improve public health models to predict outbreaks of infectious diseases [17][18][19][20][21][22][23][24][25][26][27] .…”
Section: Applications Of Synthetic Data In Healthcarementioning
confidence: 99%
“…An alternative to data augmentation is the creation of artificial images through the construction of images that mimic the appearance of vessel elements. Partially synthesised data proved to be useful in improving accuracy in machine learning classifiers [Aljaaf et al 2016]. The advantage of using artificial images such as those generated by Antczak et al [Antczak and Liberadzki 2018] for simulating vessels in angiographies is the fact that they are not related to the original dataset.…”
Section: Introductionmentioning
confidence: 99%