2023
DOI: 10.1371/journal.pone.0283094
|View full text |Cite
|
Sign up to set email alerts
|

Machine learning models trained on synthetic datasets of multiple sample sizes for the use of predicting blood pressure from clinical data in a national dataset

Abstract: Introduction The potential for synthetic data to act as a replacement for real data in research has attracted attention in recent months due to the prospect of increasing access to data and overcoming data privacy concerns when sharing data. The field of generative artificial intelligence and synthetic data is still early in its development, with a research gap evidencing that synthetic data can adequately be used to train algorithms that can be used on real data. This study compares the performance of a serie… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
2
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(2 citation statements)
references
References 27 publications
0
2
0
Order By: Relevance
“…Moreover, they have the potential to identify nonlinear relationships between known deposits and evidence layers. With advancements in technology, such as faster computers and larger datasets, machine learning algorithms have shown their full potential [29,30]. It is now recognized that a good representation of data and the availability of large amounts of example data are crucial for successful machine learning [31,32].…”
Section: Introductionmentioning
confidence: 99%
“…Moreover, they have the potential to identify nonlinear relationships between known deposits and evidence layers. With advancements in technology, such as faster computers and larger datasets, machine learning algorithms have shown their full potential [29,30]. It is now recognized that a good representation of data and the availability of large amounts of example data are crucial for successful machine learning [31,32].…”
Section: Introductionmentioning
confidence: 99%
“…Finally, synthetic data, despite its recognized advantages, carries some risks of being used maliciously or as a means to bypass data protection legislation, if not used properly [3]. Therefore, for the future steps, while we explore the use of synthetic data in the LA domain for multivariate data types (e.g., time-series data), we should also explore how to prevent the misuse of synthetic data before malicious use of synthetic data occurs.…”
mentioning
confidence: 99%