2021
DOI: 10.31234/osf.io/d7zwj
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Using synthetic data to improve the reproducibility of statistical results in psychological research

Abstract: In recent years, psychological research has faced a credibility crisis, and open data are often regarded as an important step toward a more reproducible psychological science. However, privacy concerns are among the main reasons that prevent data sharing. Synthetic data procedures, which are based on the multiple imputation (MI) approach to missing data, can be used to replace sensitive data with simulated values, which can be analyzed in place of the original data. One crucial requirement of this approach is … Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(5 citation statements)
references
References 44 publications
0
5
0
Order By: Relevance
“…To this end, we investigated the behavior of these methods both with correctly specified synthesis models and under different types of misspecification, reflecting applications of synthetic data in which the analyses were either consistent with the relations that were featured in the original analysis and specified in the synthesis model (reproducibility Stages 1 and 2) or went beyond them (Stages 3 and 4). The materials used to conduct this study, including the computer code and all syntax files, are provided on the OSF (https://osf.io/3a5uq/; Grund et al, 2021).…”
Section: Study 1: Reproducibility and Robustnessmentioning
confidence: 99%
“…To this end, we investigated the behavior of these methods both with correctly specified synthesis models and under different types of misspecification, reflecting applications of synthetic data in which the analyses were either consistent with the relations that were featured in the original analysis and specified in the synthesis model (reproducibility Stages 1 and 2) or went beyond them (Stages 3 and 4). The materials used to conduct this study, including the computer code and all syntax files, are provided on the OSF (https://osf.io/3a5uq/; Grund et al, 2021).…”
Section: Study 1: Reproducibility and Robustnessmentioning
confidence: 99%
“…Although combining the raw data of multiple studies in individual participant meta-analyses is preferable from a methodological point of view (Riley et al, 2010), most psychological studies do not provide the respective raw data (Hardwicke et al, 2021;Nutu et al, 2019). Particularly, in clinical research often legal restrictions or ethical considerations prevent sharing the raw data (see Grund et al, 2021, for a potential remedy). Therefore, meta-analyses of summary statistics are the only viable solution in many situations.…”
Section: Randomized Control Trials For Continuous Outcomesmentioning
confidence: 99%
“…Generally, there are three ways to impute data: joint modeling, sequential modeling, and fully conditional specification [12,16,17]. With joint modeling, the entire joint distribution of the data is specified as a single multivariate distribution, and the imputations are drawn from this model.…”
Section: Generating Synthetic Data With Micementioning
confidence: 99%
“…Choosing an adequate imputation model to impute the data is paramount, as a flawed imputation model may drastically impact the validity of inferences [12,13]. Imputation models should be as flexible as possible to capture most of the patterns in the data, and to model possibly unanticipated data characteristics [16,24].…”
Section: Generating Synthetic Data With Micementioning
confidence: 99%
See 1 more Smart Citation