Background
Privacy restrictions limit access to protected patient-derived health information for research purposes. Consequently, data anonymization is required to allow researchers data access for initial analysis before granting institutional review board approval. A system installed and activated at our institution enables synthetic data generation that mimics data from real electronic medical records, wherein only fictitious patients are listed.
Objective
This paper aimed to validate the results obtained when analyzing synthetic structured data for medical research. A comprehensive validation process concerning meaningful clinical questions and various types of data was conducted to assess the accuracy and precision of statistical estimates derived from synthetic patient data.
Methods
A cross-hospital project was conducted to validate results obtained from synthetic data produced for five contemporary studies on various topics. For each study, results derived from synthetic data were compared with those based on real data. In addition, repeatedly generated synthetic datasets were used to estimate the bias and stability of results obtained from synthetic data.
Results
This study demonstrated that results derived from synthetic data were predictive of results from real data. When the number of patients was large relative to the number of variables used, highly accurate and strongly consistent results were observed between synthetic and real data. For studies based on smaller populations that accounted for confounders and modifiers by multivariate models, predictions were of moderate accuracy, yet clear trends were correctly observed.
Conclusions
The use of synthetic structured data provides a close estimate to real data results and is thus a powerful tool in shaping research hypotheses and accessing estimated analyses, without risking patient privacy. Synthetic data enable broad access to data (eg, for out-of-organization researchers), and rapid, safe, and repeatable analysis of data in hospitals or other health organizations where patient privacy is a primary value.
BackgroundCommunity acquired pneumonia (CAP) is a major cause of morbidity and mortality. We recently demonstrated that among young patients (<60 years old) with CAP, elevated red blood cell distribution width (RDW) level on admission was associated with significant higher rates of mortality and severe morbidity. We aimed to investigate the prognostic predictive value of RDW among CAP patients in general population of internal wards.MethodsThe cohort included patients of 18 years old or older who were diagnosed with CAP (defined as pneumonia identified 48 hours or less from hospitalization) between January 1, 2005 and December 31, 2010. Patients were retrospectively analyzed for risk factors for a primary endpoint of 90-day mortality. Secondary endpoint was defined as complicated hospitalization (defined as at least one of the following: In- hospital mortality, length of stay of at least 10 days or ICU admission). Binary logistic regression analysis was used for the calculation of the odds ratios (OR) and p values in univariate and multivariate analysis to identify association between patient characteristic, 90-day mortality and complicated hospitalization.ResultsThe cohort included 3815 patients. In univariate analysis, patients with co-morbid conditions tended to have a complicated course of CAP. In multivariate regression analysis, variables associated with an increased risk of 90-day mortality included age > 70 years, high Charlson comorbidity index (>2), Hb < 10 mg/dl, Na <130 meq/l, blood urea nitrogen (BUN) >30 mg/dl, systolic blood pressure < 90 mmHg and elevated RDW >15%. Variables associated with complicated hospitalization included high Charlson comorbidity index, BUN > 30 mg/dl, hemoglobin < 10 g/dl, heart rate >124 bpm, systolic blood pressure < 90 mmHg and elevated RDW. Mortality rate and complicated hospitalization were significantly higher among patients with increased RDW regardless of the white blood cell count or hemoglobin levels.ConclusionsElevated RDW levels on admission are associated with significant higher rates of mortality and severe morbidity in adult patients with CAP. RDW as a prognostic marker was unrelated with hemoglobin levels, WBC count, age or Charlson score.
The DT is not sufficient to identify spiritual distress. The peace subscale of FACIT-Sp-12 is a better match than the measure as a whole. The SIS is the best match for spiritual distress, although an imperfect one.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.