2021
DOI: 10.48550/arxiv.2109.14001
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Analysis of Error-prone Electronic Health Records with Multi-wave Validation Sampling: Association of Maternal Weight Gain during Pregnancy with Childhood Outcomes

Bryan E. Shepherd,
Kyunghee Han,
Tong Chen
et al.

Abstract: Electronic health record (EHR) data are increasingly used for biomedical research, but these data have recognized data quality challenges. Data validation is necessary to use EHR data with confidence, but limited resources typically make complete data validation impossible. Using EHR data, we illustrate prospective, multi-wave, two-phase validation sampling to estimate the association between maternal weight gain during pregnancy and the risks of her child developing obesity or asthma. The optimal validation s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 40 publications
0
2
0
Order By: Relevance
“…However, if wave I is too large, the number sampled in some strata in wave I may already be larger than the number required by the optimal design, leading to sub-optimality, so that appropriate choice of wave I size is important. Shepherd et al (2021) studied maternal and child risk factors for childhood obesity and asthma in 10,335 children, using electronic health record data augmented by a Phase II sample of 996 manually audited records. A six-wave sampling design was used to choose a validation sample for gestational age, weight gain during pregnancy, and asthma status.…”
Section: Multiwave Samplingmentioning
confidence: 99%
See 1 more Smart Citation
“…However, if wave I is too large, the number sampled in some strata in wave I may already be larger than the number required by the optimal design, leading to sub-optimality, so that appropriate choice of wave I size is important. Shepherd et al (2021) studied maternal and child risk factors for childhood obesity and asthma in 10,335 children, using electronic health record data augmented by a Phase II sample of 996 manually audited records. A six-wave sampling design was used to choose a validation sample for gestational age, weight gain during pregnancy, and asthma status.…”
Section: Multiwave Samplingmentioning
confidence: 99%
“…Performing the validation on a well-selected subsample gives the benefits of chart review at lower cost. Recently, Shepherd et al (2021) studied the association of maternal weight gain during pregnancy with child risk factors using the EHR data. A subsample of records, which are selected by the optimal design for design-based analysis, is validated by manual chart review.…”
Section: Introductionmentioning
confidence: 99%