2021
DOI: 10.1111/biom.13512
|View full text |Cite
|
Sign up to set email alerts
|

Efficient odds ratio estimation under two‐phase sampling using error‐prone data from a multi‐national HIV research cohort

Abstract: Persons living with HIV engage in routine clinical care, generating large amounts of data in observational HIV cohorts. These data are often error‐prone, and directly using them in biomedical research could bias estimation and give misleading results. A cost‐effective solution is the two‐phase design, under which the error‐prone variables are observed for all patients during Phase I, and that information is used to select patients for data auditing during Phase II. For example, the Caribbean, Central, and Sout… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
15
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
4

Relationship

2
2

Authors

Journals

citations
Cited by 4 publications
(15 citation statements)
references
References 34 publications
0
15
0
Order By: Relevance
“…Two‐phase stratified sampling has been discussed/used by many authors, for example, 20‐23 ) to select subjects for the collection of additional data, for example, validation data for addressing measurement error problems. Stratification jointly by outcome and covariates, with sampling fractions chosen to enhance efficiency compared with stratification based on the outcome or covariates alone.…”
Section: Discussionmentioning
confidence: 99%
See 4 more Smart Citations
“…Two‐phase stratified sampling has been discussed/used by many authors, for example, 20‐23 ) to select subjects for the collection of additional data, for example, validation data for addressing measurement error problems. Stratification jointly by outcome and covariates, with sampling fractions chosen to enhance efficiency compared with stratification based on the outcome or covariates alone.…”
Section: Discussionmentioning
confidence: 99%
“…However, this method appears not robust to misspecification of the distribution relating the surrogate to the true outcome, Pbold-italicθfalse(Sfalse|Y,bold-italicXfalse)$$ {P}_{\boldsymbol{\theta}}\left(S|Y,\boldsymbol{X}\right) $$. Lotspeich et al 20 proposed a robust sieve maximum likelihood estimator employing a nonparametric model for the exposure's measurement errors. Although Lotspeich et al 20 found that logistic regression can be fairly robust to certain types of model misspecification, a proper specification of conditional probability Pbold-italicθfalse(Sfalse|Y,bold-italicXfalse)$$ {P}_{\boldsymbol{\theta}}\left(S|Y,\boldsymbol{X}\right) $$ was stated to be desirable in their proposed likelihood and discussed it as a limitation in the Discussion Section of the paper.…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations