2016
DOI: 10.1002/sim.6949
|View full text |Cite
|
Sign up to set email alerts
|

Doubly robust survival trees

Abstract: Estimating a patient's mortality risk is important in making treatment decisions. Survival trees are a useful tool and employ recursive partitioning to separate patients into different risk groups. Existing 'loss based' recursive partitioning procedures that would be used in the absence of censoring have previously been extended to the setting of right censored outcomes using inverse probability censoring weighted estimators of loss functions. In this paper, we propose new 'doubly robust' extensions of these l… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

1
68
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
6
1

Relationship

2
5

Authors

Journals

citations
Cited by 40 publications
(69 citation statements)
references
References 26 publications
1
68
0
Order By: Relevance
“…It is implemented using the rpart package in R. Default tuning parameters in rpart are used apart from that the minimum number of observations in a terminal node is set to 30. To ensure that the positivity assumption holds empirically, a "Method 2" truncation as described in Steingrimsson et al (2016) is used within each terminal node. "Method 2" truncation within a terminal node modifies the data by setting the failure time indicator for the largest 10% of observations falling in each terminal node to one.…”
Section: Algorithms Estimating P (T ≥ T|w )mentioning
confidence: 99%
See 1 more Smart Citation
“…It is implemented using the rpart package in R. Default tuning parameters in rpart are used apart from that the minimum number of observations in a terminal node is set to 30. To ensure that the positivity assumption holds empirically, a "Method 2" truncation as described in Steingrimsson et al (2016) is used within each terminal node. "Method 2" truncation within a terminal node modifies the data by setting the failure time indicator for the largest 10% of observations falling in each terminal node to one.…”
Section: Algorithms Estimating P (T ≥ T|w )mentioning
confidence: 99%
“…IPCW estimators are inefficient as they fail to utilize information in censored observations. Using semiparametric efficiency theory for missing data, Steingrimsson et al (2016) developed a "doubly robust" estimator for the full data risk that is both more efficient and more robust to the modeling choices made than the IPCW loss. Steingrimsson et al (2017) proposed a class of censoring unbiased loss (CUL) functions that includes both the IPCW and the "doubly robust" losses as a special case.…”
Section: Introductionmentioning
confidence: 99%
“…The MSE is defined as 10001i=11000false(trueα^false(Xifalse)αfalse(Xifalse)false)2, where X i , i =1,…,1000 are the covariates from the test set. Proportion of correct trees: For a continuous covariate the probability of getting exactly the correct split point is zero. Following the work of Steingrimsson et al, we define a tree to be correct if it splits on all variables the correct number of times independently of the ordering or the selection of splitting point. Number of noise variables: The average number of times the tree splits on the noise variables ( X (1) − X (5) for the homogeneous treatment effect setting and X (2) − X (5) for the heterogeneous treatment effect setting). Pairwise prediction similarity: Let I T ( i , j ) and I M ( i , j ) be indicators whether participants i and j have the same prediction when run down the true tree and the fitted tree, respectively. Pairwise prediction similarity is defined as 1i=11000j>i1000|IT(i,j)IM(i,j)|010002. Hence, pairwise prediction similarity measures the ability of the algorithms to separate participants into different risk groups. Time: Average time in seconds it takes to build the models.…”
Section: Simulationsmentioning
confidence: 99%
“…For example, Davis and Anderson (1989) assumed that the survival time within any given node follows the exponential distribution; LeBlanc and Crowley (1992) adopted a proportional hazards model with an un-specified baseline hazard. More recently, the squared error loss commonly used in regression trees are extended to handle censored data (Molinaro et al, 2004;Steingrimsson et al, 2016Steingrimsson et al, , 2018. Other possible splitting criteria include a weighted sum of impurity of the censoring indicator and the squared error loss of the observed event time (Zhang, 1995), and the Harrell's C-statistic (Schmid et al, 2016).…”
Section: Introductionmentioning
confidence: 99%