2022
DOI: 10.1002/sim.9576
|View full text |Cite
|
Sign up to set email alerts
|

Optimal subsampling for parametric accelerated failure time models with massive survival data

Abstract: With increasing availability of massive survival data, researchers need valid statistical inferences for survival modeling whose computation is not limited by computer memories. Existing works focus on relative risk models using the online updating and divide-and-conquer strategies. The subsampling strategy has not been available due to challenges in developing the asymptotic properties of the estimator under semiparametric models with censored data. This article tackles optimal subsampling algorithms to fast … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5

Relationship

1
4

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 20 publications
0
4
0
Order By: Relevance
“…This approach has been extended to a variety of statistical models such as generalized linear models (Ai et al, 2021) and quantile regression models (Wang and Ma, 2021). In the field of survival analysis, this approach has been developed for Cox models (Zhang et al, 2022), Cox models with rare events (Keret and Gorfine, 2022), additive hazard rate models (Zuo et al, 2021), and parametric AFT models (Yang et al, 2022). To the best of our knowledge, however, no prior work has explored its application to semi-parametric AFT models for massive survival data.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…This approach has been extended to a variety of statistical models such as generalized linear models (Ai et al, 2021) and quantile regression models (Wang and Ma, 2021). In the field of survival analysis, this approach has been developed for Cox models (Zhang et al, 2022), Cox models with rare events (Keret and Gorfine, 2022), additive hazard rate models (Zuo et al, 2021), and parametric AFT models (Yang et al, 2022). To the best of our knowledge, however, no prior work has explored its application to semi-parametric AFT models for massive survival data.…”
Section: Introductionmentioning
confidence: 99%
“…In the presence of censoring, the key challenge is to derive the optimal subsampling probability (SSP) for censored observations. The SSP of an observation is proportional to its contribution to the estimating functions in standard approaches (Zhang et al, 2022;Yang et al, 2022). For the rank-based method, it is tempting to assign a zero SSP to censored observations since they do not contribute to the estimating functions for the regression coefficients.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Popular parametric distributions for survival times include the Weibull distribution, the Gompertz distribution, the lognormal distribution, the loglogistic distribution, among many others 1,28‐31 . Recently, a Weibull model is applied 32 to a very large lymphoma data set (prefix≈106$$ \approx 1{0}^6 $$ records) using the online updating and divide‐and‐conquer strategies. A parametric model provides a feasible solution to analyze such massive survival data which are increasingly available with the rapid development of surveillance and storage technologies.…”
Section: Introductionmentioning
confidence: 99%