Background Durable remission has been observed in patients with relapsed or refractory (R/R) large B-cell lymphoma (LBCL) treated with chimeric antigen receptor (CAR) T-cell therapy. Consequently, hazard functions for overall survival (OS) are often complex, requiring the use of flexible methods for extrapolations. Objectives We aimed to retrospectively compare the predictive accuracy of different survival extrapolation methods and evaluate the validity of goodness-of-fit (GOF) criteria-based model selection for CAR T-cell therapies in R/R LBCL. Methods OS data were sourced from JULIET, ZUMA-1, and TRANSCEND NHL 001. Standard parametric, mixture cure, cubic spline, and mixture models were fit to multiple database locks (DBLs), with varying follow-up durations. GOF was assessed using the Akaike information criterion and Bayesian information criterion. Predictive accuracy was calculated as the mean absolute error (MAE) relative to OS observed in the most mature DBL. Results For all studies, mixture cure and cubic spline models provided the best predictive accuracy for the least mature DBL (MAE 0.013-0.085 and 0.014-0.128, respectively). The predictive accuracy of the standard parametric and mixture models showed larger variation (MAE 0.024-0.162 and 0.013-0.176, respectively). With increasing data maturity, the predictive accuracy of standard parametric models remained poor. Correlation between GOF criteria and predictive accuracy was low, particularly for the least mature DBL. Conclusions Our analyses demonstrated that mixture cure and cubic spline models provide the most accurate survival extrapolations of CAR T-cell therapies in LBCL. Furthermore, GOF should not be the only criteria used when selecting the optimal survival model.