Background: To revise the American Joint Committee on Cancer TNM staging system for colorectal cancer (CRC) based on a nomogram analysis of Surveillance, Epidemiology, and End Results (SEER) database, and to prove the rationality of enhancing T stage's weighting in our previously proposed T-plus staging system. Methods: Total 115,377 non-metastatic CRC patients from SEER were randomly grouped as training and testing set by ratio 1:1. The Nomo-staging system was established via three nomograms based on 1-year, 2-year and 3-year disease specific survival (DSS) Logistic regression analysis of the training set. The predictive value of Nomo-staging system for the testing set was evaluated by concordance index (c-index), likelihood ratio (L.R.) and Akaike information criteria (AIC) for 1-year, 2-year, 3-year overall survival (OS) and DSS. Kaplan-Meier survival curve was used to valuate discrimination and gradient monotonicity. And an external validation was performed on database from the Second Affiliated Hospital of Zhejiang University (SAHZU).
Background: Colon cancer is common worldwide and is the leading cause of cancer-related death. Multiple levels of omics data are available due to the development of sequencing technologies. In this study, we proposed an integrative prognostic model for colon cancer based on the integration of clinical and multi-omics data. Methods: In total, 344 patients were included in this study. Clinical, gene expression, DNA methylation and miRNA expression data were retrieved from The Cancer Genome Atlas (TCGA). To accommodate the high dimensionality of omics data, unsupervised clustering was used as dimension reduction method. The bias-corrected Harrell's concordance index was used to verify which clustering result provided the best prognostic performance. Finally, we proposed a prognostic prediction model based on the integration of clinical data and multi-omics data. Uno's concordance index with cross-validation was used to compare the discriminative performance of the prognostic model constructed with different covariates. Results: Combinations of clinical and multi-omics data can improve prognostic performance, as shown by the increase of the bias-corrected Harrell's concordance of the prognostic model from 0.7424 (clinical features only) to 0.7604 (clinical features and three types of omics features). Additionally, 2-year, 3-year and 5-year Uno's concordance statistics increased from 0.7329, 0.7043, and 0.7002 (clinical features only) to 0.7639, 0.7474 and 0.7597 (clinical features and three types of omics features), respectively. Conclusion:In conclusion, this study successfully combined clinical and multi-omics data for better prediction of colon cancer prognosis.
The survival risk following curative surgery for nonmetastatic colorectal cancer (CRC) may be over‐ or underestimated due to a lack of attention to nonlinear effects and violation of the proportional hazards assumption. In this paper, we aimed to detect and interpret the shape of time‐dependent and nonlinear effects to improve the predictive performance of models of prognoses in nonmetastatic CRC patients. Data for nonmetastatic CRC patients diagnosed between 2004 and 2012 were obtained from the Surveillance Epidemiology End Results registry. Time‐dependent and nonlinear effects were tested and plotted. A nonlinear model that used random survival forests was implemented. The estimated 5‐year cancer‐specific death rate was 17.95% (95% CI, 17.70–18.20%). Tumor invasion depth, lymph node status, age at diagnosis, tumor grade, histology and tumor site were significantly associated with cancer‐specific death. Nonlinear and time‐dependent effects on survival were detected. Positive lymph node number had a larger effect per unit of measurement at low values than at high values, whereas age at diagnosis showed the opposite pattern. Moreover, nonproportional hazards were detected for all covariates, indicating that the contributions of these risks to survival outcomes decreased over time. The nonlinear model predicted prognoses more accurately (C‐index: 0.7934, 0.7933–0.7934) than did the Fine and Gray model (C‐index: 0.7550, 0.7510–0.7583). The three‐dimensional cumulative incidence curves derived from nonlinear model were used to identify the change points of the risk trends. It would be useful to implement these findings in treatment plans and follow‐up surveillance in nonmetastatic CRC patients.
BackgroundAn increasing number of studies have identified spatial differences in colorectal cancer survival. However, little is known about the spatially varying effects of predictors in survival prediction modeling studies of colorectal cancer that have focused on estimating the absolute survival risk for patients from a wide range of populations. This study aimed to demonstrate the spatially varying effects of predictors of survival for nonmetastatic colorectal cancer patients.MethodsPatients diagnosed with nonmetastatic colorectal cancer from 2004 to 2013 who were followed up through the end of 2013 were extracted from the Surveillance Epidemiology End Results registry (Patients: 128061). The log-rank test and the restricted mean survival time were used to evaluate survival outcome differences among spatial clusters corresponding to a widely used clinical predictor: stage determined by AJCC 7th edition staging system. The heterogeneity test, which is used in meta-analyses, revealed the spatially varying effects of single predictors. Then, considering the above predictors in a standard survival prediction model based on spatially clustered data, the spatially varying coefficients of these models revealed that some covariate effects may not be constant across the geographic regions of the study. Then, two types of survival prediction models (a statistical model and a machine learning model) were built; these models considered the predictors and enabled survival prediction for patients from a wide range of geographic regions.ResultsBased on univariate and multivariate analysis, some prognostic factors, such as “TNM stage”, “tumor size” and “age at diagnosis,” have significant spatially varying effects among different regions. When considering these spatially varying effects, machine learning models have fewer assumption constraints (such as proportional hazard assumptions) and better predictive performance compared with statistical models. Upon comparing the concordance indexes of these two models, the machine learning model was found to be more accurate (0.898[0.895,0.902]) than the statistical model (0.732 [0.726, 0.738]).ConclusionsBased on this study, it’s recommended that the spatially varying effect of predictors should be considered when building survival prediction models involving large-scale and multicenter research data. Machine learning models that are not limited by the requirement of a statistical hypothesis are promising alternative models.Electronic supplementary materialThe online version of this article (10.1186/s12885-018-4985-2) contains supplementary material, which is available to authorized users.
The POPCORN system can build prediction models that perform well in complex clinical application scenarios and can provide effective decision support for individual patient prognostic predictions.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.