We describe a comprehensive modeling approach to combining genomic and clinical data for personalized prediction in disease outcome studies. This integrated clinicogenomic modeling framework is based on statistical classification tree models that evaluate the contributions of multiple forms of data, both clinical and genomic, to define interactions of multiple risk factors that associate with the clinical outcome and derive predictions customized to the individual patient level. Gene expression data from DNA microarrays is represented by multiple, summary measures that we term metagenes; each metagene characterizes the dominant common expression pattern within a cluster of genes. A case study of primary breast cancer recurrence demonstrates that models using multiple metagenes combined with traditional clinical risk factors improve prediction accuracy at the individual patient level, delivering predictions more accurate than those made by using a single genomic predictor or clinical data alone. The analysis also highlights issues of communicating uncertainty in prediction and identifies combinations of clinical and genomic risk factors playing predictive roles. Implicated metagenes identify gene subsets with the potential to aid biological interpretation. This framework will extend to incorporate any form of data, including emerging forms of genomic data, and provides a platform for development of models for personalized prognosis. G enomic information, in the form of gene expression patterns, has an established capacity to define clinically relevant risk factors in disease prognosis. Recent studies have generated such patterns related to lymph node metastasis and disease recurrence in breast cancer (1-8), as well as in other cancers and disease contexts (9-16). The challenge now is the integration of such genomic information into prognostic models that can be applied in a clinical setting to improve the accuracy of treatment decisions.Achievement of this goal requires modeling approaches that focus on the generation of predictions for the individual patient and that can evaluate and combine multiple risk factors to produce informed predictions. Gene expression profiles may indeed prove to be powerful individual indicators of tumor behavior, but analysis should not force a choice of one form of data over the other; rather, analysis should evaluate and combine all forms of potentially relevant information. This integrative view underlies our development of clinicogenomic models and should underlie prognostic systems in support of personalized health planning.Consistent with this view, the example of breast cancer recurrence presented here highlights the predictive value of multiple genomic patterns in models defining accurate predictions at the individual patient level. This analysis uses integrative models that combine clinical and genomic factors, such as multiple gene expression patterns, clinical risk factors, and treatment information, and that predict recurrence for individual patients. The example shows im...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.