Subclonal architectures are prevalent across cancer types. However, the temporal evolutionary dynamics that produce tumor subclones remain unknown. Here we measure clone dynamics in human cancers by using computational modeling of subclonal selection and theoretical population genetics applied to high-throughput sequencing data. Our method determined the detectable subclonal architecture of tumor samples and simultaneously measured the selective advantage and time of appearance of each subclone. We demonstrate the accuracy of our approach and the extent to which evolutionary dynamics are recorded in the genome. Application of our method to high-depth sequencing data from breast, gastric, blood, colon and lung cancer samples, as well as metastatic deposits, showed that detectable subclones under selection, when present, consistently emerged early during tumor growth and had a large fitness advantage (>20%). Our quantitative framework provides new insight into the evolutionary trajectories of human cancers and facilitates predictive measurements in individual tumors from widely available sequencing data.
The vast majority of cancer next-generation sequencing data consist of bulk samples composed of mixtures of cancer and normal cells. To study tumor evolution, subclonal reconstruction approaches based on machine learning are used to separate subpopulations of cancer cells and reconstruct their ancestral relationships. However, current approaches are entirely data-driven and agnostic to evolutionary theory. We demonstrate that systematic errors occur in subclonal reconstruction if tumor evolution is not accounted for, and that those errors increase when multiple samples are taken from the same tumor. To address this issue, we present a novel approach for model-based subclonal reconstruction that combines data-driven machine learning with evolutionary theory. Using public, synthetic and newly generated data, we show the method is more robust and accurate than current techniques in both single-sample and multi-region sequencing data. With careful data curation and interpretation, we show how the method minimizes the confounding factors that affect non-evolutionary methods, leading to a more accurate recovery of the evolutionary history of human tumors.
Sequential profiling of plasma cell-free DNA (cfDNA) holds immense promise for early detection of patient progression. However, how to exploit the predictive power of cfDNA as a liquid biopsy in the clinic remains unclear. RAS pathway aberrations can be tracked in cfDNA to monitor resistance to anti-EGFR monoclonal antibodies in patients with metastatic colorectal cancer. In this prospective phase II clinical trial of single-agent cetuximab in wild-type patients, we combine genomic profiling of serial cfDNA and matched sequential tissue biopsies with imaging and mathematical modeling of cancer evolution. We show that a significant proportion of patients defined as wild-type based on diagnostic tissue analysis harbor aberrations in the RAS pathway in pretreatment cfDNA and, in fact, do not benefit from EGFR inhibition. We demonstrate that primary and acquired resistance to cetuximab are often of polyclonal nature, and these dynamics can be observed in tissue and plasma. Furthermore, evolutionary modeling combined with frequent serial sampling of cfDNA allows prediction of the expected time to treatment failure in individual patients. This study demonstrates how integrating frequently sampled longitudinal liquid biopsies with a mathematical framework of tumor evolution allows individualized quantitative forecasting of progression, providing novel opportunities for adaptive personalized therapies. Liquid biopsies capture spatial and temporal heterogeneity underpinning resistance to anti-EGFR monoclonal antibodies in colorectal cancer. Dense serial sampling is needed to predict the time to treatment failure and generate a window of opportunity for intervention.
Quantification of the effect of spatial tumour sampling on the patterns of mutations detected in next-generation sequencing data is largely lacking. Here we use a spatial stochastic cellular automaton model of tumour growth that accounts for somatic mutations, selection, drift and spatial constraints, to simulate multi-region sequencing data derived from spatial sampling of a neoplasm. We show that the spatial structure of a solid cancer has a major impact on the detection of clonal selection and genetic drift from both bulk and single-cell sequencing data. Our results indicate that spatial constraints can introduce significant sampling biases when performing multi-region bulk sampling and that such bias becomes a major confounding factor for the measurement of the evolutionary dynamics of human tumours. We also propose a statistical inference framework that incorporates spatial effects within a growing tumour and so represents a further step forwards in the inference of evolutionary dynamics from genomic data. Our analysis shows that measuring cancer evolution using next-generation sequencing while accounting for the numerous confounding factors remains challenging. However, mechanistic model-based approaches have the potential to capture the sources of noise and better interpret the data.
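To make the idea of a spatial stochastic cellular automaton concrete, the following is a minimal sketch in Python, not the authors' model. It assumes a 2D square lattice, a four-site (von Neumann) neighbourhood, neutral growth with no selection or cell death, and exactly one new mutation per daughter cell at each division. The names `grow_tumour` and `region_frequencies` are hypothetical and chosen only for illustration. The sketch shows how a cell can divide only into an empty neighbouring site, and how the mutation frequencies seen in a sample depend on where the sample is taken.

```python
import random
from collections import Counter

# Four-site neighbourhood (an assumption; the abstract does not specify one).
NEIGHBOURS = [(1, 0), (-1, 0), (0, 1), (0, -1)]


def grow_tumour(size=41, target_cells=400, seed=0):
    """Grow a neutral tumour on a size x size lattice.

    Returns a dict mapping (x, y) -> tuple of mutation ids carried by that cell.
    Mutation 0 is carried by the founder cell and is therefore clonal.
    """
    if target_cells > size * size:
        raise ValueError("target_cells cannot exceed the number of lattice sites")
    rng = random.Random(seed)
    centre = size // 2
    grid = {(centre, centre): (0,)}
    occupied = [(centre, centre)]
    next_id = 1
    while len(grid) < target_cells:
        x, y = rng.choice(occupied)
        empty = [
            (x + dx, y + dy)
            for dx, dy in NEIGHBOURS
            if 0 <= x + dx < size
            and 0 <= y + dy < size
            and (x + dx, y + dy) not in grid
        ]
        if not empty:
            continue  # spatial constraint: a surrounded cell cannot divide
        site = rng.choice(empty)
        parent = grid[(x, y)]
        # Both daughters inherit the parent's mutations plus one new one each.
        grid[(x, y)] = parent + (next_id,)
        grid[site] = parent + (next_id + 1,)
        next_id += 2
        occupied.append(site)
    return grid


def region_frequencies(grid, x0, y0, width):
    """Fraction of cells in a square region that carry each mutation."""
    cells = [
        muts
        for (x, y), muts in grid.items()
        if x0 <= x < x0 + width and y0 <= y < y0 + width
    ]
    if not cells:
        return {}
    counts = Counter(m for muts in cells for m in muts)
    return {m: c / len(cells) for m, c in counts.items()}


if __name__ == "__main__":
    tumour = grow_tumour()
    left = region_frequencies(tumour, 0, 0, 20)
    right = region_frequencies(tumour, 21, 0, 20)
    # Mutations found in only one of the two spatially separated regions.
    print(len(set(left) ^ set(right)), "mutations are private to one region")
```

Comparing `region_frequencies` for two distant regions gives a rough feel for the sampling bias the abstract describes: even under neutral growth, each region carries mutations that the other lacks. Adding selection, cell death or a sequencing-noise model would require further assumptions that this sketch does not make.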