Single-cell transcriptomic analysis is widely used to study human tumors. However it remains challenging to distinguish normal cell types in the tumor microenvironment from malignant cells and to resolve clonal substructure within the tumor. To address these challenges, we developed an integrative Bayesian segmentation approach called CopyKAT (Copynumber Karyotyping of Aneuploid Tumors) to estimate genomic copy number profiles at an average genomic resolution of 5Mb from read depth in high-throughput scRNA-seq data. We applied CopyKAT to analyze 46,501 single cells from 21 tumors, including triple-negative breast cancer, pancreatic ductal adenocarcinomas, anaplastic thyroid cancer, invasive ductal carcinoma and glioblastoma to accurately (98%) distinguish cancer cells from normal cell types. In three breast tumors, CopyKAT resolved clonal subpopulations that differed in the expression of cancer genes such as
KRAS
and signatures including EMT, DNA repair, apoptosis and hypoxia. These data show that CopyKAT can aid the analysis of scRNA-seq data in a variety of solid human tumors.
Ductal carcinoma in situ (DCIS) is an early-stage breast cancer that infrequently progresses to invasive ductal carcinoma (IDC). Genomic evolution has been difficult to delineate during invasion due to intratumor heterogeneity and the low number of tumor cells in the ducts. To overcome these challenges, we developed Topographic Single Cell Sequencing (TSCS) to measure genomic copy number profiles of single tumor cells while preserving their spatial context in tissue sections. We applied TSCS to 1,293 single cells from 10 synchronous patients with both DCIS and IDC regions in addition to exome sequencing. Our data reveal a direct genomic lineage between in situ and invasive tumor subpopulations and further show that most mutations and copy number aberrations evolved within the ducts prior to invasion. These results support a multiclonal invasion model, in which one or more clones escape the ducts and migrate into the adjacent tissues to establish the invasive carcinomas.
Summary
Our knowledge of copy number evolution during the expansion of primary breast tumors is limited
1
,
2
. To investigate this process, we developed a single cell, single-molecule DNA sequencing method and performed copy number analysis of 16,178 single cells from 8 triple-negative breast cancers (TNBCs) and 4 cell lines. Our data shows that breast tumors and cell lines are comprised of a large milieu of subclones (7–22) that are organized into a few (3–5) major superclones. Evolutionary analysis suggests that after clonal
TP53
mutations, multiple LOH events and genome doubling, there was a period of transient genomic instability followed by ongoing copy number evolution during the primary tumor expansion. By subcloning single daughter cells in culture, we show that tumor cells re-diversify their genomes and do not retain isogenic properties. These data show that TNBCs continue to evolve chromosome aberrations and maintain a reservoir of subclonal diversity during primary tumor growth.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.