We present SIEVE, a statistical method for the joint inference of somatic variants and cell phylogeny under the finite-sites assumption from single-cell DNA sequencing. SIEVE leverages raw read counts for all nucleotides and corrects the acquisition bias of branch lengths. In our simulations, SIEVE outperforms other methods in phylogenetic reconstruction and variant calling accuracy, especially in the inference of homozygous variants. Applying SIEVE to three datasets, one for triple-negative breast (TNBC), and two for colorectal cancer (CRC), we find that double mutant genotypes are rare in CRC but unexpectedly frequent in the TNBC samples.
Single-cell DNA sequencing (scDNA-seq) has enabled the identification of single nucleotide somatic variants and the reconstruction of cell phylogenies. However, statistical phylogenetic models for cell phylogeny reconstruction from raw sequencing data are still in their infancy. Here we present SIEVE (SIngle-cell EVolution Explorer), a statistical method for the joint inference of somatic variants and cell phylogeny under the finite-sites assumption from scDNA-seq reads. SIEVE leverages raw read counts for all nucleotides at candidate variant sites, and corrects the acquisition bias of branch lengths. In our simulations, SIEVE outperforms other methods both in phylogenetic accuracy and variant calling accuracy. We apply SIEVE to three scDNA-seq datasets, for colorectal (CRC) and triple-negative breast cancer (TNBC), one of them generated by us. On simulated data, SIEVE reliably infers homo- and heterozygous somatic variants. The analysis of real data uncovers that double mutant genotypes are rare in CRC but unexpectedly frequent in TNBC samples.
Discovering synthetic lethal (SL) gene partners of cancer genes is an important step in developing cancer therapies. However, identification of SL interactions is challenging, due to a large number of possible gene pairs, inherent noise and confounding factors in the observed signal. To discover robust SL interactions, we devised SLIDE-VIP, a novel framework combining eight statistical tests, including a new patient data-based test iSurvLRT. SLIDE-VIP leverages multi-omics data from four different sources: gene inactivation cell line screens, cancer patient data, drug screens and gene pathways. We applied SLIDE-VIP to discover SL interactions between genes involved in DNA damage repair, chromatin remodeling and cell cycle, and their potentially druggable partners. The top 883 ranking SL candidates had strong evidence in cell line and patient data, 250-fold reducing the initial space of 200K pairs. Drug screen and pathway tests provided additional corroboration and insights into these interactions. We rediscovered well-known SL pairs such as RB1 and E2F3 or PRKDC and ATM, and in addition, proposed strong novel SL candidates such as PTEN and PIK3CB. In summary, SLIDE-VIP opens the door to the discovery of SL interactions with clinical potential. All analysis and visualizations are available via the online SLIDE-VIP WebApp.
Discovering synthetic lethal (SL) gene partners of cancer genes is an important step in developing cancer therapies. However, identification of SL interactions is challenging, due to a large number of possible gene pairs, inherent noise and confounding factors in the observed signal. To discover robust SL interactions, we devised SLIDE-VIP, a novel framework combining eight statistical tests, including a new patient data-based test iSurvLRT. SLIDE-VIP leverages multi-omics data from four different sources: gene inactivation cell line screens, cancer patient data, drug screens and gene pathways. We applied SLIDE-VIP to discover SL interactions between genes involved in DNA damage repair, chromatin remodeling and cell cycle, and their potentially druggable partners. The top 883 ranking SL candidates had strong evidence in cell line and patient data, 250-fold reducing the initial space of 200K pairs. Drug screen and pathway tests provided additional corroboration and insights into these interactions. We rediscovered well-known SL pairs such as RB1 and E2F3 or PRKDC and ATM, and in addition, proposed strong novel SL candidates such as PTEN and PIK3CB. In summary, SLIDE-VIP opens the door to the discovery of SL interactions with clinical potential. All analysis and visualizations are available via the online SLIDE-VIP WebApp.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.