2016
DOI: 10.1101/047795
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Tree inference for single-cell data

Abstract: Understanding the mutational heterogeneity within tumors is a keystone for the development of efficient cancer therapies. Here, we present SCITE, a stochastic search algorithm to identify the evolutionary history of a tumor from noisy and incomplete mutation profiles of single cells. SCITE comprises a flexible Markov chain Monte Carlo sampling scheme that allows the user to compute the maximum-likelihood mutation history, to sample from the posterior probability distribution, and to estimate the error rates of… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

4
206
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
6
3

Relationship

1
8

Authors

Journals

citations
Cited by 82 publications
(210 citation statements)
references
References 42 publications
4
206
0
Order By: Relevance
“…Methods address this challenge by using an evolutionary model to infer a phylogeny while simultaneously imputing missing data and correcting errors in the observed SNVs. Algorithms such as SCITE 21 , On-coNEM 22 , SciΦ 23 , and B-SCITE 24 use the simplest phylogenetic model for SNVs, the infinite sites model.…”
Section: Missionmentioning
confidence: 99%
See 1 more Smart Citation
“…Methods address this challenge by using an evolutionary model to infer a phylogeny while simultaneously imputing missing data and correcting errors in the observed SNVs. Algorithms such as SCITE 21 , On-coNEM 22 , SciΦ 23 , and B-SCITE 24 use the simplest phylogenetic model for SNVs, the infinite sites model.…”
Section: Missionmentioning
confidence: 99%
“…as well. Most existing methods 21,22,24,28,[31][32][33] for single-cell phylogeny inference discretize read counts into an observed mutation matrixB, using either two or three genotypes in addition to missing data (i.e, b v,a ∈ {0, 1, ?} orb v,a ∈ {00, 01, 11, ?}).…”
Section: Maximum Likelihood Loss-supported Refinement Problemmentioning
confidence: 99%
“…To account for the noise in the sequencing data, we develop a probabilistic model and an MCMC inference scheme for singlecell read counts (Methods). A major difference to MCMC schemes used for reconstructing trees of point mutations [22,18] is that the infinite sites assumption, which excludes the possibility of multiple mutational hits at the same genomic site, can no longer be made [19]. CNAs may overlap and nest inside each other, so the model developed here allows for arbitrary violations of the infinite sites assumption and arbitrary reoccurrences of amplifications and deletions across different genomic regions.…”
Section: Model Overviewmentioning
confidence: 99%
“…We collectively refer to these methods as "ITH methods" in the following. Subclonal reconstruction from single cell sequencing has emerged as a new field, simplifying part of the inference problem, but raising other issues, related to technical limitations (high dropout rate) and high cost, possibly a limitation to the availability of large cohorts [10,11,12,3].…”
Section: Introductionmentioning
confidence: 99%