2020
DOI: 10.1038/s41598-020-59006-2
|View full text |Cite
|
Sign up to set email alerts
|

Power and pitfalls of computational methods for inferring clone phylogenies and mutation orders from bulk sequencing data

Abstract: tumors harbor extensive genetic heterogeneity in the form of distinct clone genotypes that arise over time and across different tissues and regions in cancer. Many computational methods produce clone phylogenies from population bulk sequencing data collected from multiple tumor samples from a patient. These clone phylogenies are used to infer mutation order and clone origins during tumor progression, rendering the selection of the appropriate clonal deconvolution method critical. Surprisingly, absolute and rel… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

2
16
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3

Relationship

2
6

Authors

Journals

citations
Cited by 23 publications
(18 citation statements)
references
References 74 publications
2
16
0
Order By: Relevance
“…This is also the case when n = 1000 and m = 300 when false negative rate is 0.05, however PhISCS fails to report solutions for some instances when false negative rate is 0.2. When m = 1000 the performance of SCITE (with a 24-hour time limit) under both measures falls substantially 4 Examples include the number of ancestor-descendant and different-lineage relationships shared across the trees compared, as well as more sophisticated measures such as the Multi-Labeled Tree Edit Distance and others [24,39,40,41,42]. 5 which is important especially since many methods build trees based on mutations shared by at least two cells or pre-cluster cells based on their mutational profiles 6 Another parameter used in our simulations is the missing value rate γ, which determines the fraction of entries in the genotype matrix which are unknown.…”
Section: Benchmarking Of Methods For Reconstructing Mutation Trees Of Tumor Progression On Simulated Datamentioning
confidence: 99%
See 1 more Smart Citation
“…This is also the case when n = 1000 and m = 300 when false negative rate is 0.05, however PhISCS fails to report solutions for some instances when false negative rate is 0.2. When m = 1000 the performance of SCITE (with a 24-hour time limit) under both measures falls substantially 4 Examples include the number of ancestor-descendant and different-lineage relationships shared across the trees compared, as well as more sophisticated measures such as the Multi-Labeled Tree Edit Distance and others [24,39,40,41,42]. 5 which is important especially since many methods build trees based on mutations shared by at least two cells or pre-cluster cells based on their mutational profiles 6 Another parameter used in our simulations is the missing value rate γ, which determines the fraction of entries in the genotype matrix which are unknown.…”
Section: Benchmarking Of Methods For Reconstructing Mutation Trees Of Tumor Progression On Simulated Datamentioning
confidence: 99%
“… 3 Examples include the number of ancestor-descendant and different-lineage relationships shared across the trees compared, as well as more sophisticated measures such as the Multi-Labeled Tree Edit Distance and others [23, 37, 38, 39, 40]. …”
mentioning
confidence: 99%
“…The evolution of a tumor is typically described by a phylogenetic tree, or phylogeny, whose leaves represent the cells observed at the present time and whose internal nodes represent ancestral cells. Tumor phylogenies are challenging to reconstruct using DNA sequencing data from bulk tumor samples, since this data contains mixtures of mutations from thousandsmillions of heterogeneous cells in the sample [3][4][5][6][7][8][9][10][11][12][13][14][15] . Recently, single-cell DNA sequencing (scDNA-seq) of tumors has become more common, and new technologies such as those from 10X Genomics 16…”
Section: Introductionmentioning
confidence: 99%
“…By the way, in the above, we assumed that the clone phylogeny is known. In empirical data analysis, one needs to generate it using available computational tools for bulk and single-cell sequencing methods; see reviews in the accuracy of methods (Miura et al, 2018a; Miura et al, 2018b; Miura et al, 2020). The errors in the collection of variants for each branch will lead to false-negative detection of signatures due to diluted signals caused by incorrect variants and correct variants that are not assigned to a branch.…”
Section: Methodsmentioning
confidence: 99%
“…Tumor cells accumulate somatic mutations during cancer progression, in which cells exhibit dynamic demography, including emergence, expansion, and extinction (Bailey et al, 2018; Martincorena & Campbell, 2015). Through the analysis of genomic variation, researchers now routinely reconstruct mutational histories and phylogenies of clones (Brown et al, 2017; El-Kebir et al, 2018; Miura et al, 2020; Turajlic et al, 2018; Zhao et al, 2016). In a clone phylogeny, variants can be localized to individual branches and relative frequencies of different variant types compared across lineages to detect shifts in cellular mutational processes over time.…”
Section: Introductionmentioning
confidence: 99%