2020
DOI: 10.1186/s12859-020-03772-3
|View full text |Cite
|
Sign up to set email alerts
|

Generating realistic null hypothesis of cancer mutational landscapes using SigProfilerSimulator

Abstract: Background Performing a statistical test requires a null hypothesis. In cancer genomics, a key challenge is the fast generation of accurate somatic mutational landscapes that can be used as a realistic null hypothesis for making biological discoveries. Results Here we present SigProfilerSimulator, a powerful tool that is capable of simulating the mutational landscapes of thousands of cancer genomes at different resolutions within seconds. Applying SigProfilerSimulator to 2144 whole-genome sequenced cancers … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
37
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
3

Relationship

3
5

Authors

Journals

citations
Cited by 34 publications
(37 citation statements)
references
References 21 publications
0
37
0
Order By: Relevance
“…The reported comparisons for SBS-96 scenarios rely on a cosine similarity ≥0.90 for determining TP signatures and <0.90 for determining FP signatures. Note that a cosine similarity ≥0.90 is highly unlikely to happen purely by chance (p-value = 5.90 x 10 -9 ) as two random nonnegative vectors are expected to have an average cosine similarity of 0.75 purely by chance 33 . Importantly, SigProfilerExtractor’s performance does not depend on the specific value of the cosine similarity threshold ( Figure 3 a ) as the tool consistently outperforms other bioinformatics approaches for almost any value of the threshold above 0.80 (p-value: 0.057).…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…The reported comparisons for SBS-96 scenarios rely on a cosine similarity ≥0.90 for determining TP signatures and <0.90 for determining FP signatures. Note that a cosine similarity ≥0.90 is highly unlikely to happen purely by chance (p-value = 5.90 x 10 -9 ) as two random nonnegative vectors are expected to have an average cosine similarity of 0.75 purely by chance 33 . Importantly, SigProfilerExtractor’s performance does not depend on the specific value of the cosine similarity threshold ( Figure 3 a ) as the tool consistently outperforms other bioinformatics approaches for almost any value of the threshold above 0.80 (p-value: 0.057).…”
Section: Resultsmentioning
confidence: 99%
“…Cosine similarity was used to compare the profiles of different mutational signatures. P-values can be attributed to cosine similarities based on a null hypothesis of uniform random distribution of nonnegative vectors 33 .…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The remaining set of somatic mutations were used in the subsequent analyses and evaluation for mutational signatures. Analysis of mutational signatures was performed using our previously derived set of reference mutational signatures 33 as well as our previously established methodology with the SigProfiler suite of tools used for summarization, simulation, visualization, and extraction of mutational signatures 5456 .…”
Section: Methodsmentioning
confidence: 99%
“…Mutation simulation was performed by SigProfilerSimulator 50 to preclude the bias of tri-nucleotide composition which could affect the mutation distribution in local regions. Briefly, the total number of simulated mutations for each sample is equal to the observed mutations, but the position of the mutation is relocated according to the frequency of tri-nucleotide context along the given region.…”
Section: Methodsmentioning
confidence: 99%