2020
DOI: 10.1093/bioinformatics/btaa433
|View full text |Cite
|
Sign up to set email alerts
|

Cancer mutational signatures representation by large-scale context embedding

Abstract: Motivation The accumulation of somatic mutations plays critical roles in cancer development and progression. However, the global patterns of somatic mutations, especially non-coding mutations, and their roles in defining molecular subtypes of cancer have not been well characterized due to the computational challenges in analysing the complex mutational patterns. Results Here, we develop a new algorithm, called MutSpace, to ef… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

3
11
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 15 publications
(19 citation statements)
references
References 34 publications
3
11
0
Order By: Relevance
“…Nucleotide mutation is an important genetic process showing potentially causal association with diseases like cancer, autism spectrum disorder, and Alzheimer's disease (1)(2)(3)(4)(5)(6)(7). Single nucleotide mutation rate ("mutation rate") represents the probability of a single nucleotide mutating in an individual genome.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Nucleotide mutation is an important genetic process showing potentially causal association with diseases like cancer, autism spectrum disorder, and Alzheimer's disease (1)(2)(3)(4)(5)(6)(7). Single nucleotide mutation rate ("mutation rate") represents the probability of a single nucleotide mutating in an individual genome.…”
Section: Introductionmentioning
confidence: 99%
“…Predictive modeling of mutation rates and explaining the source of its genomewide variations is an essential goal in human genetics. This understanding is the key to studying evolutionary divergence between species, inferring ancestral states, detecting adaptive evolution (8)(9)(10), identifying functional elements in the genome and predicting deleterious single nucleotide variants in the human genome (1)(2)(3)8), and identifying disease subtypes (4)(5)(6)(7). Predicting the high variations in genome-wide mutation rates is known to be difficult.…”
Section: Introductionmentioning
confidence: 99%
“…In performance evaluation using the ENCODE ChIP-seq dataset, BindSpace achieved high classification performance even between paralogous TFs, which contain highly similar binding motifs. The second application is MutSpace , which is used to estimate the cancer types of patients from somatic mutation patterns [67] . This method regarded mutation patterns and cancer types as words and labels, respectively.…”
Section: Survey Of Representation Learning Applications In Sequence Analysismentioning
confidence: 99%
“…In performance evaluation using the EN-CODE ChIP-seq dataset, BindSpace achieved high classification performance even between paralogous TFs, which have highly similar binding motifs. The second application is MutSpace, which estimates the cancer types of patients from somatic mutation patterns [61]. This method regarded mutation patterns and cancer types as words and labels, respectively.…”
Section: Applications For Other Tasksmentioning
confidence: 99%