2019
DOI: 10.1093/bioinformatics/btz266
|View full text |Cite
|
Sign up to set email alerts
|

Discovering novel mutation signatures by latent Dirichlet allocation with variational Bayes inference

Abstract: MotivationA cancer genome includes many mutations derived from various mutagens and mutational processes, leading to specific mutation patterns. It is known that each mutational process leads to characteristic mutations, and when a mutational process has preferences for mutations, this situation is called a ‘mutation signature.’ Identification of mutation signatures is an important task for elucidation of carcinogenic mechanisms. In previous studies, analyses with statistical approaches (e.g. non-negative matr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
13
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
6

Relationship

2
4

Authors

Journals

citations
Cited by 14 publications
(13 citation statements)
references
References 30 publications
0
13
0
Order By: Relevance
“…The original data contains 2780 samples for single nucleotide substitution (SBS) counts. We removed samples with few mutations (less than 400 mutations) as a preprocessing because the presence of samples with low-variant interferes with learning in LDA-based approaches [ 16 ], and we obtained 2653 samples. Figure 3 shows the number of samples contained in each tumor type.…”
Section: Resultsmentioning
confidence: 99%
See 4 more Smart Citations
“…The original data contains 2780 samples for single nucleotide substitution (SBS) counts. We removed samples with few mutations (less than 400 mutations) as a preprocessing because the presence of samples with low-variant interferes with learning in LDA-based approaches [ 16 ], and we obtained 2653 samples. Figure 3 shows the number of samples contained in each tumor type.…”
Section: Resultsmentioning
confidence: 99%
“…Figure 2 shows the graphical model of PLDA. A detailed notation of each parameter is provided in our previous study [ 16 ]. The vector is the parameter of K -dimensional multinomial distribution, indicating the signature activity of the s th sample in l th tumour type ( ).…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations