2011
DOI: 10.1371/journal.pone.0028898
|View full text |Cite
|
Sign up to set email alerts
|

Non-Negative Matrix Factorization for Learning Alignment-Specific Models of Protein Evolution

Abstract: Models of protein evolution currently come in two flavors: generalist and specialist. Generalist models (e.g. PAM, JTT, WAG) adopt a one-size-fits-all approach, where a single model is estimated from a number of different protein alignments. Specialist models (e.g. mtREV, rtREV, HIVbetween) can be estimated when a large quantity of data are available for a single organism or gene, and are intended for use on that organism or gene only. Unsurprisingly, specialist models outperform generalist models, but in most… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
5
0

Year Published

2012
2012
2023
2023

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 13 publications
(5 citation statements)
references
References 30 publications
0
5
0
Order By: Relevance
“…The problem of performing a nonnegative matrix factorization is not always tractable, nevertheless numerical approximations of this task exist. This method is used in many fields of science, such as astronomy, bioinformatics and machine learning, as they study data with positive entries [3,4,17]. Furthermore, the nonnegative rank, which we define below, plays a critical role in communication theory as it characterizes randomized communication complexity and randomized correlation complexity [14,27].…”
Section: Nonnegative Factorizationmentioning
confidence: 99%
“…The problem of performing a nonnegative matrix factorization is not always tractable, nevertheless numerical approximations of this task exist. This method is used in many fields of science, such as astronomy, bioinformatics and machine learning, as they study data with positive entries [3,4,17]. Furthermore, the nonnegative rank, which we define below, plays a critical role in communication theory as it characterizes randomized communication complexity and randomized correlation complexity [14,27].…”
Section: Nonnegative Factorizationmentioning
confidence: 99%
“…Furthermore, since most CRESS DNA viruses have very few ORFs, often only a CP and a Rep, their sequence diversity contributes little to the overall Pfam database. While general models do increasingly well at describing the patterns dominated by genetic code constraints and the physiochemical properties of the amino acids (Murrell et al, 2011), there will always be opportunities for specific matrices to resolve protein evolution in biological entities that have more unique lifestyles and constraints, such as fast-evolving CRESS DNA viruses with single-stranded mutational biases (Cardinale et al, 2013;Frederico et al, 1990;Xia and Yuen, 2005).…”
Section: Relationships Among Other Cress Dna Viral Familiesmentioning
confidence: 99%
“…Acoustic levitator allows for studying in situ phase transformations because of its contact-less nature and ability to minimize external influences. The problem of finding the time-dependent concentration of intermediates of the nucleation process is equivalent to finding non-negative matrix factors of raw data of time-resolved Raman spectroscopy. Non-negative matrix factorization (NMF) is a linear algebraic method that finds applications in various scientific disciplines such as in the correction of astronomical spectroscopy, document clustering, and bioinformatics for latent component detection. The two resulting factors of NMF provide qualitative and quantitative contribution of a single feature (latent components) to a complex phenomenon. In order to compute these contributions, Weber et al developed a robust clustering method known as PCCA+.…”
Section: Introductionmentioning
confidence: 99%