2016
DOI: 10.1002/pro.2920
|View full text |Cite
|
Sign up to set email alerts
|

Intermediate divergence levels maximize the strength of structure–sequence correlations in enzymes and viral proteins

Abstract: Structural properties such as solvent accessibility and contact number predict site-specific sequence variability in many proteins. However, the strength and significance of these structuresequence relationships vary widely among different proteins, with absolute correlation strengths ranging from 0 to 0.8. In particular, two recent works have made contradictory observations. Yeh et al. (Mol. Biol. Evol. 31:135-139, 2014) found that both relative solvent accessibility (RSA) and weighted contact number (WCN) … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

1
8
0

Year Published

2016
2016
2017
2017

Publication Types

Select...
3
2

Relationship

4
1

Authors

Journals

citations
Cited by 8 publications
(9 citation statements)
references
References 63 publications
1
8
0
Order By: Relevance
“…Interestingly, this result contrasts somewhat with identified 335 predictors of protein substitution rate, where protein design has had minimal predictive 336 ability compared to WCN or RSA [31,32]. For deletions, although WCN was still a 337 better single predictor than was protein design mean score, including protein design in 338 multi-predictor models led to consistent model improvement (Tables 1 and 2).…”
mentioning
confidence: 90%
“…Interestingly, this result contrasts somewhat with identified 335 predictors of protein substitution rate, where protein design has had minimal predictive 336 ability compared to WCN or RSA [31,32]. For deletions, although WCN was still a 337 better single predictor than was protein design mean score, including protein design in 338 multi-predictor models led to consistent model improvement (Tables 1 and 2).…”
mentioning
confidence: 90%
“…Interestingly, this result contrasts somewhat with identified predictors of protein substitution rate, where protein design has had minimal predictive ability compared to WCN or RSA [33, 34]. For deletions, although WCN was still a better single predictor than was protein design mean score, including protein design in multi-predictor models led to consistent model improvement (Tables 1 and 2).…”
Section: Discussionmentioning
confidence: 74%
“…Studies which have examined the correlations between site-specific dN/dS, or conversely amino-acid level evolutionary rates, and such structural quantities have recovered correlations with strengths widely ranging from 0.1-0.8 (Shih and Hwang 2012;Huang et al 2014;Yeh et al 2014b,a;Shahmoradi et al 2014;Meyer and Wilke 2015b,c;Jackson et al 2016). Recent work by Jackson et al (2016) has attributed the source of this wide range of correlations directly to the extent of sequence divergence present in a given dataset, such that more diverged datasets display higher correlations and less diverged datasets display lower correlations. Our findings that increased divergence levels contribute strongly to dN/dS inference accuracy are fully consistent with those of Jackson et al (2016).…”
Section: Discussionmentioning
confidence: 99%
“…Recent work by Jackson et al (2016) has attributed the source of this wide range of correlations directly to the extent of sequence divergence present in a given dataset, such that more diverged datasets display higher correlations and less diverged datasets display lower correlations. Our findings that increased divergence levels contribute strongly to dN/dS inference accuracy are fully consistent with those of Jackson et al (2016). Therefore, we suggest that future work examining the relationship between protein evolutionary rate and structure should focus on datasets with intermediate-to-high divergences, which are most likely to provide meaningful information about long-term evolutionary constraints.…”
Section: Discussionmentioning
confidence: 99%