2018
DOI: 10.1101/326918
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

The EVcouplings Python framework for coevolutionary sequence analysis

Abstract: Summary: Coevolutionary sequence analysis has become a commonly used technique for de novo prediction of the structure and function of proteins, RNA, and protein complexes. This approach requires extensive computational pipelines that integrate multiple tools, databases, and data processing steps. We present the EVcouplings framework, a fully integrated open-source application and Python package for coevolutionary analysis. The framework enables generation of sequence alignments, calculation and evaluation of … Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
111
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 67 publications
(112 citation statements)
references
References 25 publications
1
111
0
Order By: Relevance
“…There were three evolutionary covariation methods taking into consideration IMPContact, namely, ELSC [40], MI [41], and OMES [42]. In spite of the many similar methods providing computational tools, e.g., EVFOLD [43], which obtained an even higher accuracy on covariation, those 3 methods are more convenient to realize an easy-to-use particular IHRC method for the users. For the purpose of discovering the covariation residue pairs from the multiple alignment result, ELSC (Explicit Likelihood of Subset Covariation) uses combinatorial arguments to realize a perturbative algorithm [44].…”
Section: Resultsmentioning
confidence: 99%
“…There were three evolutionary covariation methods taking into consideration IMPContact, namely, ELSC [40], MI [41], and OMES [42]. In spite of the many similar methods providing computational tools, e.g., EVFOLD [43], which obtained an even higher accuracy on covariation, those 3 methods are more convenient to realize an easy-to-use particular IHRC method for the users. For the purpose of discovering the covariation residue pairs from the multiple alignment result, ELSC (Explicit Likelihood of Subset Covariation) uses combinatorial arguments to realize a perturbative algorithm [44].…”
Section: Resultsmentioning
confidence: 99%
“…We will also consider permanently including "coevolution-based" features into our algorithm, by either querying DBs of precomputed scores, as done with EVmutation, or by implementing interfaces to external libraries for Direct Coupling Analysis (DCA) [EVcouplings (Hopf et al, 2018), PconsC4 (Michel et al, 2018), Gremlin (Ovchinnikov et al, 2014), plmDCA (Ekeberg et al, 2014)]. A direct implementation would allow for much more flexibility in both fine-tuning the feature definitions, for instance by introducing normalizations that were shown to improve their predictive power (Feinauer and Weigt, 2017), and in the selection of MSAs for DCA calculations.…”
Section: Discussionmentioning
confidence: 99%
“…We resorted to an unsupervised method for predicting the mutation effects that specifically captures residue dependencies between positions. On Python 3 we deployed EVcoupling to assess the quantitative effects of mutations in RBD (23). Deep mutational scan was performed to understand the effects of substitution mutations in RBD and generate the mutational landscape.…”
Section: …… (2)mentioning
confidence: 99%
“…Z is to normalize the distribution to sum to one over all possible sequences of a fixed length. E(σ) was represented as (negative) energy of a model from statistical physics or as proportional to the scaled fitness N e F in toy, equilibrium models of population genetics (23). We used the energy function E(σ) with two types of constraints viz.…”
Section: …… (2)mentioning
confidence: 99%