2018
DOI: 10.1101/381483
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Application of an Interpretable Classification Model on Early Folding Residues during Protein Folding

Abstract: Background: Machine learning strategies are prominent tools for data analysis. Especially in life sciences, they have become increasingly important to handle the growing datasets collected by the scientific community. Meanwhile, algorithms improve in performance, but also gain complexity, and tend to neglect interpretability and comprehensiveness of the resulting models.Results: Generalized Matrix Learning Vector Quantization (GMLVQ) is a supervised, prototype-based machine learning method and provides compreh… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2020
2020
2020
2020

Publication Types

Select...
1
1

Relationship

2
0

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 52 publications
0
3
0
Order By: Relevance
“…It is conceivable that the discrimination between glutamic and aspartic acid was based on tertiary contacts between secondary structures elements and size selectivity rather than on specific side chain interactions 82 . The recent identification of a protein folding motif 83 strengthens this assumption. This is further supported by the observation that ancient proteins, based on a limited set of amino acids, were still capable to exhibit secondary structures 81 , 84 , 85 .…”
Section: Discussionmentioning
confidence: 82%
“…It is conceivable that the discrimination between glutamic and aspartic acid was based on tertiary contacts between secondary structures elements and size selectivity rather than on specific side chain interactions 82 . The recent identification of a protein folding motif 83 strengthens this assumption. This is further supported by the observation that ancient proteins, based on a limited set of amino acids, were still capable to exhibit secondary structures 81 , 84 , 85 .…”
Section: Discussionmentioning
confidence: 82%
“…Contact maps can be used as a set of constraints for ab initio structure prediction. They show predicted contacts based on co-evolving residues which imply their spatial proximity and are derived from a multiple sequence alignment [6].…”
Section: /14mentioning
confidence: 99%
“…GMLVQ is known to be robust and easy to interpret [46,58]. In biomedical context it was successfully applied to analyse flow cytometry data and to detect early folding residues during protein folding [4,7].…”
Section: /14mentioning
confidence: 99%