2022
DOI: 10.1002/prot.26441
|View full text |Cite
|
Sign up to set email alerts
|

A sequence‐based foldability score combined with AlphaFold2 predictions to disentangle the protein order/disorder continuum

Abstract: Order and disorder govern protein functions, but there is a great diversity in disorder, from regions that are-and stay-fully disordered to conditional order. This diversity is still difficult to decipher even though it is encoded in the amino acid sequences. Here, we developed an analytic Python package, named pyHCA, to estimate the foldability of a protein segment from the only information of its amino acid sequence and based on a measure of its density in regular secondary structures associated with hydroph… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
13
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
7

Relationship

1
6

Authors

Journals

citations
Cited by 13 publications
(13 citation statements)
references
References 117 publications
0
13
0
Order By: Relevance
“…30 A low pLDDT is considered to indicate low confidence of prediction and to correlate with higher disorder [39][40][41][42][43] and structural heterogeneity. [44][45][46][47] Accordingly, pLDDT has been used for large-scale studies of structural conformations. 39,42,43,48,49 Since de novo and random proteins are considered to be less compact, while containing secondary elements, and more disordered, 8,10,18 not only structure predictors but also appropriate disorder predictors are essential for computational analyses of their conformations.…”
Section: Introductionmentioning
confidence: 99%
“…30 A low pLDDT is considered to indicate low confidence of prediction and to correlate with higher disorder [39][40][41][42][43] and structural heterogeneity. [44][45][46][47] Accordingly, pLDDT has been used for large-scale studies of structural conformations. 39,42,43,48,49 Since de novo and random proteins are considered to be less compact, while containing secondary elements, and more disordered, 8,10,18 not only structure predictors but also appropriate disorder predictors are essential for computational analyses of their conformations.…”
Section: Introductionmentioning
confidence: 99%
“…Hydrophobic cluster analysis (HCA 35,36 ) was also used to assess the foldability of the analyzed sequences. 37,38 2.3 | Calcyanin detection and classification (Supplementary Figure 1) Sequences of calcyanins were detected in our dataset using a dedicated in-house tool called pCALF (standing for python CALcyanin Finder). pCALF uses four hidden Markov model (HMM) profiles describing the C-terminal glycine zipper triplication specific of calcyanins, called the (GlyZip) 3 motif.…”
Section: Annotation Of the Full-length Sequences (Figure 2b)mentioning
confidence: 99%
“…In the assessment by Akdel et al, around half the residues presented a low confidence (<70) score. In their recent work, Bruley et al , highlighted the possibility of “hidden order” cases, i.e., situations where low-confidence structural predictions are not related to disorder, but correspond to foldable domains that are not correctly predicted due to AF2 intrinsic limitations (such as a lack of coevolutionary information for the target sequence). In this case, one can combine AF2 predictions with an additional tool based on the residues physicochemical properties, such as their hydrophobicity in the case of the hydrophobic cluster analysis to unveil ordered segments that remain hidden from AF2.…”
Section: The Impact Of Ai-generated Models Beyond Molecular Replacementmentioning
confidence: 99%