2013
DOI: 10.3390/ijms140714892
|View full text |Cite
|
Sign up to set email alerts
|

Structure Prediction of Partial-Length Protein Sequences

Abstract: Protein structure information is essential to understand protein function. Computational methods to accurately predict protein structure from the sequence have primarily been evaluated on protein sequences representing full-length native proteins. Here, we demonstrate that top-performing structure prediction methods can accurately predict the partial structures of proteins encoded by sequences that contain approximately 50% or more of the full-length protein sequence. We hypothesize that structure prediction m… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
8
0

Year Published

2015
2015
2020
2020

Publication Types

Select...
5

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(8 citation statements)
references
References 40 publications
0
8
0
Order By: Relevance
“…The tool that was used in this study to predict the protein structures of the partial proteins was I-TASSER, since it was reported to reliably predict partial structures of a source protein [ 23 ]. I-TASSER initiates the structure prediction process by performing threading to identify template Protein Database Structures that are similar to the query sequence.…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations
“…The tool that was used in this study to predict the protein structures of the partial proteins was I-TASSER, since it was reported to reliably predict partial structures of a source protein [ 23 ]. I-TASSER initiates the structure prediction process by performing threading to identify template Protein Database Structures that are similar to the query sequence.…”
Section: Discussionmentioning
confidence: 99%
“…I-TASSER initiates the structure prediction process by performing threading to identify template Protein Database Structures that are similar to the query sequence. Structural fragments that are cut from the chosen templates are then used to construct the protein models [ 23 ]. The C-score output of I-TASSER was used to distinguish the foldable and non-foldable sequences because it is founded on the probability that the sequence is homologous to known structures at the subsequence level rather than globally.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…A primary and a very crucial step of novel protein design involves the computational identification or generation of potential protein sequences having a considerably high probability of mimicking the naturally occurring proteins and eventually folding into a compact structure 18,19 . Some of the earlier studies measured the degree of randomness in the known protein sequences to explore the logical explanations with conflicting inferences for constrained available sequence space 5,7,8,19‐22 . Some of the previous computational studies of random sequence proteins argued over the extent of variability among natural protein sequences and random protein sequences 23‐25 .…”
Section: Introductionmentioning
confidence: 99%