1999
DOI: 10.1002/(sici)1097-0134(19990301)34:4<508::aid-prot10>3.0.co;2-4
|View full text |Cite
|
Sign up to set email alerts
|

Evaluation and improvement of multiple sequence methods for protein secondary structure prediction

Abstract: A new dataset of 396 protein domains is developed and used to evaluate the performance of the protein secondary structure prediction algorithms DSC, PHD, NNSSP, and PREDATOR. The maximum theoretical Q 3 accuracy for combination of these methods is shown to be 78%. A simple consensus prediction on the 396 domains, with automatically generated multiple sequence alignments gives an average Q 3 prediction accuracy of 72.9%. This is a 1% improvement over PHD, which was the best single method evaluated. Segment Over… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

5
520
0
9

Year Published

1999
1999
2014
2014

Publication Types

Select...
6
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 615 publications
(534 citation statements)
references
References 84 publications
5
520
0
9
Order By: Relevance
“…Of these, one, Gln-7 to Pro (NS3-Q7P), was designed to disrupt the native structural motif of the first 14 amino acids. The other two substitutions, Arg-8 to Glu and Glu-10 to Ala (NS3-R8E, NS3-E10A), were made in an attempt to maintain the overall protein structure as predicted by using JPRED (30).…”
Section: Site-directed Mutagenesis Of the P11-binding Motif Of Ns3 Andmentioning
confidence: 99%
“…Of these, one, Gln-7 to Pro (NS3-Q7P), was designed to disrupt the native structural motif of the first 14 amino acids. The other two substitutions, Arg-8 to Glu and Glu-10 to Ala (NS3-R8E, NS3-E10A), were made in an attempt to maintain the overall protein structure as predicted by using JPRED (30).…”
Section: Site-directed Mutagenesis Of the P11-binding Motif Of Ns3 Andmentioning
confidence: 99%
“…Indeed, by using a combination of techniques such as multiple sequence alignment and neural networks, one can predict the basic secondary structural features, e.g., ␣-helices and ␤-strands, with a confidence that exceeds 70%. [11][12][13][14] Our simplified task could be thought of as a step in the ultimate goal of ab initio tertiary structure prediction. As we shall demonstrate, an important attendant advantage of such a simplified challenge is that it allows one to glean fundamental insights into the protein folding problem.…”
Section: Introductionmentioning
confidence: 99%
“…Therefore, when treated as a binary-class classification problem, the data set contains few positive examples. 2 The set of 513 protein sequences was constructed by [3], which includes almost all the sequences in the RS126 dataset. It contains 84, 119 residues of which 22.7% are β-sheets.…”
Section: Data Sets and Experiments Setupmentioning
confidence: 99%