2006
DOI: 10.1016/j.patcog.2006.02.014
|View full text |Cite
|
Sign up to set email alerts
|

Prediction of structural classes for protein sequences and domains—Impact of prediction algorithms, sequence representation and homology, and test procedures on accuracy

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

4
140
1

Year Published

2007
2007
2015
2015

Publication Types

Select...
7

Relationship

2
5

Authors

Journals

citations
Cited by 152 publications
(145 citation statements)
references
References 60 publications
(124 reference statements)
4
140
1
Order By: Relevance
“…34 However, compared with the results of the ''359'' dataset, the overall predictive accuracy of the ''1189'' dataset is decreasing, which is consistent with that reported previously. 34,35 Kurgan and Homaeian used the SVM as the classifier, the results of the jackknife test for the ''359'' dataset and ''1189'' dataset were 97.0% and 52.3%, respectively. 34 Kanaka et al have developed a very powerful method, called StackingC ensemble, for predicting protein structural class, the overall success rate was 96.4% for the ''359'' dataset, while on the dataset of ''1189'' proteins, the overall success rate was only about 58.9%.…”
Section: Effect Of Sequence Homology On Resultssupporting
confidence: 89%
See 2 more Smart Citations
“…34 However, compared with the results of the ''359'' dataset, the overall predictive accuracy of the ''1189'' dataset is decreasing, which is consistent with that reported previously. 34,35 Kurgan and Homaeian used the SVM as the classifier, the results of the jackknife test for the ''359'' dataset and ''1189'' dataset were 97.0% and 52.3%, respectively. 34 Kanaka et al have developed a very powerful method, called StackingC ensemble, for predicting protein structural class, the overall success rate was 96.4% for the ''359'' dataset, while on the dataset of ''1189'' proteins, the overall success rate was only about 58.9%.…”
Section: Effect Of Sequence Homology On Resultssupporting
confidence: 89%
“…Meanwhile, a nonredundant dataset of 1189 protein domains is used to evaluate the performance of the proposed method. The overall predictive accuracy achieved for the jackknife test is as high as 59.2%, which is higher than those by the StackingC ensemble, 35 about 2% higher than those by FKNN classifier 39 and about 6% higher than those by Logistic regression, 34 Bayer classifier 10 and SVM. 34 However, compared with the results of the ''359'' dataset, the overall predictive accuracy of the ''1189'' dataset is decreasing, which is consistent with that reported previously.…”
Section: Effect Of Sequence Homology On Resultsmentioning
confidence: 78%
See 1 more Smart Citation
“…In order to test the current method strictly and facilitate the comparison, four benchmark datasets, Z277 [6], Z498 [6], 1189 [29] and 25PDB [40], were adopted as the working datasets. The former two datasets, constructed by Zhou (1998), contain 277 and 498 sequences respectively with pairwise sequence similarities of about 80%.…”
Section: Datasetsmentioning
confidence: 99%
“…The composition vector is a simple sequence representation which is widely used in prediction of various aspects of protein structure [30][31][32][33][34][35][36][37]. The vector is composed of the twenty amino acids, alphabetically ordered, and stores the number of occurrences of the amino acid in the sequence window (in our case the tetrapeptide).…”
Section: Composition Vectormentioning
confidence: 99%