2007
DOI: 10.1186/1471-2105-8-404
|View full text |Cite
|
Sign up to set email alerts
|

Application of amino acid occurrence for discriminating different folding types of globular proteins

Abstract: BackgroundPredicting the three-dimensional structure of a protein from its amino acid sequence is a long-standing goal in computational/molecular biology. The discrimination of different structural classes and folding types are intermediate steps in protein structure prediction.ResultsIn this work, we have proposed a method based on linear discriminant analysis (LDA) for discriminating 30 different folding types of globular proteins using amino acid occurrence. Our method was tested with a non-redundant set of… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

1
77
0

Year Published

2010
2010
2017
2017

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 63 publications
(78 citation statements)
references
References 25 publications
1
77
0
Order By: Relevance
“…The RBF kernel parameters, gamma and , are optimized using LibSVM. The following feature sets are computed from the original protein sequences for the experiment: PF1, PF2 [21], PF [22], Occurrence (O) [19], AAC and [6]. We have used PSSM probabilities to find the consensus sequence for each of the original protein sequence in both of the datasets.…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…The RBF kernel parameters, gamma and , are optimized using LibSVM. The following feature sets are computed from the original protein sequences for the experiment: PF1, PF2 [21], PF [22], Occurrence (O) [19], AAC and [6]. We have used PSSM probabilities to find the consensus sequence for each of the original protein sequence in both of the datasets.…”
Section: Resultsmentioning
confidence: 99%
“…The TG dataset extracted by [19] consists of 1612 proteins belonging to 30 most populated folds from the SCOP 1.73. The sequence similarity of protein of TG datasets is no more than 25%.…”
Section: Datasetmentioning
confidence: 99%
See 2 more Smart Citations
“…Similarly, it has been pointed out that the co-occurrence information of amino acids in proteins is also important for capturing protein structures [6], [7], [24]. In addition, in most existing methods for PFR, feature vectors of amino acids typically include physicochemical information such as hydrophobicity, polarity, and van der Waals volume.…”
Section: Why Word Representations?mentioning
confidence: 99%