2017
DOI: 10.1016/j.gene.2017.03.011
|View full text |Cite
|
Sign up to set email alerts
|

A new hybrid coding for protein secondary structure prediction based on primary structure similarity

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
8

Relationship

1
7

Authors

Journals

citations
Cited by 18 publications
(9 citation statements)
references
References 18 publications
0
9
0
Order By: Relevance
“…Bouziane et al [7] also investigated the effect of two input data types including Position-Specific Scoring Matrix (PSSM) [38] and a coding scheme which is used to represent the amino-acids. -Support Vector Machine Using Hybrid Coding for Protein (SVM-HC): The method proposed by Li et al [30] initially employs the geometry-based similarities and where the similarity comparison is not applicable, the SVM module will be used to leverage the final protein structure. Moreover, SVM-HC extracts a 6-bit code from the physiochemical properties of both amino-acids and the tendency factors.…”
Section: Comparison and Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Bouziane et al [7] also investigated the effect of two input data types including Position-Specific Scoring Matrix (PSSM) [38] and a coding scheme which is used to represent the amino-acids. -Support Vector Machine Using Hybrid Coding for Protein (SVM-HC): The method proposed by Li et al [30] initially employs the geometry-based similarities and where the similarity comparison is not applicable, the SVM module will be used to leverage the final protein structure. Moreover, SVM-HC extracts a 6-bit code from the physiochemical properties of both amino-acids and the tendency factors.…”
Section: Comparison and Discussionmentioning
confidence: 99%
“…Feature extraction from protein sequences is the first step when applying a machine learning approach. However, the extracted features may not reflect all the information a sequence contains and thus can lead to information loss [10,30,33]. Nevertheless, from biological perspective, protein sequence contains indispensable information to adopt certain structures [31,33,34].…”
Section: Introductionmentioning
confidence: 99%
“…We did not perform additional attribute evaluations with other methods, such as PCA 28 , because our set of attributes is already small, i.e., this is not the case of high dimensionality issues. Furthermore, the IG analysis and the density plots clearly demonstrate that all nine classification attributes chosen in this work present some degree of relevance.…”
Section: Methodsmentioning
confidence: 99%
“…For example, Ref [25] proposed an ensemble method that combines four predictors and uses a random forest classifier for the prediction of human sub-cellular localization. Four predictors, based either on support vector machine [26] or naive Bayes [27], employed different features of protein to make predictions.…”
Section: The Ensemble Algorithm Based On Bi-lstmmentioning
confidence: 99%