2018
DOI: 10.1016/j.jtbi.2018.02.002
|View full text |Cite
|
Sign up to set email alerts
|

EvoStruct-Sub: An accurate Gram-positive protein subcellular localization predictor using evolutionary and structural features

Abstract: Determining subcellular localization of proteins is considered as an important step towards understanding their functions. Previous studies have mainly focused solely on Gene Ontology (GO) as the main feature to tackle this problem. However, it was shown that features extracted based on GO is hard to be used for new proteins with unknown GO. At the same time, evolutionary information extracted from Position Specific Scoring Matrix (PSSM) have been shown as another effective features to tackle this problem. Des… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
11
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
8
1

Relationship

3
6

Authors

Journals

citations
Cited by 32 publications
(11 citation statements)
references
References 104 publications
0
11
0
Order By: Relevance
“…Support Vector Machine (SVM) was first proposed by Vapnik et al in 1995 [45], and is a ML method based on Vapnik-Chervonenkis (VC) dimension theory and the principle of structural risk minimization. It was first applied to classical classification problems and showing promise in solving nonlinear and high dimensional problems, then the method was applied to common regression problems [46, 47]. An SVM applied to nonlinear regression is called Support Vector Regression (SVR).…”
Section: Methodsmentioning
confidence: 99%
“…Support Vector Machine (SVM) was first proposed by Vapnik et al in 1995 [45], and is a ML method based on Vapnik-Chervonenkis (VC) dimension theory and the principle of structural risk minimization. It was first applied to classical classification problems and showing promise in solving nonlinear and high dimensional problems, then the method was applied to common regression problems [46, 47]. An SVM applied to nonlinear regression is called Support Vector Regression (SVR).…”
Section: Methodsmentioning
confidence: 99%
“…However, all these methods use information from the primary sequence or the frequencies of amino acid residues or nucleotides in each side of the window. Secondary Structural information was earlier used with primary sequence information in problems of predicting protein binding sites, succinylation, phosphorylation, drug‐target interaction, DNA‐binding protein identification, and protein sub‐cellular localization . To the best of our knowledge, secondary structural information and physico‐chemical properties are not explored to predict lysine glycation of peptides.…”
Section: Methodsmentioning
confidence: 99%
“…These features are considered important source to provide information about the local interaction of amino acids along the protein sequence. Also, they have been used in different studies to tackle different problems in protein science and attained promising results 5356 . The subsequent sections below discuss these structural properties.…”
Section: Methodsmentioning
confidence: 99%