2017
DOI: 10.3390/molecules22101602
|View full text |Cite
|
Sign up to set email alerts
|

Identification of DNA-Binding Proteins Using Mixed Feature Representation Methods

Abstract: DNA-binding proteins play vital roles in cellular processes, such as DNA packaging, replication, transcription, regulation, and other DNA-associated activities. The current main prediction method is based on machine learning, and its accuracy mainly depends on the features extraction method. Therefore, using an efficient feature representation method is important to enhance the classification accuracy. However, existing feature representation methods cannot efficiently distinguish DNA-binding proteins from non… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
36
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
7
2
1

Relationship

1
9

Authors

Journals

citations
Cited by 35 publications
(36 citation statements)
references
References 33 publications
(46 reference statements)
0
36
0
Order By: Relevance
“…DNA binding protein is a kind of special protein molecule, whose identification is one of the most important tasks in studying the function of proteins. In this regard, many computational predictors have been proposed [ 15 , 16 , 17 , 18 , 19 , 20 , 21 ]. In a special issue, Zhang et al [ 22 ] proposed a new approach to extract evolutionary information from the Position Specific Frequency Matrix (PSFM) and incorporate the evolutionary information, and a computational predictor was proposed for DNA binding protein identification.…”
Section: Machine Learning Related Researchesmentioning
confidence: 99%
“…DNA binding protein is a kind of special protein molecule, whose identification is one of the most important tasks in studying the function of proteins. In this regard, many computational predictors have been proposed [ 15 , 16 , 17 , 18 , 19 , 20 , 21 ]. In a special issue, Zhang et al [ 22 ] proposed a new approach to extract evolutionary information from the Position Specific Frequency Matrix (PSFM) and incorporate the evolutionary information, and a computational predictor was proposed for DNA binding protein identification.…”
Section: Machine Learning Related Researchesmentioning
confidence: 99%
“…These methods primarily focus on the following two aspects: (1) the construction of encoding schemes for protein sequences and (2) the application of classification algorithms. Many machine learning techniques have been adopted to perform the prediction of DBPs, including support vector machine (SVM) [5][6][7], random forest (RF) [8][9][10], naive Bayes classifier [4], ensemble classifiers [11][12][13], and deep learning [14][15][16]. Among these algorithms, SVM and RF have been widely used because of their excellent performance.…”
Section: Introductionmentioning
confidence: 99%
“…Prediction of DBPs is usually formulated as a supervised learning problem. In recent years, many classification algorithms have been adopted to solve this problem, including support vector machine (SVM) [24][25][26], random forest (RF) [27,28], naive Bayes classifier [3], ensemble classifiers [29][30][31], and deep learning [32][33][34]. Among these models, stacked generalization (or stacking) is an ensemble learning technique that takes the outputs of base classifiers as input and attempts to find the optimal combination of the base learners to make a better prediction [35].…”
Section: Introductionmentioning
confidence: 99%