2016
DOI: 10.1371/journal.pone.0167345
|View full text |Cite
|
Sign up to set email alerts
|

DNABP: Identification of DNA-Binding Proteins Based on Feature Selection Using a Random Forest and Predicting Binding Residues

Abstract: DNA-binding proteins are fundamentally important in cellular processes. Several computational-based methods have been developed to improve the prediction of DNA-binding proteins in previous years. However, insufficient work has been done on the prediction of DNA-binding proteins from protein sequence information. In this paper, a novel predictor, DNABP (DNA-binding proteins), was designed to predict DNA-binding proteins using the random forest (RF) classifier with a hybrid feature. The hybrid feature contains … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
38
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 42 publications
(38 citation statements)
references
References 54 publications
0
38
0
Order By: Relevance
“…Recognizing specific DNA sequences by DNA binding proteins (DBPs) is critical for fundamental biological activities like transcription, DNA repair, and DNA replication. Based on a complex model resolved from X‐ray crystallography or NMR experiments, multiple knowledge‐ and physics‐based algorithms have already been developed to analyze protein‐DNA complex, 50‐53 predict protein binding sites 54‐58 or aid DBP design 59‐62 . The specific recognition of DNA by DBP is often regulated by mutations or post‐translational modifications (PTMs) 63,64 .…”
Section: Discussionmentioning
confidence: 99%
“…Recognizing specific DNA sequences by DNA binding proteins (DBPs) is critical for fundamental biological activities like transcription, DNA repair, and DNA replication. Based on a complex model resolved from X‐ray crystallography or NMR experiments, multiple knowledge‐ and physics‐based algorithms have already been developed to analyze protein‐DNA complex, 50‐53 predict protein binding sites 54‐58 or aid DBP design 59‐62 . The specific recognition of DNA by DBP is often regulated by mutations or post‐translational modifications (PTMs) 63,64 .…”
Section: Discussionmentioning
confidence: 99%
“…In recent years, computational methods based on machine learning (ML) algorithms have become more and more popular because of their promising performance. According to various feature information, the ML-based approachs are mainly composed of structure information-based [ 8 18 ] and sequence information-based method [ 1 , 2 , 19 35 ].…”
Section: Introductionmentioning
confidence: 99%
“…Lou et al proposed a prediction method of DNA-binding proteins by performing the feature rank using random forest and the wrapper-based feature selection using a forward best-first search strategy [ 6 ]. Ma et al used the random forest classifier with a hybrid feature set by incorporating binding propensity of DNA-binding residues [ 7 ]. Professor Liu’s group developed several novel tools for predicting DNA-Binding proteins, such as iDNA-Prot|dis by incorporating amino acid distance-pairs and reducing alphabet profiles into the general pseudo amino acid composition [ 8 ], PseDNA-Pro by combining PseAAC and physiochemical distance transformations [ 9 ], iDNAPro-PseAAC by combining pseudo amino acid composition and profile-based protein representation [ 10 ], iDNA-KACC by combining auto-cross covariance transformation and ensemble learning [ 11 ].…”
Section: Introductionmentioning
confidence: 99%