2022
DOI: 10.3389/fgene.2021.821996
|View full text |Cite
|
Sign up to set email alerts
|

Identify DNA-Binding Proteins Through the Extreme Gradient Boosting Algorithm

Abstract: The exploration of DNA-binding proteins (DBPs) is an important aspect of studying biological life activities. Research on life activities requires the support of scientific research results on DBPs. The decline in many life activities is closely related to DBPs. Generally, the detection method for identifying DBPs is achieved through biochemical experiments. This method is inefficient and requires considerable manpower, material resources and time. At present, several computational approaches have been develop… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0

Year Published

2022
2022
2025
2025

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 21 publications
(10 citation statements)
references
References 94 publications
0
10
0
Order By: Relevance
“…Several methods have been implemented for the identification of DBPs. The proposed work is compared with past studies including iDNA-Prot [ 22 ], iDNA-Prot|dis [ 23 ], TargetDBP [ 59 ], MsDBP [ 60 ], PDBP-CNN [ 29 ], and XGBoost [ 30 ] and summarized the results in Table 4 . Our proposed study improved the accuracy by 4.82%, sensitivity by 10.58%, and MCC by 0.09 than the best predictor (PDBP-CNN).…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…Several methods have been implemented for the identification of DBPs. The proposed work is compared with past studies including iDNA-Prot [ 22 ], iDNA-Prot|dis [ 23 ], TargetDBP [ 59 ], MsDBP [ 60 ], PDBP-CNN [ 29 ], and XGBoost [ 30 ] and summarized the results in Table 4 . Our proposed study improved the accuracy by 4.82%, sensitivity by 10.58%, and MCC by 0.09 than the best predictor (PDBP-CNN).…”
Section: Resultsmentioning
confidence: 99%
“…Consequently, Li et al extracted features by a convolutional neural network (CNN) and Bi-LSTM [ 29 ]. Onward, Zhao et al the features of the proteins are analyzed by six methods and classification is performed with XGBoost [ 30 ]. Each computational method contributed well to enhancing the prediction of DBPs.…”
Section: Introductionmentioning
confidence: 99%
“…It can be used for regression, classification, and ranking problems. It is a ML model that integrates multiple weak learners to achieve a stronger learning effect ( 43 ). Compared with other traditional ML algorithms, XGBoost is highly scalable and flexible.…”
Section: Methodsmentioning
confidence: 99%
“…Different from the GBDT algorithm, which only uses the first derivative when optimizing the loss function, XGB is a second-order Taylor expansion of the loss function and adds the regular term to the objective function. Since the loss function of XGB can introduce regularization terms, the complexity of the model can be controlled, the variance of the model can be reduced, and the overfitting of the model can be prevented (Zhao et al, 2022). After each iteration, the learning rate is assigned to leaf nodes to reduce the weight of each tree, reduce the influence of each tree, and provide a better learning space for the following.…”
Section: Extreme Gradient Boostingmentioning
confidence: 99%