2019
DOI: 10.3389/fgene.2019.00637
|View full text |Cite
|
Sign up to set email alerts
|

PredPRBA: Prediction of Protein-RNA Binding Affinity Using Gradient Boosted Regression Trees

Abstract: Protein-RNA interactions play essential roles in many biological aspects. Quantifying the binding affinity of protein-RNA complexes is helpful to the understanding of protein-RNA recognition mechanisms and identification of strong binding partners. Due to experimentally measured protein-RNA binding affinity data available is still limited to date, there is a pressing demand for accurate and reliable computational approaches. In this paper, we propose a computational approach, PredPRBA, which can effectively pr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
23
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 32 publications
(23 citation statements)
references
References 55 publications
0
23
0
Order By: Relevance
“…GBDT can generate highly robust, interpretable and competitive classification procedures, especially for exploiting less than clean data [ 29 , 40 , 41 ]. For an lncRNA–protein pair , an estimator denotes an approximate function response to the label , the GBDT model iteratively builds K different individual decision tree using the training data .…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…GBDT can generate highly robust, interpretable and competitive classification procedures, especially for exploiting less than clean data [ 29 , 40 , 41 ]. For an lncRNA–protein pair , an estimator denotes an approximate function response to the label , the GBDT model iteratively builds K different individual decision tree using the training data .…”
Section: Methodsmentioning
confidence: 99%
“…Fan and Zhang [ 28 ] explored a stacked ensemble-based LPI classification model via logistical regression (LPI-BLS). Deng et al [ 29 ] proposed a gradient boosted regression tree for finding possible LPIs. Wekesa et al [ 30 ] designed a categorical boosting-based LPI discovery framework (LPI-CatBoost).…”
Section: Introductionmentioning
confidence: 99%
“…Random forest is a generally acknowledged ensemble classifier for machine learning and can exploit large data repositories for the analysis of risk predictors and their intimate interactions and advancement risk prediction capability ( Yang et al, 2021 ). It was used in evaluating cervical cancer candidate genes as the main objects of our research ( Nunn et al, 2012 ; Wang et al, 2018 ; Deng et al, 2019b ; Ru et al, 2019 ; Lv et al, 2020 ; Yang et al, 2020 ). In the random forest setting, we divided the expression profile data of 103 genes according to the properties of the samples and considered the samples before radiotherapy as “good” and the samples during radiotherapy “poor,” respectively.…”
Section: Methodsmentioning
confidence: 99%
“…In this study, we incorporated ANOVA with incremental feature selection (IFS) [54]- [56] to pick out the optimal feature subset. The principle of ANOVA is to calculate the score (F value) of each feature in feature subset, which indicates the contribution of each feature to classification.…”
Section: Analysis Of Variancementioning
confidence: 99%