2017
DOI: 10.1002/jcc.25124
|View full text |Cite
|
Sign up to set email alerts
|

B‐factor profile prediction for RNA flexibility using support vector machines

Abstract: Determining the flexibility of structured biomolecules is important for understanding their biological functions. One quantitative measurement of flexibility is the atomic Debye-Waller factor or temperature B-factor. Most existing studies are limited to temperature B-factors of proteins and their prediction. Only one method attempted to predict temperature B-factors of ribosomal RNA. Here, we developed and compared machine-learning techniques in prediction of temperature B-factors of RNAs. The best model based… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
18
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 15 publications
(18 citation statements)
references
References 37 publications
0
18
0
Order By: Relevance
“…In this application, we used SVR to construct the prediction model to estimate the cleavage probability of substrate cleavage sites for a given protease. Owing to its excellent generalization capabilities, SVR has recently been applied in a growing number of applications in bioinformatics and computational biology, including cleavage site prediction [15,29,30], residue accessible surface area [95], protein B-factor [96,97], half sphere exposure [98], disulfide connectivity [99], residue depth [54], torsion angles [29] and protein expression-level prediction [100]. It demonstrates competitive performance compared with other machine learning approaches, especially when dealing with realvalued prediction tasks.…”
Section: Machine Learning Methodsmentioning
confidence: 99%
“…In this application, we used SVR to construct the prediction model to estimate the cleavage probability of substrate cleavage sites for a given protease. Owing to its excellent generalization capabilities, SVR has recently been applied in a growing number of applications in bioinformatics and computational biology, including cleavage site prediction [15,29,30], residue accessible surface area [95], protein B-factor [96,97], half sphere exposure [98], disulfide connectivity [99], residue depth [54], torsion angles [29] and protein expression-level prediction [100]. It demonstrates competitive performance compared with other machine learning approaches, especially when dealing with realvalued prediction tasks.…”
Section: Machine Learning Methodsmentioning
confidence: 99%
“…Moreover, it is the lowest sequence identity cutoff allowed by the program CD-HIT [59]. This cutoff was also employed previously for establishing non-redundant RNA sequences [61,62] In addition to the HTlncRNA set as the negative set, we also included mRNAs from GENCODE V19 as the negative set. These mRNAs were randomly selected with <80% sequence similarity between each other and from selected HTlncRNAs and EVlncRNAs.…”
Section: Training and Test Datasets For Human Lncrnasmentioning
confidence: 99%
“…Other than the above deterministic models, data-driven machine learning models are also considered in flexibility analysis [20][21][22][23][24][25][26][27][28][29], thanks to the accumulation of ever-increasing experimental data. In these learning models, biomolecular genetic, epigenetic, evolutional and structural information are extracted and used as features in machine learning models, such as support vector machine (SVM), random forest (RF), gradient boost tree (GBT) and artificial neural network (ANN).…”
Section: Introductionmentioning
confidence: 99%
“…In these learning models, biomolecular genetic, epigenetic, evolutional and structural information are extracted and used as features in machine learning models, such as support vector machine (SVM), random forest (RF), gradient boost tree (GBT) and artificial neural network (ANN). Among these learning models, an evolution-information-based learning model has been used in RNA flexibility analysis [27]. In this model, position-specific iterative basic local alignment search tool (PSI-BLAST) [30] is considered for homologous sequence identification.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation