2016
DOI: 10.1261/rna.057364.116
|View full text |Cite
|
Sign up to set email alerts
|

Genome-scale characterization of RNA tertiary structures and their functional impact by RNA solvent accessibility prediction

Abstract: As most RNA structures are elusive to structure determination, obtaining solvent accessible surface areas (ASAs) of nucleotides in an RNA structure is an important first step to characterize potential functional sites and core structural regions. Here, we developed RNAsnap, the first machine-learning method trained on protein-bound RNA structures for solvent accessibility prediction. Built on sequence profiles from multiple sequence alignment (RNAsnap-prof), the method provided robust prediction in fivefold cr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

5
44
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
5
2

Relationship

5
2

Authors

Journals

citations
Cited by 31 publications
(49 citation statements)
references
References 57 publications
5
44
0
Order By: Relevance
“…This is probably because synonymous mutations that don't change expressed proteins affect biological functions mainly through the change of RNA secondary structure. The predictions by RNAplfold achieved a PCC of 0.749 that's greater than the PCC of 0.473 with the predicted ASA(accessible surface area) from the RNAsnap-seq, consistent with the previous study (Yang, et al, 2017). This ranking order is consistent with all other four types of mutations, non-synonymous mutations, stop-gain mutations, and mutations occurring in the 3'UTR (untranslated region), and 5'UTR regions (Figure 3 and Figure S1-S3 in supplemental file).…”
Section: Relation Of Predicted Secondary Structure With the Maf Of Gesupporting
confidence: 90%
“…This is probably because synonymous mutations that don't change expressed proteins affect biological functions mainly through the change of RNA secondary structure. The predictions by RNAplfold achieved a PCC of 0.749 that's greater than the PCC of 0.473 with the predicted ASA(accessible surface area) from the RNAsnap-seq, consistent with the previous study (Yang, et al, 2017). This ranking order is consistent with all other four types of mutations, non-synonymous mutations, stop-gain mutations, and mutations occurring in the 3'UTR (untranslated region), and 5'UTR regions (Figure 3 and Figure S1-S3 in supplemental file).…”
Section: Relation Of Predicted Secondary Structure With the Maf Of Gesupporting
confidence: 90%
“…This is because sequence conservations in regions with different flexibility have different patterns. In a previous study, we have obtained evolution‐based sequence profiles by querying the RNA sequences against RNA sequence library using BLASTN with E ‐value < 0.001 and maximum of 50,000 homologous sequences . The j base probability ( j = A, T/U, G, C) in multiple aligned homologous sequences at a given position i , P i , j was calculated as P i , j = – log[( N i , j )/∑ j ( N i , j )], where N i , j is the number of observed base type j at position i .…”
Section: Methodsmentioning
confidence: 99%
“…s ( b i ) was set to 0.3 for the other base type b i and 9.0 for the query base type. The obtained sequence profiles were normalized to a range of (–1, 1) before used for training and test …”
Section: Methodsmentioning
confidence: 99%
“…Moreover, it is the lowest sequence identity cutoff allowed by the program CD-HIT [59]. This cutoff was also employed previously for establishing non-redundant RNA sequences [61,62] In addition to the HTlncRNA set as the negative set, we also included mRNAs from GENCODE V19 as the negative set. These mRNAs were randomly selected with <80% sequence similarity between each other and from selected HTlncRNAs and EVlncRNAs.…”
Section: Training and Test Datasets For Human Lncrnasmentioning
confidence: 99%
“…Predicted solvent accessible surface area (ASA) of RNA. RNA ASA values were predicted by RNAsnap [61].…”
Section: Features Based On Sequencesmentioning
confidence: 99%