Insertions and deletions in the RNA sequence–structure map

Martin, Nora S.; Ahnert, Sebastian E.

doi:10.1098/rsif.2021.0380

Cited by 19 publications

(32 citation statements)

References 63 publications

“…the temperature T = 37 °C), using the ViennaRNA suboptimal function [7] and the Boltzmann probabilities of these are obtained using the partition function. The energy range for the suboptimals is 15 k_B T as in [40] to be consistent with the RNAshapes data. The final ND GP map is constructed by mapping each genotype sequence to its ensemble of structures in the energy range (including unfolded structure), as well as their respective normalized probabilities.…”

Section: A Rna12 (supporting)

confidence: 74%

The non-deterministic genotype-phenotype map of RNA secondary structure

García-Galindo

Ahnert

Martin

2023

Preprint

Self Cite

Selection and variation are both key aspects in the evolutionary process. Previous research on the mapping between molecular sequence (genotype) and molecular fold (phenotype) has shown the presence of several structural properties in different biological contexts, implying that these might be universal in evolutionary spaces. The deterministic genotype-phenotype (GP) map that links short RNA sequences to minimum free energy secondary structures has been studied extensively because of its computational tractability and biologically realistic nature. However, this mapping ignores the phenotypic plasticity of RNA. We define a GP map that incorporates non-deterministic phenotypes, and take RNA as a case study; we use the Boltzmann probability distribution of folded structures and examine the structural properties of non-deterministic (ND) GP maps for RNA sequences of length 12 and coarse-grained RNA structures of length 30 (RNAshapes30). A framework is presented to study robustness, evolvability and neutral spaces in the non-deterministic map. This framework is validated by demonstrating close correspondence between the non-deterministic quantities and sample averages of their deterministic counterparts. When using the non-deterministic framework we observe the same structural properties as in the deterministic GP map, such as bias, negative correlation between genotypic robustness and evolvability, and positive correlation between phenotypic robustness and evolvability.
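
The ensemble construction described in the quoted statement can be sketched as follows. This is a minimal illustration, not the authors' code: the structure energies are made-up values, the function name `boltzmann_ensemble` is hypothetical, and renormalizing over only the structures kept within the energy range is an assumption. In practice the energies would come from a folding package such as ViennaRNA.

```python
import math

GAS_CONSTANT = 0.0019872        # kcal/(mol*K)
TEMPERATURE = 310.15            # 37 °C in kelvin
KT = GAS_CONSTANT * TEMPERATURE  # thermal energy, about 0.616 kcal/mol


def boltzmann_ensemble(energies, energy_range_kt=15.0):
    """Return {structure: probability} for structures near the minimum.

    energies: dict mapping dot-bracket strings to free energies (kcal/mol).
    Structures more than `energy_range_kt` * k_B T above the minimum free
    energy are dropped; the rest are normalized to sum to 1 (assumption).
    """
    e_min = min(energies.values())
    cutoff = e_min + energy_range_kt * KT
    kept = {s: e for s, e in energies.items() if e <= cutoff}
    # Shift by e_min before exponentiating for numerical stability.
    weights = {s: math.exp(-(e - e_min) / KT) for s, e in kept.items()}
    z = sum(weights.values())
    return {s: w / z for s, w in weights.items()}


# Hypothetical energies for one length-12 sequence (illustrative only).
example_energies = {
    "............": 0.0,   # unfolded structure
    "((((....))))": -1.2,
    ".(((....))).": -0.4,
    "((........))": 9.5,   # falls outside the 15 k_B T window
}
probs = boltzmann_ensemble(example_energies)
```

Each sequence then maps to a probability distribution over structures rather than to a single minimum free energy structure, which is what makes the map non-deterministic.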

“…A valid secondary structure has a matching closing bracket for each opening bracket, hairpin loops have a minimum length of three bases and there are no pseudo-knots [1]. Applying these requirements allows us to generate all >1.3 × 10⁷ valid structures of length L = 35, following our previous work [50]: essentially, we start with a list of starting symbols (either a dot or an opening bracket) and extend these recursively. At each step, we append each of the three dot–bracket symbols (dot, opening bracket and closing bracket), unless adding a certain symbol would make it impossible to turn the string into a valid structure of length L = 35 (for example, by opening more brackets than could be closed, closing more brackets than have been opened, creating a hairpin loop below the minimum length, etc.).…”

Section: Methods (mentioning)

confidence: 99%

Fast free-energy-based neutral set size estimates for the RNA genotype–phenotype map

Martin

Ahnert

2022

J. R. Soc. Interface.

Self Cite

The genotype–phenotype (GP) map of RNA secondary structure links each RNA sequence to its corresponding secondary structure. Previous research has shown that the large-scale structural properties of GP maps, such as the size of neutral sets in genotype space, can influence evolutionary outcomes. In order to use neutral set sizes, efficient and accurate computational methods are needed to compute them. Here, we propose a new method, which is based on free energy estimates and is much faster than existing sample-based methods. Moreover, this approach can give insight into the reasons behind neutral set size variations, for example, why structures with fewer stacks tend to have larger neutral set sizes. In addition, we generalize neutral set size calculations from the previously studied many-to-one framework, where each sequence folds into a single energetically preferred structure, to a fuller many-to-many framework, where several low-energy structures are included. We find that structures with high neutral sets in one framework also tend to have large neutral sets in the other framework for a range of parameters and thus the choice of GP map does not fundamentally affect which structures have the largest neutral set sizes.
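
The recursive enumeration described in the Methods statement quoted for this paper can be sketched as below. This is a hedged illustration rather than the authors' implementation: the function name `enumerate_structures` and the bookkeeping variables are hypothetical, only the three rules named in the quote are enforced (balanced brackets, hairpin loops of at least three bases, no pseudo-knots), and the sketch has not been checked against the count reported for L = 35.

```python
def enumerate_structures(length, min_loop=3):
    """Yield every valid dot-bracket string of the given length.

    A symbol is appended only if the string can still be completed into a
    valid structure, so the recursion never runs into a dead end.
    """

    def extend(prefix, n_open, pending):
        # n_open:  brackets opened but not yet closed
        # pending: dots still required before ')' may close a hairpin
        remaining = length - len(prefix)
        if remaining == 0:
            yield "".join(prefix)  # pruning guarantees n_open == 0 here
            return
        after = remaining - 1  # positions left once one symbol is added

        # Dot: must leave room for the missing loop dots and all closings.
        new_pending = max(pending - 1, 0)
        if after >= n_open + new_pending:
            prefix.append(".")
            yield from extend(prefix, n_open, new_pending)
            prefix.pop()

        # Opening bracket: needs a full minimum loop plus all closings.
        if after >= (n_open + 1) + min_loop:
            prefix.append("(")
            yield from extend(prefix, n_open + 1, min_loop)
            prefix.pop()

        # Closing bracket: something must be open and the loop long enough.
        if n_open > 0 and pending == 0:
            prefix.append(")")
            yield from extend(prefix, n_open - 1, 0)
            prefix.pop()

    yield from extend([], 0, 0)


structures = list(enumerate_structures(7))
print(len(structures))  # 8
```

Because infeasible prefixes are discarded as soon as they arise, every partial string that is kept leads to at least one valid structure. Note that the output includes the fully unpaired structure.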

“…To begin our analysis, following ref. [30] (and see also [37, 38]), we use the RNAshapes [39, 40] method. According to this method, an RNA dot-bracket SS can be abstracted to one of five levels, of increasing abstraction, by ignoring details such as the length of loops, but including broad shape features.…”

Section: Results (mentioning)

confidence: 99%

“…It is known that a single strand of RNA can fold into more than one possible structure, and some strands even form different structures in vivo and in vitro [73]. Further, even if a given sequence has a minimum free energy SS which dominates over other suboptimal SS, nonetheless the sequence will assume different SS in accordance with a Boltzmann distribution [38, 74]. As is common practice in biology and bioinformatics — as well as the vast majority of earlier RNA SS studies — here we have simplified the GP map by assuming that the minimum free energy SS predicted by the computational folding package is ‘the’ single phenotype.…”

Section: Discussion (mentioning)

confidence: 99%

Random and natural non-coding RNA have similar structural motif patterns but can be distinguished by bulge, loop, and bond counts

Ghaddar

Dingle

2022

Preprint

An important question in evolutionary biology is whether and in what ways genotype-phenotype (GP) map biases can influence evolutionary trajectories. Untangling the relative roles of natural selection and biases (and other factors) in shaping phenotypes can be difficult. Because RNA secondary structure (SS) can be analysed in detail mathematically and computationally, is biologically relevant, and a wealth of bioinformatic data is available, it offers a good model system for studying the role of bias. For quite short RNA (length L ≤ 126), it has recently been shown that natural and random RNA are structurally very similar, suggesting that bias strongly constrains evolutionary dynamics. Here we extend these results with emphasis on much larger RNA with length up to 3000 nucleotides. By examining both abstract shapes and structural motif frequencies (i.e. the numbers of helices, bonds, bulges, junctions, and loops), we find that large natural and random structures are also very similar, especially when contrasted to typical structures sampled from the space of all possible RNA structures. Our motif frequency study yields another result, that the frequencies of different motifs can be used in machine learning algorithms to classify random and natural RNA with quite high accuracy, especially for longer RNA (e.g. ROC AUC 0.86 for L = 1000). The most important motifs for classification are found to be the number of bulges, loops, and bonds. This finding may be useful in using SS to detect candidates for functional RNA within 'junk' DNA regions.
