2019
DOI: 10.1101/872077
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Deep Learning for RNA Synthetic Biology

Abstract: Engineered RNA elements are programmable tools capable of detecting small molecules, proteins, and nucleic acids. Predicting the behavior of these tools remains a challenge, a situation that could be addressed through enhanced pattern recognition from deep 30 learning. Thus, we investigate Deep Neural Networks (DNN) to predict toehold switch function as a canonical riboswitch model in synthetic biology. To facilitate DNN training, we synthesized and characterized in vivo a dataset of 91,534 toehold switches sp… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(11 citation statements)
references
References 40 publications
0
11
0
Order By: Relevance
“…Aside from the observed conservation of the Shine-Dalgarno (SD) and start codon sequences at positions 31-41 and 48-50, respectively, the sequence logo constructed from all 91,534 toeholds confirms that each of the four nucleotides are relatively evenly distributed at each position 23 .…”
Section: Nucleotide Over-representation In Top-performing Toehold Seqmentioning
confidence: 79%
See 4 more Smart Citations
“…Aside from the observed conservation of the Shine-Dalgarno (SD) and start codon sequences at positions 31-41 and 48-50, respectively, the sequence logo constructed from all 91,534 toeholds confirms that each of the four nucleotides are relatively evenly distributed at each position 23 .…”
Section: Nucleotide Over-representation In Top-performing Toehold Seqmentioning
confidence: 79%
“…A dataset of 244,000 toehold switches ( Fig. 1A), including sequences tiled from viruses and the human genome as well as random sequences (see Methods), have been tested experimentally by Angenent-Mari et al 23 , with 91,534 switches meeting well-defined quality control criteria. Each toehold sequence is 59 nucleotides in length, with the first 30 nucleotides distinguishing the unique unstructured region followed by part of the hairpin; the remaining 29 nucleotides can be inferred by hairpin complementarity as well as Shine-Dalgarno sequence and start codon conservation (Table S1).…”
Section: Nucleotide Over-representation In Top-performing Toehold Seqmentioning
confidence: 99%
See 3 more Smart Citations