2020
DOI: 10.1021/acssynbio.0c00394
|View full text |Cite
|
Sign up to set email alerts
|

An Automated Model Test System for Systematic Development and Improvement of Gene Expression Models

Abstract: Gene expression models greatly accelerate the engineering of synthetic metabolic pathways and genetic circuits by predicting sequence-function relationships and reducing trial-and-error experimentation. However, developing models with more accurate predictions is a significant challenge, even though they are essential to engineering complex genetic systems. Here we present a model test system that combines advanced statistics, machine learning, and a database of 9862 characterized genetic systems to automatica… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
118
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
2

Relationship

2
6

Authors

Journals

citations
Cited by 117 publications
(118 citation statements)
references
References 58 publications
0
118
0
Order By: Relevance
“…We designed the 14206 promoter sequences to measure how each motif sequence alters these free energies to create a sequence-complete model, including all possible -10 hexamers, all possible - 35 hexamers, all possible -10 extended motifs within 8 consensus and anti-consensus background configurations, 229 sequence spacers with varied lengths and nucleotide compositions, 2420 UP elements with varied AT content within 4 consensus and anti-consensus background configurations, 735 discriminator elements with varied lengths and GC content, and 582 ITR sequences with varied R-loop stabilities ( Figure 1B ). Using oligopool synthesis and two-step library cloning, we constructed a barcoded plasmid pool that uses each promoter in a common genetic context, expressing a single protein with a moderate translation initiation rate (about 5000 on the RBS Calculator v2.1 scale 5 ), with unique barcode sequences positioned in the 3’ untranslated region to avoid confounding effects.…”
Section: Main Textmentioning
confidence: 99%
See 2 more Smart Citations
“…We designed the 14206 promoter sequences to measure how each motif sequence alters these free energies to create a sequence-complete model, including all possible -10 hexamers, all possible - 35 hexamers, all possible -10 extended motifs within 8 consensus and anti-consensus background configurations, 229 sequence spacers with varied lengths and nucleotide compositions, 2420 UP elements with varied AT content within 4 consensus and anti-consensus background configurations, 735 discriminator elements with varied lengths and GC content, and 582 ITR sequences with varied R-loop stabilities ( Figure 1B ). Using oligopool synthesis and two-step library cloning, we constructed a barcoded plasmid pool that uses each promoter in a common genetic context, expressing a single protein with a moderate translation initiation rate (about 5000 on the RBS Calculator v2.1 scale 5 ), with unique barcode sequences positioned in the 3’ untranslated region to avoid confounding effects.…”
Section: Main Textmentioning
confidence: 99%
“…Transcription is the gene expression process responsible for producing all RNA and is a common engineering target for creating novel products, including microbial chemical factories, toxinsensing genetic circuits, and mRNA vaccines [1][2][3] . However, while DNA assembly techniques enable the construction of custom-designed genetic systems 4 , it remains challenging to a priori predict and control a system's gene expression profile 5 , for example, by initiating transcription with desired rates at specific DNA start sites, while minimizing transcription from all other DNA sequence regions. Currently, transcriptional control relies on empirical characterization of promoters as modular genetic parts 6 .…”
Section: Main Textmentioning
confidence: 99%
See 1 more Smart Citation
“…We therefore augmented the RBS Calculator model to explicitly include the apparent translation elongation rate of the protein’s coding sequence. As the starting point, the RBS Calculator calculates the ribosome’s binding free energy (ΔG total ) using a 5-term free energy model 20, 21, 25, 26 and then predicts a protein coding sequence’s translation initiation rate (r init ) according to: where the 30S ribosomal subunit’s binding free energy (ΔG total ) depends only on the mRNA’s sequence and c 1 is a proportionality constant that accounts for extrinsic differences here influenced by the cosolute composition. After the 30S ribosomal subunit binds to the mRNA, it recruits the 50S ribosomal subunit, forms the 70S initiation complex, and initiates translation.…”
Section: Resultsmentioning
confidence: 99%
“…Different RBS sequences were designed for each CFPS system to replace the original RBS sequence between the T7 promoter sequence and the initiation of sfGFP gene. RBS sequences were designed according to different hosts by the RBS library calculator (De Novo DNA: RBS Library Calculator) (Reis and Salis 2020 ; Salis et al 2009 ). Because the range of translation initiation rates were very wide, so six RBS sequences (Table 1 ) were equably selected from the minimum to the maximum translation initiation rates.…”
Section: Methodsmentioning
confidence: 99%