While bacterial operons have been thoroughly studied, few analyses of chloroplast operons exist, limiting the ability to study fundamental elements of these structures and utilize them for synthetic biology. Here, we describe the creation of a plastome-specific operon database (link provided below) achieved by combining experimental tools and predictive modeling. Using a Reverse-Transcription-PCR based method and published data, we determined the transcription-state of 213 gene pairs from four plastomes of evolutionary distinct organisms. By analyzing sequence-based features computed for our dataset, we were able to highlight fundamental characteristics differentiating between operon pairs and non-operon pairs. These include an interesting tendency toward maintaining similar messenger RNA-folding profiles in operon gene pairs, a feature that failed to yield any informative separation in cyanobacteria, suggesting that it catches unique traits of operon gene expression, which have evolved post-endosymbiosis. Subsequently, we used this feature set to train a random-forest classifier for operon prediction. As our results demonstrate the ability of our predictor to obtain accurate (84%) and robust predictions on unlabeled datasets, we proceeded to building operon maps for 2018 sequenced plastids. Our database may now present new opportunities for promoting metabolic engineering and synthetic biology in chloroplasts.
Chloroplasts originated from an ancient cyanobacterium and still harbor a bacterial-like genome. However, the centrality of Shine–Dalgarno ribosome binding, which predominantly regulates proteobacterial translation initiation, is significantly decreased in chloroplasts. As plastid ribosomal RNA anti-Shine–Dalgarno elements are similar to their bacterial counterparts, these sites alone cannot explain this decline. By computational simulation we show that upstream point mutations modulate the local structure of ribosomal RNA in chloroplasts, creating significantly tighter structures around the anti-Shine–Dalgarno locus, which in-turn reduce the probability of ribosome binding. To validate our model, we expressed two reporter genes (mCherry, hydrogenase) harboring a Shine–Dalgarno motif in the Chlamydomonas reinhardtii chloroplast. Coexpressing them with a 16S ribosomal RNA, modified according to our model, significantly enhances mCherry and hydrogenase expression compared with coexpression with an endogenous 16S gene.
The integration of genes into the nuclear genome of Chlamydomonas reinhardtii is mediated by Non-Homologous-End-Joining, thus resulting in unpredicted insertion locations. This phenomenon defines 'the position-effect', which is used to explain the variation of expression levels between different clones transformed with the same DNA fragment. Likewise, nuclear transgenes often undergo epigenetic silencing that reduces their expression; hence, nuclear transformations require high-throughput screening methods to isolate clones that express the foreign gene at a desirable level. Here, we show that the number of integration sites of heterologous genes results in higher mRNA levels. By transforming both a synthetic ferredoxin-hydrogenase fusion enzyme and a Gaussia-Luciferase reporter protein, we were able to obtain 33 positive clones that exhibit a wide range of synthetic expression. We then performed a droplet-digital polymerasechain-reaction for these lines to measure their transgene DNA copy-number and mRNA levels. Surprisingly, most clones contain two integration sites of the synthetic gene (45.5%), whilst 33.3% contain one, 18.1% include three and 3.1% encompass four. Remarkably, we observed a positive correlation between the raw DNA copy-number values to the mRNA levels, suggesting a general effect of which transcription of transgenes is partially modulated by their number of copies in the genome. However, our data indicate that only clones harboring at least three copies of the target amplicon show a significant increment in mRNA levels of the reporter transgene. Lastly, we measured protein activity for each of the reporter genes to elucidate the effect of copynumber variation on heterologous expression.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.