BackgroundClostridium autoethanogenum is an acetogenic bacterium capable of producing high value commodity chemicals and biofuels from the C1 gases present in synthesis gas. This common industrial waste gas can act as the sole energy and carbon source for the bacterium that converts the low value gaseous components into cellular building blocks and industrially relevant products via the action of the reductive acetyl-CoA (Wood-Ljungdahl) pathway. Current research efforts are focused on the enhancement and extension of product formation in this organism via synthetic biology approaches. However, crucial to metabolic modelling and directed pathway engineering is a reliable and comprehensively annotated genome sequence.ResultsWe performed next generation sequencing using Illumina MiSeq technology on the DSM10061 strain of Clostridium autoethanogenum and observed 243 single nucleotide discrepancies when compared to the published finished sequence (NCBI: GCA_000484505.1), with 59.1 % present in coding regions. These variations were confirmed by Sanger sequencing and subsequent analysis suggested that the discrepancies were sequencing errors in the published genome not true single nucleotide polymorphisms. This was corroborated by the observation that over 90 % occurred within homopolymer regions of greater than 4 nucleotides in length. It was also observed that many genes containing these sequencing errors were annotated in the published closed genome as encoding proteins containing frameshift mutations (18 instances) or were annotated despite the coding frame containing stop codons, which if genuine, would severely hinder the organism’s ability to survive. Furthermore, we have completed a comprehensive manual curation to reduce errors in the annotation that occur through serial use of automated annotation pipelines in related species. As a result, different functions were assigned to gene products or previous functional annotations rejected because of missing evidence in various occasions.ConclusionsWe present a revised manually curated full genome sequence for Clostridium autoethanogenum DSM10061, which provides reliable information for genome-scale models that rely heavily on the accuracy of annotation, and represents an important step towards the manipulation and metabolic modelling of this industrially relevant acetogen.Electronic supplementary materialThe online version of this article (doi:10.1186/s12864-015-2287-5) contains supplementary material, which is available to authorized users.
Clostridium encompasses species which are relevant to human and animal disease as well as species which have industrial potential, for instance, as producers of chemicals and fuels or as tumour delivery vehicles. Genetic manipulation of these target organisms is critical for advances in these fields. DNA transfer efficiencies, however, vary between species. Low efficiencies can impede the progress of research efforts.A novel conjugal donor strain of Escherichia coli has been created which exhibits a greater than 10-fold increases in conjugation efficiency compared to the traditionally used CA434 strain in the three species tested; C. autoethanogenum DSM 10061, C. sporogenes NCIMB 10696 and C. difficile R20291. The novel strain, designated ‘sExpress’, does not methylate DNA at Dcm sites (CCWGG) which allows circumvention of cytosine-specific Type IV restriction systems.A robust protocol for conjugation is presented which routinely produces in the order of 105 transconjugants per millilitre of donor cells for C. autoethanogenum, 106 for C. sporogenes and 102 for C. difficile R20291. The novel strain created is predicted to be a superior conjugal donor in a wide range of species which possess Type IV restriction systems.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.