The species of Clostridium comprise a very heterogeneous assemblage of bacteria that do not form a phylogenetically coherent group. It has been proposed previously that only a subset of the species of Clostridium that form a distinct cluster in the 16S rRNA tree (cluster I) should be regarded as the true representatives of the genus Clostridium (i.e. Clostridium sensu stricto). However, this cluster is presently defined only in phylogenetic terms, and no biochemical, molecular or phenotypic characteristic is known that is unique to species from this cluster. We report here phylogenomic and comparative analyses based on sequenced clostridial genomes in an attempt to bridge this gap and to clarify the evolutionary relationships among species of clostridia. In phylogenetic trees for species of clostridia based on concatenated sequences for 37 highly conserved proteins, the species of Clostridium cluster I formed a strongly supported clade that was separated from all other clostridia by a long branch. Several other Clostridium species that are not part of this cluster grouped reliably with other species of clostridia in a number of wellresolved clades. Our comparative genomic analyses have identified three conserved indels in three highly conserved proteins (a 4 aa insert in DNA gyrase A, a 1 aa deletion in ATP synthase beta subunit and a 1 aa insert in ribosomal protein S2) that are unique to the species of Clostridium cluster I and are not found in any other bacteria. BLASTP searches on various proteins in the genomes of Clostridium tetani E88 and Clostridium perfringens SM101 have also identified more than 10 proteins that are found uniquely in the cluster I species. These results provide evidence that the species of Clostridium cluster I not only are phylogenetically distinct but also share many unique molecular characteristics. These newly identified molecular markers provide useful tools to define and circumscribe the genus Clostridium sensu stricto in more definitive terms. We have also identified a 7-9 aa conserved insert in the enzyme phosphoglycerate dehydrogenase that is uniquely found in the Clostridium thermocellum, Thermoanaerobacter pseudethanolicus, Thermoanaerobacter tengcogensis and Caldicellulosiruptor saccharolyticus homologues, and is absent from all other bacteria. These species form a well-defined clade in the phylogenetic trees and this indel provides a potential molecular marker for this clostridial cluster.
INTRODUCTIONThe species belonging to the genus Clostridium comprise a heterogeneous assemblage that does not form a monophyletic grouping in phylogenetic trees based on different gene/protein sequences (Cato et al., 1986;Lawson et al., 1993;Stackebrandt et al., 1999;Finegold et al., 2002;Wiegel et al., 2006). Many species were originally assigned to this genus based on their being Gram-positive, anaerobic rods that are capable of endospore formation (Cato et al., 1986;Cato & Stackebrandt, 1989;Lawson et al., 1993;Finegold et al., 2002;Rood, 2006). However, these criteria led to the place...