In the past, bacterial phylogeny relied almost exclusively on 16S rRNA gene sequence analysis. More recently, multilocus sequence analysis has been used to infer organismal phylogenies. In this study, the dnaJ chaperone gene was investigated as a marker for phylogeny studies in alphaproteobacteria. Preliminary analysis of G+C contents and G+C3s contents (the G+C content of the synonymous third codon position) showed no clear evidence of horizontal transfer of this gene in proteobacteria. dnaJ-based phylogenies were then analysed at three taxonomic levels: the Proteobacteria, the Alphaproteobacteria and the genus Mesorhizobium. Dendrograms based on DnaJ and 16S rRNA gene sequences revealed the same topology described previously for the Proteobacteria. These results indicate that the DnaJ phylogenetic signal is able to reproduce the accepted relationships among the five classes of the Proteobacteria. At a lower taxonomic level, using 20 alphaproteobacteria, the 16S rRNA gene-based phylogeny is distinct from the one based on DnaJ sequence analysis. Although the same clusters are generated, only the topology of the DnaJ tree is consistent with broader phylogenies from recent studies based on concatenated alignments of multiple core genes. For example, the DnaJ tree shows the two clusters within the Rhizobiales as closely related, as expected, while the 16S rRNA gene-based phylogeny shows them as distantly related. In order to evaluate the phylogenetic performance of dnaJ at the genus level, a multilocus analysis based on five housekeeping genes (atpD, gapA, gyrB, recA and rplB) was performed for ten Mesorhizobium species. In contrast to the 16S rRNA gene, the DnaJ sequence analysis generated a tree similar to the multilocus dendrogram. For identification of chickpea mesorhizobium isolates, a dnaJ nucleotide sequence-based tree was used. Despite different topologies, 16S rRNA gene-and dnaJ-based trees led to the same species identification. This study suggests that the dnaJ gene is a good phylogenetic marker, particularly for the class Alphaproteobacteria, since its phylogeny is consistent with phylogenies based on multilocus approaches.
INTRODUCTIONThe phylum Proteobacteria is composed of five classes, the Alphaproteobacteria, Betaproteobacteria, Gammaproteobacteria, Deltaproteobacteria and Epsilonproteobacteria (Garrity et al., 2005;Stackebrandt et al., 1988). By the end of 2008, more than 380 complete genomes of proteobacteria were available in public databases (http://www. ncbi.nlm.nih.gov/genomes/lproks.cgi). Alphaproteobacteria exhibit an enormous diversity in their morphological and metabolic characteristics and the class is presently recognized solely as a clade in the 16S rRNA gene-based phylogeny (Stackebrandt et al., 1988). Based on 16S rRNA gene trees, the Alphaproteobacteria has been divided into seven orders: Caulobacterales, Rhizobiales, Rhodobacterales, Rhodospirillales, Rickettsiales, Sphingomonadales and Parvularculales (Kersters et al., 2006). The Alphaproteobacteria includes important bacteria tha...