On circular bacterial chromosomes, the majority of genes are coded on the leading strand. This gene strand bias (GSB) ranges from up to 85% in some Bacillota to a little more than 50% in other phyla. The factors determining the extent of the strand bias remain to be found. Here, we report that species in the phylum Gemmatimonadota share a unique chromosome architecture, distinct from neighboring phyla: in a conserved 600-kb region around the terminus of replication, almost all genes were located on the leading strands, while on the remaining part of the chromosome, the strand preference was more balanced. The high strand bias (HSB) region harbors the rRNA clusters, core, and highly expressed genes. Selective pressure for reduction of collisions with DNA replication to minimize detrimental mutations can explain the conservation of essential genes in this region. Repetitive and mobile elements are underrepresented, suggesting reduced recombination frequency by structural isolation from other parts of the chromosome. We propose that the HSB region forms a distinct chromosomal domain. Gemmatimonadota chromosomes evolved mainly by expansion through horizontal gene transfer and duplications outside of the ancient high strand bias region. In support of our hypothesis, we could further identify two Spiroplasma strains on a similar evolutionary path.
IMPORTANCE
On bacterial chromosomes, a preferred location of genes on the leading strand has evolved to reduce conflicts between replication and transcription. Despite a vast body of research, the question why bacteria show large differences in their gene strand bias is still not solved. The discovery of “hybrid” chromosomes in different phyla, including Gemmatimonadota, in which a conserved high strand bias is found exclusively in a region at
ter
, points toward a role of nucleoid structure, additional to replication, in the evolution of strand preferences. A fine-grained structural analysis of the ever-increasing number of available bacterial genomes could help to better understand the forces that shape the sequential and spatial organization of the cell’s information content.