Background
The Begonia species are common shade plants that are mostly found in southwest China. They have not been well studied despite their medicinal and decorative uses because gene penetration, decreased adaptability, and restricted availability are all caused by frequent interspecific hybridization.
Result
To understand the patterns of mutation in the chloroplast genomes of different species of Begonia, as well as their evolutionary relationship, we collected seven Begonia species in China and sequenced their chloroplast genomes. Begonia species exhibit a quadripartite structure of chloroplast genomes (157,634 − 169,694 bp), consisting of two pairs of inverted repeats (IR: 26,529 − 37,674 bp), a large single copy (LSC: 75,477 − 86,500 bp), and a small single copy (SSC: 17,861 − 18,367 bp). 128–142 genes (comprising 82–93 protein-coding genes, 8 ribosomal RNAs, and 36–43 transfer RNAs) are found in the chloroplast genomes. Based on comparative analyses, this taxon has a relatively similar genome structure. A total of ten substantially divergent DNA regions (rpl32, trnS-GCU, trnT-UGU-trnL-UAA, atpF-atpH, ycf4-cemA, rps19-trnG-UCC, trnD-GUC-trnY-GUA, trnC-GCA-petN, trnE-UUC-trnT-GGU and petD-rpoA) are found in the seventeen chloroplast genomes. These regions are suitable for species identification and phylogeographic analysis. Phylogenetic analysis shows that Begonia species that were suited to comparable environments grouped in a small clade and that all Begonia species formed one big clade in the phylogenetic tree, supporting the genus' monophyly. In addition, positive selection sites are discovered in eight genes (rpoC1, rpoB, psbE, psbK, petA, rps12, rpl2 and rpl22), the majority of which are involved in protein production and photosynthesis.
Conclusion
Using these genome resources, we can resolve deep-level phylogenetic relationships between Begonia species and their families, leading to a better understanding of evolutionary processes. In addition to enhancing species identification and phylogenetic resolution, these results demonstrate the utility of complete chloroplast genomes in phylogenetically and taxonomically challenging plant groupings.