Background
Carduus, commonly known as plumeless thistles, is a genus in the family Asteraceae that includes species of medicinal value as well as invasive species. However, genomic resources for Carduus, specifically complete chloroplast genomes, have not yet been reported.
Methods
We sequenced and assembled the chloroplast genome (cpDNA) sequences of three Carduus species using the Illumina MiSeq sequencing system and Geneious Prime. Phylogenetic relationships between Carduus and related taxa were reconstructed using Maximum Likelihood and Bayesian Inference analyses. In addition, we used a single nucleotide polymorphism (SNP) in the protein-coding region of the matK gene to develop molecular markers that distinguish C. crispus from C. acanthoides and C. tenuiflorus.
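The abstract does not say how the diagnostic matK SNP was located, so the following is only a minimal sketch of one way such a site could be found in an alignment. It assumes the sequences are already aligned to equal length. The function name `diagnostic_sites` and the short toy sequences are hypothetical and are not real matK data.

```python
from typing import Dict, List, Tuple


def diagnostic_sites(alignment: Dict[str, str], target: str) -> List[Tuple[int, str, str]]:
    """Return (1-based position, target base, shared base of the others) for
    every column where the target differs from all other sequences and the
    other sequences agree with each other."""
    target_seq = alignment[target]
    others = [seq for name, seq in alignment.items() if name != target]
    if any(len(seq) != len(target_seq) for seq in others):
        raise ValueError("sequences must be aligned to equal length")

    sites = []
    for i, base in enumerate(target_seq):
        other_bases = {seq[i] for seq in others}
        # Ignore columns containing gaps or ambiguity codes.
        if base not in "ACGT" or not other_bases <= set("ACGT"):
            continue
        # Diagnostic: the others share one base and the target has a different one.
        if len(other_bases) == 1 and base not in other_bases:
            sites.append((i + 1, base, next(iter(other_bases))))
    return sites


# Hypothetical toy alignment (NOT real matK sequence), for illustration only.
toy = {
    "C_crispus":     "ATGGAGGAATTCCAAAGA",
    "C_acanthoides": "ATGGAGGAATTTCAAAGA",
    "C_tenuiflorus": "ATGGAGGAATTTCAAAGA",
}

print(diagnostic_sites(toy, "C_crispus"))  # [(12, 'C', 'T')]
```

A site reported this way is only a candidate. Turning it into a working marker would still require primer design and laboratory validation, which this sketch does not cover.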
Results
The cpDNA sequences of C. crispus, C. acanthoides, and C. tenuiflorus ranged from 152,342 bp to 152,617 bp in length. Comparative genomic analysis revealed high conservation in gene content (80 protein-coding, 30 tRNA, and four rRNA genes) and gene order among the three focal species and other members of subfamily Carduoideae. Despite this high similarity, the three species differed in the number and content of repeats in the chloroplast genome. Additionally, eight hotspot regions (psbI-trnS_GCU, trnE_UUC-rpoB, trnR_UCU-trnG_UCC, psbC-trnS_UGA, trnT_UGU-trnL_UAA, psbT-psbN, petD-rpoA, and rpl16-rps3) were identified in the study species. Phylogenetic analyses based on 78 protein-coding and non-coding regions indicated that Carduus is polyphyletic, suggesting the need for additional studies to reconstruct relationships between thistles and related taxa. Based on a SNP in matK, we successfully developed a molecular marker and protocol for distinguishing C. crispus from the other two focal species. Our study provides preliminary chloroplast genome data for further studies on plastid genome evolution, phylogeny, and the development of species-level markers in Carduus.