Whether the Amborella/Amborella-Nymphaeales or the grass lineage diverged first within the angiosperms has recently been debated. Central to this issue has been focused on the artifacts that might result from sampling only grasses within the monocots. We therefore sequenced the entire chloroplast genome (cpDNA) of Phalaenopsis aphrodite, Taiwan moth orchid. The cpDNA is a circular molecule of 148,964 bp with a comparatively short single-copy region (11,543 bp) due to the unusual loss and truncation/scattered deletion of certain ndh subunits. An open reading frame, orf91, located in the complementary strand of the rrn23 was reported for the first time. A comparison of nucleotide substitutions between P. aphrodite and the grasses indicates that only the plastid expression genes have a strong positive correlation between nonsynonymous (Ka) and synonymous (Ks) substitutions per site, providing evidence for a generation time effect, mainly across these genes. Among the intron-containing protein-coding genes of the sampled monocots, the Ks of the genes are significantly correlated to transitional substitutions of their introns. We compiled a concatenated 61 protein-coding gene alignment for the available 20 cpDNAs of vascular plants and analyzed the data set using Bayesian inference, maximum parsimony, and neighbor-joining (NJ) methods. The analyses yielded robust support for the Amborella/Amborella-Nymphaeales-basal hypothesis and for the orchid and grasses together being a monophyletic group nested within the remaining angiosperms. However, the NJ analysis using Ka, the first two codon positions, or amino acid sequences, respectively, supports the monocots-basal hypothesis. We demonstrated that these conflicting angiosperm phylogenies are most probably linked to the transitional sites at all codon positions, especially at the third one where the strong base-composition bias and saturation effect take place.
We estimated the dates of the monocot-dicot split and the origin of core eudicots using a large chloroplast (cp) genomic dataset. Sixty-one protein-coding genes common to the 12 completely sequenced cp genomes of land plants were concatenated and analyzed. Three reliable split events were used as calibration points and for cross references. Both the method based on the assumption of a constant rate and the Li-Tanimura unequal-rate method were used to estimate divergence times. The phylogenetic analyses indicated that nonsynonymous substitution rates of cp genomes are unequal among tracheophyte lineages. For this reason, the constant-rate method gave overestimates of the monocot-dicot divergence and the age of core eudicots, especially when fast-evolving monocots were included in the analysis. In contrast, the Li-Tanimura method gave estimates consistent with the known evolutionary sequence of seed plant lineages and with known fossil records. Combining estimates calibrated by two known fossil nodes and the Li-Tanimura method, we propose that monocots branched off from dicots 140-150 Myr ago (late Jurassic-early Cretaceous), at least 50 Myr younger than previous estimates based on the molecular clock hypothesis, and that the core eudicots diverged 100-115 Myr ago (Albian-Aptian of the Cretaceous). These estimates indicate that both the monocot-dicot divergence and the core eudicot's age are older than their respective fossil records.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.