The classification of the legume family proposed here addresses the long‐known non‐monophyly of the traditionally recognised subfamily Caesalpinioideae, by recognising six robustly supported monophyletic subfamilies. This new classification uses as its framework the most comprehensive phylogenetic analyses of legumes to date, based on plastid matK gene sequences, and including near‐complete sampling of genera (698 of the currently recognised 765 genera) and ca. 20% (3696) of known species. The matK gene region has been the most widely sequenced across the legumes, and in most legume lineages, this gene region is sufficiently variable to yield well‐supported clades. This analysis resolves the same major clades as in other phylogenies of whole plastid and nuclear gene sets (with much sparser taxon sampling). Our analysis improves upon previous studies that have used large phylogenies of the Leguminosae for addressing evolutionary questions, because it maximises generic sampling and provides a phylogenetic tree that is based on a fully curated set of sequences that are vouchered and taxonomically validated. The phylogenetic trees obtained and the underlying data are available to browse and download, facilitating subsequent analyses that require evolutionary trees. Here we propose a new community‐endorsed classification of the family that reflects the phylogenetic structure that is consistently resolved and recognises six subfamilies in Leguminosae: a recircumscribed Caesalpinioideae DC., Cercidoideae Legume Phylogeny Working Group (stat. nov.), Detarioideae Burmeist., Dialioideae Legume Phylogeny Working Group (stat. nov.), Duparquetioideae Legume Phylogeny Working Group (stat. nov.), and Papilionoideae DC. The traditionally recognised subfamily Mimosoideae is a distinct clade nested within the recircumscribed Caesalpinioideae and is referred to informally as the mimosoid clade pending a forthcoming formal tribal and/or clade‐based classification of the new Caesalpinioideae. We provide a key for subfamily identification, descriptions with diagnostic charactertistics for the subfamilies, figures illustrating their floral and fruit diversity, and lists of genera by subfamily. This new classification of Leguminosae represents a consensus view of the international legume systematics community; it invokes both compromise and practicality of use.
Phylogenomic analyses have helped resolve many recalcitrant relationships in the angiosperm tree of life, yet phylogenetic resolution of the backbone of the Leguminosae, one of the largest and most economically and ecologically important families, remains poor due to generally limited molecular data and incomplete taxon sampling of previous studies. Here, we resolve many of the Leguminosae’s thorniest nodes through comprehensive analysis of plastome-scale data using multiple modified coding and noncoding data sets of 187 species representing almost all major clades of the family. Additionally, we thoroughly characterize conflicting phylogenomic signal across the plastome in light of the family’s complex history of plastome evolution. Most analyses produced largely congruent topologies with strong statistical support and provided strong support for resolution of some long-controversial deep relationships among the early diverging lineages of the subfamilies Caesalpinioideae and Papilionoideae. The robust phylogenetic backbone reconstructed in this study establishes a framework for future studies on legume classification, evolution, and diversification. However, conflicting phylogenetic signal was detected and quantified at several key nodes that prevent the confident resolution of these nodes using plastome data alone. [Leguminosae; maximum likelihood; phylogenetic conflict; plastome; recalcitrant relationships; stochasticity; systematic error.]
Phylogenomics is increasingly used to infer deep-branching relationships while revealing the complexity of evolutionary processes such as incomplete lineage sorting, hybridization/introgression and polyploidization. We investigate the deep-branching relationships among subfamilies of the Leguminosae (or Fabaceae), the third largest angiosperm family. Despite their ecological and economic importance, a robust phylogenetic framework for legumes based on genome-scale sequence data is lacking. We generated alignments of 72 chloroplast genes and 7621 homologous nuclear-encoded proteins, for 157 and 76 taxa, respectively. We analysed these with maximum likelihood, Bayesian inference, and a multispecies coalescent summary method, and evaluated support for alternative topologies across gene trees. We resolve the deepest divergences in the legume phylogeny despite lack of phylogenetic signal across all chloroplast genes and the majority of nuclear genes. Strongly supported conflict in the remainder of nuclear genes is suggestive of incomplete lineage sorting. All six subfamilies originated nearly simultaneously, suggesting that the prevailing view of some subfamilies as 'basal' or 'early-diverging' with respect to others should be abandoned, which has important implications for understanding the evolution of legume diversity and traits. Our study highlights the limits of phylogenetic resolution in relation to rapid successive speciation.
Subfamily Caesalpinioideae with ca. 4,600 species in 152 genera is the second-largest subfamily of legumes (Leguminosae) and forms an ecologically and economically important group of trees, shrubs and lianas with a pantropical distribution. Despite major advances in the last few decades towards aligning genera with clades across Caesalpinioideae, generic delimitation remains in a state of considerable flux, especially across the mimosoid clade. We test the monophyly of genera across Caesalpinioideae via phylogenomic analysis of 997 nuclear genes sequenced via targeted enrichment (Hybseq) for 420 species and 147 of the 152 genera currently recognised in the subfamily. We show that 22 genera are non-monophyletic or nested in other genera and that non-monophyly is concentrated in the mimosoid clade where ca. 25% of the 90 genera are found to be non-monophyletic. We suggest two main reasons for this pervasive generic non-monophyly: (i) extensive morphological homoplasy that we document here for a handful of important traits and, particularly, the repeated evolution of distinctive fruit types that were historically emphasised in delimiting genera and (ii) this is an artefact of the lack of pantropical taxonomic syntheses and sampling in previous phylogenies and the consequent failure to identify clades that span the Old World and New World or conversely amphi-Atlantic genera that are non-monophyletic, both of which are critical for delimiting genera across this large pantropical clade. Finally, we discuss taxon delimitation in the phylogenomic era and especially how assessing patterns of gene tree conflict can provide additional insights into generic delimitation. This new phylogenomic framework provides the foundations for a series of papers reclassifying genera that are presented here in Advances in Legume Systematics (ALS) 14 Part 1, for establishing a new higher-level phylogenetic tribal and clade-based classification of Caesalpinioideae that is the focus of ALS14 Part 2 and for downstream analyses of evolutionary diversification and biogeography of this important group of legumes which are presented elsewhere.
Allelic variation within individuals holds information regarding the relationships of organisms, which is expected to be particularly important for reconstructing the evolutionary history of closely related taxa. However, little effort has been committed to incorporate such information for reconstructing the phylogeny of organisms. Haplotype trees represent a solution when one nonrecombinant marker is considered, but there is no satisfying method when multiple genes are to be combined. In this paper, we propose an algorithm that converts a distance matrix of alleles to a distance matrix among organisms. This algorithm allows the incorporation of allelic variation for reconstructing the phylogeny of organisms from one or more genes. The method is applied to reconstruct the phylogeny of the seven native diploid species of Rosa sect. Cinnamomeae in North America. The glyceralgehyde 3-phosphate dehydrogenase (GAPDH), the triose phosphate isomerase (TPI), and the malate synthase (MS) genes were sequenced for 40 individuals from these species. The three genes had little genetic variation, and most species showed incomplete lineage sorting, suggesting these species have a recent origin. Despite these difficulties, the networks (NeighborNet) of organisms reconstructed from the matrix obtained with the algorithm recovered groups that more closely match taxonomic boundaries than did the haplotype trees. The combined network of individuals shows that species west of the Rocky Mountains, Rosa gymnocarpa and R. pisocarpa, form exclusive groups and that together they are distinct from eastern species. In the east, three groups were found to be exclusive: R. nitida-R. palustris, R. foliolosa, and R. blanda-R. woodsii. These groups are congruent with the morphology and the ecology of species. The method is also useful for representing hybrid individuals when the relationships are reconstructed using a phylogenetic network.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.