BackgroundThe MYB superfamily constitutes one of the most abundant groups of transcription factors described in plants. Nevertheless, their functions appear to be highly diverse and remain rather unclear. To date, no genome-wide characterization of this gene family has been conducted in a legume species. Here we report the first genome-wide analysis of the whole MYB superfamily in a legume species, soybean (Glycine max), including the gene structures, phylogeny, chromosome locations, conserved motifs, and expression patterns, as well as a comparative genomic analysis with Arabidopsis.ResultsA total of 244 R2R3-MYB genes were identified and further classified into 48 subfamilies based on a phylogenetic comparative analysis with their putative orthologs, showed both gene loss and duplication events. The phylogenetic analysis showed that most characterized MYB genes with similar functions are clustered in the same subfamily, together with the identification of orthologs by synteny analysis, functional conservation among subgroups of MYB genes was strongly indicated. The phylogenetic relationships of each subgroup of MYB genes were well supported by the highly conserved intron/exon structures and motifs outside the MYB domain. Synonymous nucleotide substitution (dN/dS) analysis showed that the soybean MYB DNA-binding domain is under strong negative selection. The chromosome distribution pattern strongly indicated that genome-wide segmental and tandem duplication contribute to the expansion of soybean MYB genes. In addition, we found that ~ 4% of soybean R2R3-MYB genes had undergone alternative splicing events, producing a variety of transcripts from a single gene, which illustrated the extremely high complexity of transcriptome regulation. Comparative expression profile analysis of R2R3-MYB genes in soybean and Arabidopsis revealed that MYB genes play conserved and various roles in plants, which is indicative of a divergence in function.ConclusionsIn this study we identified the largest MYB gene family in plants known to date. Our findings indicate that members of this large gene family may be involved in different plant biological processes, some of which may be potentially involved in legume-specific nodulation. Our comparative genomics analysis provides a solid foundation for future functional dissection of this family gene.
MYB proteins comprise a large family of plant transcription factors, members of which perform a variety of functions in plant biological processes. To date, no genome-wide characterization of this gene family has been conducted in maize (Zea mays). In the present study, we performed a comprehensive computational analysis, to yield a complete overview of the R2R3-MYB gene family in maize, including the phylogeny, expression patterns, and also its structural and functional characteristics. The MYB gene structure in maize and Arabidopsis were highly conserved, indicating that they were originally compact in size. Subgroup-specific conserved motifs outside the MYB domain may reflect functional conservation. The genome distribution strongly supports the hypothesis that segmental and tandem duplication contribute to the expansion of maize MYB genes. We also performed an updated and comprehensive classification of the R2R3-MYB gene families in maize and other plant species. The result revealed that the functions were conserved between maize MYB genes and their putative orthologs, demonstrating the origin and evolutionary diversification of plant MYB genes. Species-specific groups/subgroups may evolve or be lost during evolution, resulting in functional divergence. Expression profile study indicated that maize R2R3-MYB genes exhibit a variety of expression patterns, suggesting diverse functions. Furthermore, computational prediction potential targets of maize microRNAs (miRNAs) revealed that miR159, miR319, and miR160 may be implicated in regulating maize R2R3-MYB genes, suggesting roles of these miRNAs in post-transcriptional regulation and transcription networks. Our comparative analysis of R2R3-MYB genes in maize confirm and extend the sequence and functional characteristics of this gene family, and will facilitate future functional analysis of the MYB gene family in maize.
R2R3-MYB proteins (2R-MYBs) are one of the main transcription factor families in higher plants. Since the evolutionary history of this gene family across the eukaryotic kingdom remains unknown, we performed a comparative analysis of 2R-MYBs from 50 major eukaryotic lineages, with particular emphasis on land plants. A total of 1548 candidates were identified among diverse taxonomic groups, which allowed for an updated classification of 73 highly conserved subfamilies, including many newly identified subfamilies. Our results revealed that the protein architectures, intron patterns, and sequence characteristics were remarkably conserved in each subfamily. At least four subfamilies were derived from early land plants, 10 evolved from spermatophytes, and 19 from angiosperms, demonstrating the diversity and preferential expansion of this gene family in land plants. Moreover, we determined that their remarkable expansion was mainly attributed to whole genome and segmental duplication, where duplicates were preferentially retained within certain subfamilies that shared three homologous intron patterns (a, b, and c) even though up to 12 types of patterns existed. Through our integrated distributions, sequence characteristics, and phylogenetic tree analyses, we confirm that 2R-MYBs are old and postulate that 3R-MYBs may be evolutionarily derived from 2R-MYBs via intragenic domain duplication.
MYB genes are widely distributed in higher plants and comprise one of the largest transcription factors, which are characterized by the presence of a highly conserved MYB domain at their N-termini. Over recent decades, biochemical and molecular characterizations of MYB have been extensively studied and reported to be involved in many physiological and biochemical processes. This review describes current knowledge of their structure characteristic, classification, multi-functionality, mechanism of combinational control, evolution, and function redundancy. It shows that the MYB transcription factors play a key role in plant development, such as secondary metabolism, hormone signal transduction, disease resistance, cell shape, organ development, etc. Furthermore, the expression of some members of the MYB family shows tissue-specificity.
MYB proteins constitute one of the largest transcription factor families in plants. Recent evidence revealed that MYB-related genes play crucial roles in plants. However, compared with the R2R3-MYB type, little is known about the complex evolutionary history of MYB-related proteins in plants. Here, we present a genome-wide analysis of MYB-related proteins from 16 species of flowering plants, moss, Selaginella, and algae. We identified many MYB-related proteins in angiosperms, but few in algae. Phylogenetic analysis classified MYB-related proteins into five distinct subgroups, a result supported by highly conserved intron patterns, consensus motifs, and protein domain architecture. Phylogenetic and functional analyses revealed that the Circadian Clock Associated 1-like/R-R and Telomeric DNA-binding protein-like subgroups are >1 billion yrs old, whereas the I-box-binding factor-like and CAPRICE-like subgroups appear to be newly derived in angiosperms. We further demonstrated that the MYB-like domain has evolved under strong purifying selection, indicating the conservation of MYB-related proteins. Expression analysis revealed that the MYB-related gene family has a wide expression profile in maize and soybean development and plays important roles in development and stress responses. We hypothesize that MYB-related proteins initially diversified through three major expansions and domain shuffling, but remained relatively conserved throughout the subsequent plant evolution.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.