Transposable elements (TEs) are ubiquitous components of eukaryotic genomes and can create variation in genome organization and content. Most maize genomes are composed of TEs. We developed an approach to define shared and variable TE insertions across genome assemblies and applied this method to four maize genomes (B73, W22, Mo17 and PH207) with uniform structural annotations of TEs. Among these genomes we identified approximately 400 000 TEs that are polymorphic, encompassing 1.6 Gb of variable TE sequence. These polymorphic TEs include a combination of recent transposition events as well as deletions of older TEs. There are examples of polymorphic TEs within each of the superfamilies of TEs and they are found distributed across the genome, including in regions of recent shared ancestry among individuals. There are many examples of polymorphic TEs within or near maize genes. In addition, there are 2380 gene annotations in the B73 genome that are located within variable TEs, providing evidence for the role of TEs in contributing to the substantial differences in annotated gene content among these genotypes. TEs are highly variable in our survey of four temperate maize genomes, highlighting the major contribution of TEs in driving variation in genome organization and gene content.
Transposable elements (TEs) are ubiquitous components of eukaryotic genomes and can create variation in genomic organization. The majority of maize genomes are composed of TEs. We developed an approach to define shared and variable TE insertions across genome assemblies and applied this method to four maize genomes (B73, W22, Mo17, and PH207). Among these genomes we identified 1.6 Gb of variable TE sequence representing a combination of recent TE movement and deletion of previously existing TEs. Although recent TE movement only accounted for a portion of the TE variability, we identified 4,737 TEs unique to one genome with defined insertion sites in all other genomes. Variable TEs are found for all superfamilies and are distributed across the genome, including in regions of recent shared ancestry among individuals. There are 2,380 genes annotated in the B73 genome located within variable TEs, providing evidence for the role of TEs in contributing to the substantial differences in gene content among these genotypes. The large scope of TE variation present in this limited sample of temperate maize genomes highlights the major contribution of TEs in driving variation in genome organization and gene content.Significance StatementThe majority of the maize genome is comprised of transposable elements (TEs) that have the potential to create genomic variation within species. We developed a method to identify shared and non-shared TEs using whole genome assemblies of four maize inbred lines. Variable TEs are found throughout the maize genome and in comparisons of any two genomes we find ~20% of the genome is due to non-shared TEs. Several thousand maize genes are found within TEs that are variable across lines, highlighting the contribution of TEs to gene content variation. This study creates a comprehensive resource for genomic studies of TE variability among four maize genomes, which will enable studies on the consequences of variable TEs on genome function.
SUMMARYMaize is a diverse paleotetraploid species with considerable presence/absence variation and copy number variation. One mechanism through which presence/absence variation can arise is differential fractionation. Fractionation refers to the loss of duplicate gene pairs from one of the maize subgenomes during diploidization. Differential fractionation refers to non-shared gene loss events between individuals following a whole-genome duplication event. We investigated the prevalence of presence/absence variation resulting from differential fractionation in the syntenic portion of the genome using two whole-genome de novo assemblies of the inbred lines B73 and PH207. Between these two genomes, syntenic genes were highly conserved with less than 1% of syntenic genes being subject to differential fractionation. The few variably fractionated syntenic genes that were identified are unlikely to contribute to functional phenotypic variation, as there is a significant depletion of these genes in annotated gene sets. In further comparisons of 60 diverse inbred lines, non-syntenic genes were six times more likely to be variable than syntenic genes, suggesting that comparisons among additional genome assemblies are not likely to result in the discovery of large-scale presence/absence variation among syntenic genes.
Tandem duplicate genes are proximally duplicated and as such occur in similar genomic neighborhoods. Using the maize B73 and PH207 de novo genome assemblies, we identified thousands of tandem gene duplicates that account for ∼10% of the annotated genes. These tandem duplicates have a bimodal distribution of ages, which coincide with ancient allopolyploidization and more recent domestication. Tandem duplicates are smaller on average and have a higher probability of containing LTR elements than other genes, suggesting origins in nonhomologous recombination. Within relatively recent tandem duplicate genes, ∼26% appear to be undergoing degeneration or divergence in function from the ancestral copy. Our results show that tandem duplicates are abundant in maize, arose in bursts throughout maize evolutionary history under multiple potential mechanisms, and may provide a substrate for novel phenotypic variation.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.