Resequencing or reference-based assemblies reveal large parts of the small-scale sequence variation. However, they typically fail to separate such local variation into colinear and rearranged variation, because they usually do not recover the complement of large-scale rearrangements, including transpositions and inversions. Besides the availability of hundreds of genomes of diverse Arabidopsis thaliana accessions, there is so far only one full-length assembled genome: the reference sequence. We have assembled 117 Mb of the A. thaliana Landsberg erecta (Ler) genome into five chromosome-equivalent sequences using a combination of short Illumina reads, long PacBio reads, and linkage information. Whole-genome comparison against the reference sequence revealed 564 transpositions and 47 inversions comprising ∼3.6 Mb, in addition to 4.1 Mb of nonreference sequence, mostly originating from duplications. Although rearranged regions are not different in local divergence from colinear regions, they are drastically depleted for meiotic recombination in heterozygotes. Using a 1.2-Mb inversion as an example, we show that such rearrangement-mediated reduction of meiotic recombination can lead to genetically isolated haplotypes in the worldwide population of A. thaliana. Moreover, we found 105 single-copy genes, which were only present in the reference sequence or the Ler assembly, and 334 single-copy orthologs, which showed an additional copy in only one of the genomes. To our knowledge, this work gives first insights into the degree and type of variation, which will be revealed once complete assemblies will replace resequencing or other reference-dependent methods.
Despite evolutionary conserved mechanisms to silence transposable element activity, there are drastic differences in the abundance of transposable elements even among closely related plant species. We conducted a de novo assembly for the 375 Mb genome of the perennial model plant, Arabis alpina. Analysing this genome revealed long-lasting and recent transposable element activity predominately driven by Gypsy long terminal repeat retrotransposons, which extended the low-recombining pericentromeres and transformed large formerly euchromatic regions into repeat-rich pericentromeric regions. This reduced capacity for long terminal repeat retrotransposon silencing and removal in A. alpina co-occurs with unexpectedly low levels of DNA methylation. Most remarkably, the striking reduction of symmetrical CG and CHG methylation suggests weakened DNA methylation maintenance in A. alpina compared with Arabidopsis thaliana. Phylogenetic analyses indicate a highly dynamic evolution of some components of methylation maintenance machinery that might be related to the unique methylation in A. alpina.
The genetic and molecular analysis of trichome development in Arabidopsis thaliana has generated a detailed knowledge about the underlying regulatory genes and networks. However, how rapidly these mechanisms diverge during evolution is unknown. To address this problem, we used an unbiased forward genetic approach to identify most genes involved in trichome development in the related crucifer species Arabis alpina. In general, we found most trichome mutant classes known in A. thaliana. We identified orthologous genes of the relevant A. thaliana genes by sequence similarity and synteny and sequenced candidate genes in the A. alpina mutants. While in most cases we found a highly similar genephenotype relationship as known from Arabidopsis, there were also striking differences in the regulation of trichome patterning, differentiation, and morphogenesis. Our analysis of trichome patterning suggests that the formation of two classes of trichomes is regulated differentially by the homeodomain transcription factor AaGL2. Moreover, we show that overexpression of the GL3 basic helix-loop-helix transcription factor in A. alpina leads to the opposite phenotype as described in A. thaliana. Mathematical modeling helps to explain how this nonintuitive behavior can be explained by different ratios of GL3 and GL1 in the two species.Arabis alpina | trichomes | genetic analysis
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.