SummaryBiochemical and cytogenetic experiments have led to the hypothesis that eukaryotic chromatin is organized into a series of distinct domains that are functionally independent. Two expectations of this hypothesis are: (i) adjacent genes are more frequently co-expressed than is expected by chance; and (ii) co-expressed neighbouring genes are often functionally related. Here we report that over 10% of Arabidopsis thaliana genes are within large, co-expressed chromosomal regions. Two per cent (497/22 520) of genes are highly coexpressed (r > 0.7), about five times the number expected by chance. These genes fall into 226 groups distributed across the genome, and each group typically contains two to three genes. Among the highly coexpressed groups, 40% (91/226) have genes with high amino acid sequence similarity. Nonetheless, duplicate genes alone do not explain the observed levels of co-expression. Co-expressed, non-homologous genes are transcribed in parallel, share functions, and lie close together more frequently than expected. Our results show that the A. thaliana genome contains domains of gene expression. Small domains have highly co-expressed genes that often share functional and sequence similarity and are probably co-regulated by nearby regulatory sequences. Genes within large, significantly correlated groups are typically co-regulated at a low level, suggesting the presence of large chromosomal domains.
Pairs of genes within eukaryotic genomes are often located on opposite DNA strands such that transcription generates cisnatural sense antisense transcripts (cis-NATs). This orientation of genes has been associated with the biogenesis of splice variants and natural antisense small RNAs. Here, in an analysis of currently available data, we report that within Arabidopsis (Arabidopsis thaliana), protein-coding cis-NATs are also characterized by high abundance, high coexpression, and broad expression. Our results suggest that a permissive chromatin environment may have led to the proximity of these genes. Compared with other genes, cis-NAT-encoding genes have enriched low-nucleosome-density regions, high levels of histone H3 lysine-9 acetylation, and low levels of H3 lysine-27 trimethylation. Promoters associated with broadly expressed genes are preferentially found in the 59 regulatory sequences of cis-NAT-encoding genes. Our results further suggest that natural antisense small RNA production from cis-NATs is limited. Small RNAs sequenced from natural antisense small RNA biogenesis mutants including dcl1, dcl2, dcl3, and rdr6 map to cis-NATs as frequently as small RNAs sequenced from wild-type plants. Future work will investigate if the positive transcriptional regulation of overlapping protein-coding genes contributes to the prevalence of these genes within other eukaryotic genomes.
microRNAs (miRNAs) are small, endogenous RNAs of 20∼25 nucleotides, processed from stem-loop regions of longer RNA precursors. Plant miRNAs act as negative regulators of target mRNAs predominately by slicing target transcripts, and a number of miRNAs play important roles in development. We analyzed a number of published datasets from Arabidopsis thaliana to characterize novel miRNAs, novel miRNA targets, and miRNA-regulated developmental changes in gene expression. These data include microarray profiling data and small RNA (sRNA) deep sequencing data derived from miRNA biogenesis/transport mutants, microarray profiling data of mRNAs in a developmental series, and computational predictions of conserved genomic stem-loop structures. Our conservative analyses identified five novel mature miRNAs and seven miRNA targets, including one novel target gene. Two complementary miRNAs that target distinct mRNAs were encoded by one gene. We found that genes targeted by known miRNAs, and genes up-regulated or down-regulated in miRNA mutant inflorescences, are highly expressed in the wild type inflorescence. In addition, transcripts upregulated within the mutant inflorescences were abundant in wild type leaves and shoot meristems and low in pollen and seed. Downregulated transcripts were abundant in wild type pollen and seed and low in shoot meristems, roots and leaves. Thus, disrupting miRNA function causes the inflorescence transcriptome to resemble the leaf and meristem and to differ from pollen and seed. Applications of our computational approach to other species and the use of more liberal criteria than reported here will further expand the number of identified miRNAs and miRNA targets. Our findings suggest that miRNAs have a global role in promoting vegetative to reproductive transitions in A. thaliana.
Background Genetic variation for gene expression is a source of phenotypic variation for natural and agricultural species. The common approach to map and to quantify gene expression from genetically distinct individuals is to assign their RNA-seq reads to a single reference genome. However, RNA-seq reads from alleles dissimilar to this reference genome may fail to map correctly, causing transcript levels to be underestimated. Presently, the extent of this mapping problem is not clear, particularly in highly diverse species. We investigated if mapping bias occurred and if chromosomal features associated with mapping bias. Zea mays presents a model species to assess these questions, given it has genotypically distinct and well-studied genetic lines. Results In Zea mays, the inbred B73 genome is the standard reference genome and template for RNA-seq read assignments. In the absence of mapping bias, B73 and a second inbred line, Mo17, would each have an approximately equal number of regulatory alleles that increase gene expression. Remarkably, Mo17 had 2–4 times fewer such positively acting alleles than did B73 when RNA-seq reads were aligned to the B73 reference genome. Reciprocally, over one-half of the B73 alleles that increased gene expression were not detected when reads were aligned to the Mo17 genome template. Genes at dissimilar chromosomal ends were strongly affected by mapping bias, and genes at more similar pericentromeric regions were less affected. Biased transcript estimates were higher in untranslated regions and lower in splice junctions. Bias occurred across software and alignment parameters. Conclusions Mapping bias very strongly affects gene transcript abundance estimates in maize, and bias varies across chromosomal features. Individual genome or transcriptome templates are likely necessary for accurate transcript estimation across genetically variable individuals in maize and other species.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.