The Arabidopsis (Arabidopsis thaliana) genome is the most well-annotated plant genome. However, transcriptome sequencing in Arabidopsis continues to suggest the presence of polyadenylated (polyA) transcripts originating from presumed intergenic regions. It is not clear whether these transcripts represent novel noncoding or protein-coding genes. To understand the nature of intergenic polyA transcription, we first assessed its abundance using multiple messenger RNA sequencing data sets. We found 6,545 intergenic transcribed fragments (ITFs) occupying 3.6% of Arabidopsis intergenic space. In contrast to transcribed fragments that map to protein-coding and RNA genes, most ITFs are significantly shorter, are expressed at significantly lower levels, and tend to be more data set specific. A surprisingly large number of ITFs (32.1%) may be protein coding based on evidence of translation. However, our results indicate that these "translated" ITFs tend to be close to and are likely associated with known genes. To investigate if ITFs are under selection and are functional, we assessed ITF conservation through cross-species as well as within-species comparisons. Our analysis reveals that 237 ITFs, including 49 with translation evidence, are under strong selective constraint and relatively distant from annotated features. These ITFs are likely parts of novel genes. However, the selective pressure imposed on most ITFs is similar to that of randomly selected, untranscribed intergenic sequences. Our findings indicate that despite the prevalence of ITFs, apart from the possibility of genomic contamination, many may be background or noisy transcripts derived from "junk" DNA, whose production may be inherent to the process of transcription and which, on rare occasions, may act as catalysts for the creation of novel genes.
LORELEI and at least three members of the REVEILLE transcription factor family are important for LORELEI expression in synergid cells of the Arabidopsis female gametophyte.
Synergid cells in the micropylar end of the female gametophyte are required for critical cell-cell signaling interactions between the pollen tube and the ovule that precede double fertilization and seed formation in flowering plants. LORELEI (LRE) encodes a GPI-anchored protein that is expressed primarily in the synergid cells, and together with FERONIA, a receptor-like kinase, it controls pollen tube reception by the receptive synergid cell. Still, how LRE expression is controlled in synergid cells remains poorly characterized. We identified candidate cis-regulatory elements enriched in LRE and other synergid cell-expressed genes. One of the candidate motifs (‘TAATATCT’) in the LRE promoter was an uncharacterized variant of the Evening Element motif that we named as the Short Evening Element-like (SEEL) motif. Deletion or point mutations in the SEEL motif of the LRE promoter resulted in decreased reporter expression in synergid cells, demonstrating that the SEEL motif is important for expression of LRE in synergid cells. Additionally, we found that LRE expression is decreased in the loss of function mutants of REVEILLE (RVE) transcription factors, which are clock genes known to bind the SEEL and other closely related motifs. We propose that RVE transcription factors regulate LRE expression in synergid cells by binding to the SEEL motif in the LRE promoter. Identification of a cis-regulatory element and transcription factors involved in the expression of LRE will serve as a foundation to characterize the gene regulatory networks in synergid cells and investigate the potential connection between circadian rhythm and fertilization.One sentence summaryA newly identified SEEL motif in the promoter of LORELEI and at least three members of the REVEILLE transcription factor family are important for LORELEI expression in synergid cells of the Arabidopsis female gametophyte.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.