Upstream open reading frames (uORFs) are established as cis-acting elements for eukaryotic translation of annotated ORFs (anORFs) located on the same mRNAs. Here, we identified a mammalian uORF with functions that are independent from anORF translation regulation. Bioinformatics screening using ribosome profiling data of human and mouse brains yielded 308 neurologically vital genes from which anORF and uORFs are polycistronically translated in both species. Among them, Arhgef9 contains a uORF named SPICA, which is highly conserved among vertebrates and stably translated only in specific brain regions of mice. Disruption of SPICA translation by ATG-to-TAG substitutions did not perturb translation or function of its anORF product, collybistin. SPICA-null mice displayed abnormal maternal reproductive performance and enhanced anxiety-like behavior, characteristic of ARHGEF9-associated neurological disorders. This study demonstrates that mammalian uORFs can be independent genetic units, revising the prevailing dogma of the monocistronic gene in mammals, and even eukaryotes.
Keywordsgene expression, upstream open reading frame (uORF), polycistronic translation, behavioral control
RESULTS
Bicistronic Genes in Human and Mouse BrainsWe first used Ribo-seq data for human and mouse brains (Gonzalez et al., 2014) to search for mammalian mRNAs from which uORF regions and anORFs on individual mRNAs were co-translated at equivalent levels, with the assumption that some uORFs on these mRNAs could be translated in a manner that is not coordinated with anORF translation ( Figure 1A). We obtained 13,602 human and 8,246 mouse mRNAs that are translated mRNAs in the brain, to which either an anORF or merged uORF ( Figure 1B), or both were mapped with reads of at least 1 read per kilobase per million mapped reads (RPKM). Of these, 4,728 human and 4,767 mouse mRNAs (2,522 human and 3,173 mouse genes, respectively) were loaded with translating ribosomes at equivalent densities on both anORFs and the merged uORFs (termed "polycistronic genes"; Figure 1, C and D).Gene ontology (GO) enrichment analysis showed that these polycistronic genes were significantly associated with neural GO terms in both species, especially terms related to the synapse, as compared with all the 13,602 human and 8,246 mouse mRNAs that are translated mRNAs in the brain as the background (Figure 1, E and F). This implies that some of these brain-specific polycistronic genes might be evolutionarily adapted to use uORF translation for establishing and/or regulating synaptic communication underlying brain function.The 1,439 genes that were polycistronically translated in both human and mouse brains were then screened for genes that were previously shown to regulate brain functions by knockout mouse studies, as specified in the Mouse Genome Informatics database (Blake et al., 2009), which resulted in the identification of 308 genes. After filtering the 308 genes based on their length and conservation of amino acid sequences coded in the uORFs (≥31 codons in length and ≥ ...