Tauopathies are neurodegenerative diseases that affect millions of people worldwide including those with Alzheimer's disease. While many efforts have focused on understanding the role of tau protein in neurodegeneration, there has been little done to systematically analyze and study the structures within tau's encoding RNA and their connection to disease pathology. Knowledge of RNA structure can provide insights into disease mechanisms and how to affect protein production for therapeutic benefit. Using computational methods based on thermodynamic stability and evolutionary conservation, we identified structures throughout the tau pre-mRNA, especially at exonintron junctions and within the 5′ and 3′ untranslated regions (UTRs). In particular, structures were identified at twenty exon-intron junctions. The 5′ UTR contains one structured region, which lies within a known internal ribosome entry site. The 3′ UTR contains eight structured regions, including one that contains a polyadenylation signal.A series of functional experiments were carried out to assess the effects of mutations associated with mis-regulation of alternative splicing of exon 10 and to identify regions of the 3′ UTR that contain cis-regulatory elements. These studies defined novel structural regions within the mRNA that affect stability and pre-mRNA splicing and may lead to new therapeutic targets for treating tau-associated diseases.
ABBREVIATIONSAD, Alzheimer's disease; APA, alternative polyadenylation; APSI, average pairwise sequence identity; bp, base pair; DMEM, Dulbecco's Modified Eagle Medium; ED, ensemble diversity; FTDP-17, frontotemporal dementia with parkinsonism-17; htra2β1, transformer 2 beta homolog 1; IRES, internal ribosome entry site; ITAF, IRES trans-acting factor; MAFFT, Multiple Alignment using Fast Fourier Transform; MAPT, microtubule associated protein tau; MBD, microtubule binding domain; MFE, minimum free energy; miRNA, microRNA; mRNA, messenger RNA; mTOR, mammalian target of rapamycin; MSTD, multiple system tauopathy with presenile dementia; ncRNA, noncoding RNA; nt, nucleotide; PD, Parkinson's disease; RNP, ribonucleoprotein;rRNA, ribosomal RNA; SCI, structure conservation index; SD, standard deviation; SF2, serine and arginine rich splicing factor 1, also known as SRSF1; SHAPE, selective 2′ hydroxyl acylation analyzed by primer extension; snRNA, small nuclear RNA; SRp30c, serine and arginine rich splicing factor 9, also known as SRSF9; SRp40, serine and arginine rich splicing factor 5, also known as SRSF5; SRp54, signal recognition particle 54; SRp55, serine and arginine rich splicing factor 6, also known as SRSF6; SVM, support vector machine; UTR, untranslated region