Photo-induced cross-linking is a mainstay technique to characterize RNA-protein interactions. However, UV-induced cross-linking between RNA and proteins at “zero-distance” is poorly understood. Here, we investigate cross-linking of the RBFOX alternative splicing factor with its hepta-ribonucleotide binding element as a model system. We examine the influence of nucleobase, nucleotide position and amino acid composition using CLIR-MS technology (crosslinking-of-isotope-labelled-RNA-and-tandem-mass-spectrometry), that locates cross-links on RNA and protein with site-specific resolution. Surprisingly, cross-linking occurs only at nucleotides that are π-stacked to phenylalanines. Notably, this π-stacking interaction is also necessary for the amino-acids flanking phenylalanines to partake in UV-cross-linking. We confirmed these observations in several published datasets where cross-linking sites could be mapped to a high resolution structure. We hypothesize that π-stacking to aromatic amino acids activates cross-linking in RNA-protein complexes, whereafter nucleotide and peptide radicals recombine. These findings will facilitate interpretation of cross-linking data from structural studies and from genome-wide datasets generated using CLIP (cross-linking-and-immunoprecipitation) methods.
Cross-linking coupled with mass spectrometry is an increasingly popular methodology for elucidating structural information from biological complexes. Whilst protein-protein cross-linking workflows are widely used and well characterised, adoption of protein-RNA cross-linking workflows for structural studies is less widespread, and data produced from such experiments remains less well understood. The cross-linking of stable isotope labelled RNA coupled to mass spectrometry (CLIR-MS) workflow uses isotope labelled RNA to simultaneously confirm that peptides are cross-linked to RNA and aid cross-link localisation in an RNA sequence. For broader application of CLIR-MS as part of the structural analysis of ribonucleoproteins, the method must be sensitive, robust, and its reaction products need to be well characterised. We enhanced our previously published workflow, improving coverage and sensitivity. We used it to infer common properties of protein-RNA cross-links such as cross-linking distance, and to assess the impact of substitution of uracil with 4-thio-uracil in structural proteomics experiments. We profiled the compositional diversity of RNA-derived peptide modifications, and subsequently defined a more inclusive data analysis approach which more than doubles the number of cross-link spectrum matches compared with our past work. We defined distance restraints from these cross-links, and with the aid of visualisation software, demonstrated that on their own they provide sufficient information to localise an RNA chain to the correct position on the surface of a protein. We applied our enhanced workflow and understanding to characterise the binding interface of several protein-RNA complexes containing classical and uncommon RNA binding domains. The enhanced sensitivity and understanding demonstrated here underpin a wider adoption of protein-RNA cross-linking in structural biology.
In mammals, the structural basis for the interaction between U1 and U2 small nuclear ribonucleoproteins (snRNPs) during the early steps of splicing is still elusive. The binding of the ubiquitin-like (UBL) domain of SF3A1 to the stem-loop 4 of U1 snRNP (U1-SL4) contributes to this interaction. Here, we determined the 3D structure of the complex between the UBL of SF3A1 and U1-SL4 RNA. Our crystallography, NMR spectroscopy, and cross-linking mass spectrometry data show that SF3A1-UBL recognizes, sequence specifically, the GCG/CGC RNA stem and the apical UUCG tetraloop of U1-SL4. In vitro and in vivo mutational analyses support the observed intermolecular contacts and demonstrate that the carboxyl-terminal arginine-glycine-glycine-arginine (RGGR) motif of SF3A1-UBL binds sequence specifically by inserting into the RNA major groove. Thus, the characterization of the SF3A1-UBL/U1-SL4 complex expands the repertoire of RNA binding domains and reveals the capacity of RGG/RG motifs to bind RNA in a sequence-specific manner.
RNA−protein interactions mediate many intracellular processes. CLIR-MS (cross-linking of isotope-labeled RNA and tandem mass spectrometry) allows the identification of RNA−protein interaction sites at single nucleotide/amino acid resolution in a single experiment. Using isotopically labeled RNA segments for UV-light-induced cross-linking generates characteristic isotope patterns that constrain the sequence database searches, increasing spatial resolution. Whereas the use of segmentally isotopically labeled RNA is effective, it is technically involved and not applicable in some settings, e.g., in cell or tissue samples. Here we introduce an extension of the CLIR-MS workflow that uses unlabeled RNA during cross-linking and subsequently adds an isotopic label during sample preparation for MS analysis. After RNase and protease digests of a cross-linked complex, the nucleic acid part of a peptide−RNA conjugate is labeled using the enzyme T4 polynucleotide kinase and a 1:1 mixture of heavy 18 O 4 -γ-ATP and light ATP. In this simple, one-step reaction, three heavy oxygen atoms are transferred from the γphosphate to the 5′-end of the RNA, introducing an isotopic shift of 6.01 Da that is detectable by mass spectrometry. We applied this approach to the RNA recognition motif (RRM) of the protein FOX1 in complex with its cognate binding substrate, FOXbinding element (FBE) RNA. We also labeled a single phosphate within an RNA and unambiguously determined the cross-linking site of the FOX1-RRM binding to FBE at single residue resolution on the RNA and protein level and used differential ATP labeling for relative quantification based on isotope dilution. Data are available via ProteomeXchange with the identifier PXD024010.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.