This is a PDF file of an article that has undergone enhancements after acceptance, such as the addition of a cover page and metadata, and formatting for readability, but it is not yet the definitive version of record. This version will undergo additional copyediting, typesetting and review before it is published in its final form, but we are providing this version to give early visibility of the article. Please note that, during the production process, errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
Ser/Arg-rich splicing factor 1 (SRSF1 or ASF/SF2) is
the prototypical
member of SR proteins. SRSF1 binds to exonic splicing enhancers, which
prompts inclusion of corresponding exons in the mature mRNA. The RNA-binding
domain of SRSF1 consists of tandem RNA-recognition motifs (RRM1 and
RRM2) separated by a 30 amino acid long linker. In this study, we
investigate roles of RRM1, RRM2, and the linker in RNA binding. We
find that although both RRMs are crucial to RNA binding, RRM2 plays
the dominant role. The linker mildly contributes to RNA binding and
remains flexible in the RNA-bound state. Flexibility of the linker
allows the RRM1-cognate motif to be either upstream or downstream
of the RRM2-cognate motif. In addition, we find that the spacer length
between the bipartite motifs varies from 0 to 10 nucleotides. Our
binding assays reveal that SRSF1 prefers RNA sequences with shorter
spacers and the RRM1-cognate motif being placed upstream. Restrained
by nuclear magnetic resonance data, we simulate RNA-bound complexes
and demonstrate how tandem RRMs bind to RNA of different spacer lengths
and swapped bipartite motifs. We find that when the RRM1-cognate motif
is placed downstream, either the RRM1/RRM2 linker needs to be more
extended or RNA needs to form a U turn, which may reduce conformational
entropy. Our study suggests that the RNA-binding specificity of SRSF1
is broader than traditionally recapitulated by consensus sequences
of 7 to 8 nucleotides. Instead, centered on the RRM2-cognate motif,
an RNA fragment encompassing 10-nucleotide upstream and downstream
should be scrutinized.
Phase separation plays crucial roles in both sustaining cellular function and perpetuating disease states. Despite extensive studies, our understanding of this process is hindered by low solubility of phase-separating proteins. One example of this is found in SR and SR-related proteins. These proteins are characterized by domains rich in arginine and serine (RS domains), which are essential to alternative splicing and in vivo phase separation. However, they are also responsible for a low solubility that has made these proteins difficult to study for decades. Here, we solubilize the founding member of the SR family, SRSF1, by introducing a peptide mimicking RS repeats as a co-solute. We find that this RS-mimic peptide forms interactions similar to those of the protein's RS domain. Both interact with a combination of surface-exposed aromatic residues and acidic residues on SRSF1's RNA Recognition Motifs (RRMs) through electrostatic and cation-pi interactions. Analysis of RRM domains from human SR proteins indicates that these sites are conserved across the protein family. In addition to opening an avenue to previously unavailable proteins, our work provides insight into how SR proteins phase separate and participate in nuclear speckles.
Phase separation plays crucial roles in both sustaining cellular function and perpetuating disease states. Despite extensive studies, our understanding of this process is hindered by low solubility of phase-separating proteins. One example of this is found in SR proteins. These proteins are characterized by domains righ in arginine and serine (RS domains), which are essential to alternative splicing, in vivo phase separation, and a low solubility that has made these proteins difficult to study for decades. Here, we solubilize the founding member of the SR family, SRSF1, by introducing a peptide mimicking RS repeats as a cosolute. We find that this RS-mimic peptide forms interactions similar to those of the protein's RS domain. Both interact with a combination of surface-exposed aromatic residues and acidic residues on SRSF1's RNA Recognition Motifs (RRMs) through simultaneous electrostatic and cation-pi bonding. Analysis of RRM domains spanning the human proteome indicates that RRM domains involved in phase separation have more exposed aromatic residues. Further, in phase-separating proteins containing RS repeats, exposed aromatic residues are frequently surrounded by acidic residues. Our work provides insight into how SR proteins phase separate and opens an avenue to a range of previously unavailable proteins.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.