HnRNP C is a ubiquitous RNA regulatory factor and the principal constituent of the nuclear hnRNP core particle. The protein contains one amino-terminal RNA recognition motif (RRM) known to bind uridine (U)-rich sequences. This work provides a molecular and mechanistic understanding of this interaction. We solved the solution structures of the RRM in complex with poly(U) oligomers of five and seven nucleotides. The five binding pockets of RRM recognize uridines with an unusual 5'-to-3' gradient of base selectivity. The target recognition is therefore strongly sensitive to base clustering, explaining the preference for contiguous uridine tracts. Using a novel approach integrating the structurally derived recognition consensus of the RRM with a thermodynamic description of its multi-register binding, we modeled the saturation of cellular uridine tracts by this protein. The binding pattern is remarkably consistent with the experimentally observed transcriptome-wide cross-link distribution of the full-length hnRNP C on short uridine tracts. This result re-establishes the RRM as the primary RNA-binding domain of the hnRNP C tetramer and provides a proof of concept for interpreting high-throughput interaction data using structural approaches.
The human hnRNP C is a ubiquitous cellular protein involved in mRNA maturation. Recently, we have shown that this protein specifically recognizes uridine (U) pentamers through its single RNA recognition motif (RRM). However, a large fraction of natural RNA targets of hnRNP C consists of much longer contiguous uridine stretches. To understand how these extended sites are recognized, we studied the binding of the RRM to U-tracts of 8-11 bases. In vivo investigation of internal translation activation of unr (upstream of N-ras) mRNA indicates that the conservation of the entire hnRNP C binding site, UC(U) 8 , is required for hnRNP C-dependent IRES activation. The assays further suggest a synergistic interplay between hnRNP C monomers, dependent on the protein's ability to oligomerize. In vitro spectroscopic and thermodynamic analyses show that isolated RRMs bind to (U) 11 oligomers as dimers. Structural modeling of a ternary double-RRM/RNA complex indicates additionally that two RRM copies can be accommodated on the canonical sequence UC(U) 8 . The proposed tandem RRM binding is in very good agreement with the transcriptome-wide recognition of extended U-tracts by full-length hnRNP C, which displays a cross-linking pattern consistent with a positively cooperative RRM dimer binding model.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.