Abbreviations: AMOP, adhesion-associated domain in MUC4 and other proteins domain; CH, charge hydropathy; D 2 P 2 , database of disordered protein predictions; ECM, extracellular matrix; EGF, epidermal growth factor-like domain; IDP, intrinsically disordered protein; IDR, intrinsically disordered region; MoRF, molecular recognition feature; NIDO, nidogen-like domain; PPI, protein-protein interaction; PTM, posttranslational modification; PTS sequence, proline, threonine and serine sequence; SEA, sea-urchin sperm protein enterokinase and agrin module; TM, transmembrane; VNTR, variable number of tandem repeat domain; vWD, von Willebrand factor D domain.
AbstractMany members of the mucin family are evolutionarily conserved and are often aberrantly expressed and glycosylated in various benign and malignant pathologies leading to tumor invasion, metastasis, and immune evasion. The large size and extensive glycosylation present challenges to study the mucin structure using traditional methods, including crystallography. We offer the hypothesis that the functional versatility of mucins may be attributed to the presence of intrinsically disordered regions (IDRs) that provide dynamism and flexibility and that the IDRs offer potential therapeutic targets. Herein, we examined the links between the mucin structure and function based on IDRs, posttranslational modifications (PTMs), and potential impact on their interactome. Using sequence-based bioinformatics tools, we observed that mucins are predicted to be moderately (20%-40%) to highly (>40%) disordered and many conserved mucin domains could be disordered. Phosphorylation sites overlap with IDRs throughout the mucin sequences. Additionally, the majority of predicted O-and N-glycosylation sites in the tandem repeat regions occur within IDRs and these IDRs contain a large number of functional motifs, that is, molecular recognition features (MoRFs), which directly influence protein-protein interactions (PPIs).This investigation provides a novel perspective and offers an insight into the complexity and dynamic nature of mucins.
K E Y W O R D Sintrinsically disordered protein, glycoprotein, protein-protein interaction, protein structure 1940 | CARMICHEAL Et AL.