Hydroxyproline (Hyp) O-glycosylation characterizes the hydroxyproline-rich glycoprotein (HRGP) superfamily of the plant extracellular matrix. Hyp glycosylation occurs in two modes: Arabinosylation adds short oligoarabinosides (Hyp-arabinosides) while galactosylation leads to the addition of larger arabinogalactan polysaccharides (Hyp-polysaccharides). We hypothesize that sequence-dependent glycosylation of small peptide motifs results in glycomodules. These small functional units in combination with other repetitive peptide modules define the properties of HRGPs. The Hyp contiguity hypothesis predicts arabinosylation of contiguous Hyp residues and galactosylation of clustered noncontiguous Hyp residues. To determine the minimum level of Hyp contiguity that directs arabinosylation, we designed a series of synthetic genes encoding repetitive (Ser-Pro 2 ) n , (Ser-Pro 3 ) n , and (Ser-Pro 4 ) n . A signal sequence targeted these endogenous substrates to the endoplasmic reticulum/Golgi for post-translational proline hydroxylation and glycosylation in transformed Nicotiana tabacum cells. The fusion glycoproteins also contained green fluorescence protein, facilitating their detection and isolation. The (Ser-Pro 2 ) n and (Ser-Hyp 4 ) n fusion glycoproteins yielded Hyp-arabinosides but no Hyp-polysaccharide. The motif (Ser-Pro 3 ) n was incompletely hydroxylated, yielding mixed contiguous/noncontiguous Hyp and a corresponding mixture of Hyparabinosides and Hyp-polysaccharides. These results plus circular dichroic spectra of the glycosylated and deglycosylated (Ser-Pro 2 ) n , (Ser-Pro 3 ) n , and (Ser-Pro 4 ) n modules corroborate the Hyp contiguity hypothesis and indicate that Hyp O-glycosylation is indeed sequence-driven.
Hydroxyproline-rich glycoproteins (HRGPs)1 participate in the plant extracellular matrix as networks, exudates, and glycocalyx, comprising a superfamily that includes extensins (1), proline-rich proteins (PRPs) (2) and arabinogalactan-proteins (AGPs) (3). The three major families are distinguished by characteristic repetitive structural motifs: Ser-Hyp 4 in extensins, Pro-Hyp-Val-Tyr-Lys repeats and variants in PRPs, and XaaHyp-Xaa-Hyp repeats plus the presence of arabinogalactan polysaccharide in AGPs.The major post-translational modifications of HRGPs, proline hydroxylation and subsequent O-Hyp glycosylation, determine the properties of HRGPs to a greater or lesser extent. Carbohydrate accounts for as much as 95% of the hyperglycosylated AGPs and about 60% of extensins, thus forming the interactive molecular surface. In the lightly glycosylated PRPs, however, sugar may contribute as little as 1% of the mass.O-Hyp glycosylation occurs in two distinct modes, Hyp arabinosylation (4) and Hyp galactosylation (5), respectively. Hyp arabinosylation of virtually all HRGPs results in short (usually 1-4 residues), neutral, linear homooligosaccharides of L-arabinofuranose (Hyp-arabinosides). Hyp galactosylation, which is restricted to the AGPs, results in addition of much larger arabinogalactan heteropolysac...