As proteome-wide C-terminal sequence analysis has been largely intractable, we developed a polymer-based enrichment approach to profile protein C-terminal peptides by mass spectrometry and identified hundreds of C-terminal peptides in the Escherichia coli proteome. We isotopically labeled GluC protease-digested and undigested samples and identified GluC substrates and their cleavage sites by quantification of neo-C-terminal peptides. Our method thus enables global annotation of protein C-terminal posttranslational modifications, including proteolytic truncations.
Astacins are secreted and membrane-bound metalloproteases with clear associations to many important pathological and physiological processes. Yet with only a few substrates described their biological roles are enigmatic. Moreover, the lack of knowledge of astacin cleavage site specificities hampers assay and drug development. Using PICS (proteomic identification of protease cleavage site specificity) and TAILS (terminal amine isotopic labeling of substrates) degradomics approaches >3000 cleavage sites were proteomically identified for five different astacins. Such broad coverage enables family-wide determination of specificities N- and C-terminal to the scissile peptide bond. Remarkably, meprin α, meprin β, and LAST_MAM proteases exhibit a strong preference for aspartate in the peptide (P)1′ position because of a conserved positively charged residue in the active cleft subsite (S)1′. This unparalleled specificity has not been found for other families of extracellular proteases. Interestingly, cleavage specificity is also strongly influenced by proline in P2′ or P3′ leading to a rare example of subsite cooperativity. This specificity characterizes the astacins as unique contributors to extracellular proteolysis that is corroborated by known cleavage sites in procollagen I+III, VEGF (vascular endothelial growth factor)-A, IL (interleukin)-1β, and pro-kallikrein 7. Indeed, cleavage sites in VEGF-A and pro-kallikrein 7 identified by terminal amine isotopic labeling of substrates matched those reported by Edman degradation. Moreover, the novel substrate FGF-19 was validated biochemically and shown to exhibit altered biological activity after meprin processing.
To link cleaved substrates in complex systems with a specific protease, the protease active site specificity is required. Proteomic identification of cleavage sites (PICS) simultaneously determines both the prime- and non-prime-side specificities of individual proteases through identification of hundreds of individual cleavage sequences from biologically relevant, proteome-derived peptide libraries. PICS also identifies subsite cooperativity. To generate PICS peptide libraries, cellular proteomes are digested with a specific protease such as trypsin. Following protease inactivation, primary amines are protected. After incubation with a test protease, each prime-side cleavage fragment has a free newly formed N-terminus, which is biotinylated for affinity isolation and identification by liquid chromatography-tandem mass spectrometry. The corresponding non-prime sequences are derived bioinformatically. The step-by-step protocol also presents a web service for PICS data analysis, as well as introducing and validating PICS peptide libraries made from Escherichia coli.
BackgroundType II transmembrane serine proteases (TTSPs) are a family of cell membrane tethered serine proteases with unclear roles as their cleavage site specificities and substrate degradomes have not been fully elucidated. Indeed just 52 cleavage sites are annotated in MEROPS, the database of proteases, their substrates and inhibitors.Methodology/Principal FindingTo profile the active site specificities of the TTSPs, we applied Proteomic Identification of protease Cleavage Sites (PICS). Human proteome-derived database searchable peptide libraries were assayed with six human TTSPs (matriptase, matriptase-2, matriptase-3, HAT, DESC and hepsin) to simultaneously determine sequence preferences on the N-terminal non-prime (P) and C-terminal prime (P’) sides of the scissile bond. Prime-side cleavage products were isolated following biotinylation and identified by tandem mass spectrometry. The corresponding non-prime side sequences were derived from human proteome databases using bioinformatics. Sequencing of 2,405 individual cleaved peptides allowed for the development of the family consensus protease cleavage site specificity revealing a strong specificity for arginine in the P1 position and surprisingly a lysine in P1′ position. TTSP cleavage between R↓K was confirmed using synthetic peptides. By parsing through known substrates and known structures of TTSP catalytic domains, and by modeling the remainder, structural explanations for this strong specificity were derived.ConclusionsDegradomics analysis of 2,405 cleavage sites revealed a similar and characteristic TTSP family specificity at the P1 and P1′ positions for arginine and lysine in unfolded peptides. The prime side is important for cleavage specificity, thus making these proteases unusual within the tryptic-enzyme class that generally has overriding non-prime side specificity.
Proteome-wide analysis of protein C-termini has long been inaccessible, but is now enabled by a newly developed negative selection strategy we term C-terminomics. In this procedure, amine- and carboxyl groups of full-length proteins are chemically protected. After trypsin digestion, N-terminal and internal tryptic peptides - but not C-terminal peptides - posses newly formed, unprotected C-termini that are removed by coupling to the high-molecular-weight polymer poly-allylamine. Ultrafiltration separates the uncoupled, blocked C-terminal peptides that are subsequently analyzed by liquid chromatography-tandem mass spectrometry. On a proteome-wide scale, this strategy profiles native protein C-termini together with neo C-termini generated by endoproteolytic cleavage or processive C-terminal truncations ("ragging"). In bacterial proteomes, hundreds of protein C-termini were identified. Stable isotope labeling enables -quantitative comparison of protein C-termini and C-terminal processing in different samples. Using formaldehyde-based chemical labeling, this quantitative approach termed "carboxy-terminal amine-based isotope labeling of substrates (C-TAILS)" identified >100 cleavage sites of exogenously applied GluC protease in an Escherichia coli proteome. C-TAILS complements recently developed N-terminomic techniques for endoprotease substrate discovery and is essential for the characterization of carboxyprotease processing.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.