Measurements of mass spectral peak intensities and spectral counts are promising methods for quantifying protein abundance changes in shotgun proteomic analyses. We describe Serac, software developed to evaluate the ability of each method to quantify relative changes in protein abundance. Dynamic range and linearity using a three-dimensional ion trap were tested using standard proteins spiked into a complex sample. Linearity and good agreement between observed versus expected protein ratios were obtained after normalization and background subtraction of peak area intensity measurements and correction of spectral counts to eliminate discontinuity in ratio estimates. Peak intensity values useful for protein quantitation ranged from 10 7 to 10 11 counts with no obvious saturation effect, and proteins in replicate samples showed variations of less than 2-fold within the 95% range (؎2) when >3 peptides/protein were shared between samples. Protein ratios were determined with high confidence from spectral counts when maximum spectral counts were >4 spectra/protein, and replicates showed equivalent measurements well within 95% confidence limits. In further tests, complex samples were separated by gel exclusion chromatography, quantifying changes in protein abundance between different fractions. Linear behavior of peak area intensity measurements was obtained for peptides from proteins in different fractions. Protein ratios determined by spectral counting agreed well with those determined from peak area intensity measurements, and both agreed with independent measurements based on gel staining intensities. Overall spectral counting proved to be a more sensitive method for detecting proteins that undergo changes in abundance, whereas peak area intensity measurements yielded more accurate estimates of protein ratios. Finally these methods were used to analyze differential changes in protein expression in human erythroleukemia K562 cells stimulated under conditions that promote cell differentiation by mitogenactivated protein kinase pathway activation. Protein changes identified with p < 0
Identifying proteins in cell extracts by shotgun proteomics involves digesting the proteins, sequencing the resulting peptides by data-dependent mass spectrometry (MS/MS), and searching protein databases to identify the proteins from which the peptides are derived. Manual analysis and direct spectral comparison reveal that scores from two commonly used search programs (Sequest and Mascot) validate less than half of potentially identifiable MS/MS spectra (class positive) from shotgun analyses of the human erythroleukemia K562 cell line. Here we demonstrate increased sensitivity and accuracy using a focused search strategy along with a peptide sequence validation script that does not rely exclusively on XCorr or Mowse scores generated by Sequest or Mascot, but uses consensus between the search programs, along with chemical properties and scores describing the nature of the fragmentation spectrum (ion score and RSP). The approach yielded 4.2% false positive and 8% false negative frequencies in peptide assignments. The protein profile is then assembled from peptide assignments using a novel peptide-centric protein nomenclature that more accurately reports protein variants that contain identical peptide sequences. An Isoform Resolver algorithm ensures that the protein count is not inflated by variants in the protein database, eliminating approximately 25% of redundant proteins. Analysis of soluble proteins from a human K562 cells identified 5130 unique proteins, with approximately 100 false positive protein assignments.
Correct identification of a peptide sequence from MS/MS data is still a challenging research problem, particularly in proteomic analyses of higher eukaryotes where protein databases are large. The scoring methods of search programs often generate cases where incorrect peptide sequences score higher than correct peptide sequences (referred to as distraction). Because smaller databases yield less distraction and better discrimination between correct and incorrect assignments, we developed a method for editing a peptide-centric database (PC-DB) to remove unlikely sequences and strategies for enabling search programs to utilize this peptide database. Rules for unlikely missed cleavage and nontryptic proteolysis products were identified by data mining 11 849 high-confidence peptide assignments. We also evaluated ion exchange chromatographic behavior as an editing criterion to generate subset databases. When used to search a well-annotated test data set of MS/MS spectra, we found no loss of critical information using PC-DBs, validating the methods for generating and searching against the databases. On the other hand, improved confidence in peptide assignments was achieved for tryptic peptides, measured by changes in DeltaCN and RSP. Decreased distraction was also achieved, consistent with the 3-9-fold decrease in database size. Data mining identified a major class of common nonspecific proteolytic products corresponding to leucine aminopeptidase (LAP) cleavages. Large improvements in identifying LAP products were achieved using the PC-DB approach when compared with conventional searches against protein databases. These results demonstrate that peptide properties can be used to reduce database size, yielding improved accuracy and information capture due to reduced distraction, but with little loss of information compared to conventional protein database searches.
An important strategy for "shotgun proteomics" profiling involves solution proteolysis of proteins, followed by peptide separation using multidimensional liquid chromatography and automated sequencing by mass spectrometry (LC-MS/MS). Several protocols for extracting and handling membrane proteins for shotgun proteomics experiments have been reported, but few direct comparisons of different protocols have been reported. We compare four methods for preparing membrane proteins from human cells, using acid labile surfactants (ALS), urea, and mixed organic-aqueous solvents. These methods were compared with respect to their efficiency of protein solubilization and proteolysis, peptide and protein recovery, membrane protein enrichment, and peptide coverage of transmembrane proteins. Overall, ∼50-60% of proteins recovered were membrane-associated, identified from Gene Ontology annotations and transmembrane prediction software. Samples extracted with ALS, extracted with urea followed by dilution, or extracted with urea followed by desalting yielded comparable peptide recoveries and sequence coverage of transmembrane proteins. In contrast, suboptimal proteolysis was observed with organic solvent. Urea extraction followed by desalting may be a particularly useful approach, as it is less costly than ALS and yields satisfactory protein denaturation and proteolysis under conditions that minimize reactivity with urea-derived cyanate. Spectral counting was used to compare datasets of proteins from membrane samples with those of soluble proteins from K562 cells, and to estimate fold differences in protein abundances. Proteins most highly abundant in the membrane samples showed enrichment of integral membrane protein identifications, consistent with their isolation by differential centrifugation.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.