Abstract:We have compiled from literature and other sources a list of 1261 proteins believed to be differentially expressed in human cancer. These proteins, only some of which have been detected in plasma to date, represent a population of candidate plasma biomarkers that could be useful in early cancer detection and monitoring given suffi ciently sensitive specifi c assays. We have begun to prioritize these markers for future validation by frequency of literature citations, both total and as a function of time. The candidates include proteins involved in oncogenesis, angiogenesis, development, differentiation, proliferation, apoptosis, hematopoiesis, immune and hormonal responses, cell signaling, nucleotide function, hydrolysis, cellular homing, cell cycle and structure, the acute phase response and hormonal control. Many have been detected in studies of tissue or nuclear components; nevertheless we hypothesize that most if not all should be present in plasma at some level. Of the 1261 candidates only 9 have been approved as "tumor associated antigens" by the FDA. We propose that systematic collection and large-scale validation of candidate biomarkers would fi ll the gap currently existing between basic research and clinical use of advanced diagnostics.
We have merged four different views of the human plasma proteome, based on different methodologies, into a single nonredundant list of 1175 distinct gene products. The methodologies used were 1) literature search for proteins reported to occur in plasma or serum; 2) multidimensional chromatography of proteins followed by two-dimensional electrophoresis and mass spectroscopy (MS) identification of resolved proteins; 3) tryptic digestion and multidimensional chromatography of peptides followed by MS identification; and 4) tryptic digestion and multidimensional chromatography of peptides from low-molecularmass plasma components followed by MS identification. Of 1,175 nonredundant gene products, 195 were included in more than one of the four input datasets. Only 46 appeared in all four. Predictions of signal sequence and transmembrane domain occurrence, as well as Genome Ontology annotation assignments, allowed characterization of the nonredundant list and comparison of the data sources. The "nonproteomic" literature (468 input proteins) is strongly biased toward signal sequence-containing extracellular proteins, while the three proteomics methods showed a much higher representation of cellular proteins, including nuclear, cytoplasmic, and kinesin complex proteins. Cytokines and protein hormones were almost completely absent from the proteomics data (presumably due to low abundance), while categories like DNA-binding proteins were almost entirely absent from the literature data (perhaps unexpected and therefore not sought). Most major categories of proteins in the human proteome are represented in plasma, with the distribution at successively deeper layers shifting from mostly extracellular to a distribution more like the whole (primarily cellular) proteome. The resulting nonredundant list confirms the presence of a number of interesting candidate marker proteins in plasma and serum.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.