The final goal of the Russian part of the Chromosome-centric Human Proteome Project (C-HPP) was established as the analysis of the chromosome 18 (Chr 18) protein complement in plasma, liver tissue and HepG2 cells with the sensitivity of 10(-18) M. Using SRM, we have recently targeted 277 Chr 18 proteins in plasma, liver, and HepG2 cells. On the basis of the results of the survey, the SRM assays were drafted for 250 proteins: 41 proteins were found only in the liver tissue, 82 proteins were specifically detected in depleted plasma, and 127 proteins were mapped in both samples. The targeted analysis of HepG2 cells was carried out for 49 proteins; 41 of them were successfully registered using ordinary SRM and 5 additional proteins were registered using a combination of irreversible binding of proteins on CN-Br Sepharose 4B with SRM. Transcriptome profiling of HepG2 cells performed by RNAseq and RT-PCR has shown a significant correlation (r = 0.78) for 42 gene transcripts. A pilot affinity-based interactome analysis was performed for cytochrome b5 using analytical and preparative optical biosensor fishing followed by MS analysis of the fished proteins. All of the data on the proteome complement of the Chr 18 have been integrated into our gene-centric knowledgebase ( www.kb18.ru ).
We report the results obtained in 2012-2013 by the Russian Consortium for the Chromosome-centric Human Proteome Project (C-HPP). The main scope of this work was the transcriptome profiling of genes on human chromosome 18 (Chr 18), as well as their encoded proteome, from three types of biomaterials: liver tissue, the hepatocellular carcinoma-derived cell line HepG2, and blood plasma. The transcriptome profiling for liver tissue was independently performed using two RNaseq platforms (SOLiD and Illumina) and also by droplet digital PCR (ddPCR) and quantitative RT-PCR. The proteome profiling of Chr 18 was accomplished by quantitatively measuring protein copy numbers in the three types of biomaterial (the lowest protein concentration measured was 10(-13) M) using selected reaction monitoring (SRM). In total, protein copy numbers were estimated for 228 master proteins, including quantitative data on 164 proteins in plasma, 171 in the HepG2 cell line, and 186 in liver tissue. Most proteins were present in plasma at 10(8) copies/μL, while the median abundance was 10(4) and 10(5) protein copies per cell in HepG2 cells and liver tissue, respectively. In summary, for liver tissue and HepG2 cells a "transcriptoproteome" was produced that reflects the relationship between transcript and protein copy numbers of the genes on Chr 18. The quantitative data acquired by RNaseq, PCR, and SRM were uploaded into the "Update_2013" data set of our knowledgebase (www.kb18.ru) and investigated for linear correlations.
In this study, to reduce the number of major plasma components, we examined thermostable plasma fractions to search for a biomarker of ovarian cancer. An apparent cancer biomarker of 11.7 kDa was detected in these fractions using ProteinChip SELDI-TOF mass spectrometry system. This peak invariably appeared with another close peak of about 11.5 kDa, suggesting that it is a derivative of a larger mass molecule. Of 27 cancer plasma specimens, 15 (55.6%) demonstrated this peak pair, whereas only 2 of 34 controls specimens (5.8%) were shown to express it with low intensity. Using a method involving cysteine modification by 4-vinylpyridine (4-VP), 2-DE and HPLC, these peaks were identified by mass spectrometry as serum amyloid A1 (11.68 kDa) and its N-terminal arginine-truncated form (11.52 kDa).
Virtual and experimental 2DE coupled with ESI LC-MS/MS was introduced to obtain better representation of the information about human proteome. The proteins from HEPG2 cells and human blood plasma were run by 2DE. After staining and protein spot identification by MALDI-TOF MS, the protein maps were generated. The experimental physicochemical parameters (pI/Mw) of the proteoforms further detected by ESI LC-MS/MS in these spots were obtained. Next, the theoretical pI and Mw of identified proteins were calculated using program Compute pI/Mw (http://web.expasy.org/compute_pi/pi_tool-doc.html). Accordingly, the relationship between theoretical and experimental parameters was analyzed, and the correlation plots were built. Additionally, virtual/experimental information about different protein species/proteoforms from the same genes was extracted. As it was revealed from the plots, the major proteoforms detected in HepG2 cell line have pI/Mw parameters similar to theoretical values. In opposite, the minor protein species have mainly very different from theoretical pI and Mw parameters. A similar situation was observed in plasma in much higher degree. It means that minor protein species are heavily modified in cell and even more in plasma proteome.
A gene-centric approach was applied for a large-scale study of expression products of a single chromosome. Transcriptome profiling of liver tissue and HepG2 cell line was independently performed using two RNA-Seq platforms (SOLiD and Illumina) and also by Droplet Digital PCR (ddPCR) and quantitative RT-PCR. Proteome profiling was performed using shotgun LC-MS/MS as well as selected reaction monitoring with stable isotope-labeled standards (SRM/SIS) for liver tissue and HepG2 cells. On the basis of SRM/SIS measurements, protein copy numbers were estimated for the Chromosome 18 (Chr 18) encoded proteins in the selected types of biological material. These values were compared with expression levels of corresponding mRNA. As a result, we obtained information about 158 and 142 transcripts for HepG2 cell line and liver tissue, respectively. SRM/SIS measurements and shotgun LC-MS/MS allowed us to detect 91 Chr 18-encoded proteins in total, while an intersection between the HepG2 cell line and liver tissue proteomes was ∼66%. In total, there were 16 proteins specifically observed in HepG2 cell line, while 15 proteins were found solely in the liver tissue. Comparison between proteome and transcriptome revealed a poor correlation (R ≈ 0.1) between corresponding mRNA and protein expression levels. The SRM and shotgun data sets (obtained during 2015-2016) are available in PASSEL (PASS00697) and ProteomeExchange/PRIDE (PXD004407). All measurements were also uploaded into the in-house Chr 18 Knowledgebase at http://kb18.ru/protein/matrix/416126 .
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.