This work discusses bioinformatics and experimental approaches to explore the human proteome, a constellation of proteins expressed in different tissues and organs. As the human proteome is not a static entity, it seems necessary to estimate the number of different protein species (proteoforms) and measure the number of copies of the same protein in a specific tissue. Here, meta-analysis of neXtProt knowledge base is proposed for theoretical prediction of the number of different proteoforms that arise from alternative splicing (AS), single amino acid polymorphisms (SAPs), and posttranslational modifications (PTMs). Three possible cases are considered: (1) PTMs and SAPs appear exclusively in the canonical sequences of proteins, but not in splice variants; (2) PTMs and SAPs can occur in both proteins encoded by canonical sequences and in splice variants; (3) all modification types (AS, SAP, and PTM) occur as independent events. Experimental validation of proteoforms is limited by the analytical sensitivity of proteomic technology. A bell-shaped distribution histogram was generated for proteins encoded by a single chromosome, with the estimation of copy numbers in plasma, liver, and HepG2 cell line. The proposed metabioinformatics approaches can be used for estimation of the number of different proteoforms for any group of protein-coding genes.
The alphabet of building blocks for RNA molecules is much larger than the standard four nucleotides. The diversity is achieved by the post-transcriptional biochemical modification of these nucleotides into distinct chemical entities that are structurally and functionally different from their unmodified counterparts. Some of these modifications are constituent and critical for RNA functions, while others serve as dynamic markings to regulate the fate of specific RNA molecules. Together, these modifications form the epitranscriptome, an essential layer of cellular biochemistry. As of the time of writing this review, more than 300 distinct RNA modifications from all three life domains have been identified. However, only a few of the most well-established modifications are included in most reviews on this topic. To provide a complete overview of the current state of research on the epitranscriptome, we analyzed the extent of the available information for all known RNA modifications. We selected 25 modifications to describe in detail. Summarizing our findings, we describe the current status of research on most RNA modifications and identify further developments in this field.
The final goal of the Russian part of the Chromosome-centric Human Proteome Project (C-HPP) was established as the analysis of the chromosome 18 (Chr 18) protein complement in plasma, liver tissue and HepG2 cells with the sensitivity of 10(-18) M. Using SRM, we have recently targeted 277 Chr 18 proteins in plasma, liver, and HepG2 cells. On the basis of the results of the survey, the SRM assays were drafted for 250 proteins: 41 proteins were found only in the liver tissue, 82 proteins were specifically detected in depleted plasma, and 127 proteins were mapped in both samples. The targeted analysis of HepG2 cells was carried out for 49 proteins; 41 of them were successfully registered using ordinary SRM and 5 additional proteins were registered using a combination of irreversible binding of proteins on CN-Br Sepharose 4B with SRM. Transcriptome profiling of HepG2 cells performed by RNAseq and RT-PCR has shown a significant correlation (r = 0.78) for 42 gene transcripts. A pilot affinity-based interactome analysis was performed for cytochrome b5 using analytical and preparative optical biosensor fishing followed by MS analysis of the fished proteins. All of the data on the proteome complement of the Chr 18 have been integrated into our gene-centric knowledgebase ( www.kb18.ru ).
Liver cancer is the third leading cause of cancer death worldwide. Representing such a dramatic impact on our lives, liver cancer is a significant public health concern. Sustainable and reliable methods for preventing and treating liver cancer require fundamental research on its molecular mechanisms. Cell lines are treated as in vitro equivalents of tumor tissues, making them a must-have for basic research on the nature of cancer. According to recent discoveries, certified cell lines retain most genetic properties of the original tumor and mimic its microenvironment. On the other hand, modern technologies allowing the deepest level of detail in omics landscapes have shown significant differences even between samples of the same cell line due to cross- and mycoplasma infection. This and other observations suggest that, in some cases, cell cultures are not suitable as cancer models, with limited predictive value for the effectiveness of new treatments. HepG2 is a popular hepatic cell line. It is used in a wide range of studies, from the oncogenesis to the cytotoxicity of substances on the liver. In this regard, we set out to collect up-to-date information on the HepG2 cell line to assess whether the level of heterogeneity of the cell line allows in vitro biomedical studies as a model with guaranteed production and quality.
We report the results obtained in 2012-2013 by the Russian Consortium for the Chromosome-centric Human Proteome Project (C-HPP). The main scope of this work was the transcriptome profiling of genes on human chromosome 18 (Chr 18), as well as their encoded proteome, from three types of biomaterials: liver tissue, the hepatocellular carcinoma-derived cell line HepG2, and blood plasma. The transcriptome profiling for liver tissue was independently performed using two RNaseq platforms (SOLiD and Illumina) and also by droplet digital PCR (ddPCR) and quantitative RT-PCR. The proteome profiling of Chr 18 was accomplished by quantitatively measuring protein copy numbers in the three types of biomaterial (the lowest protein concentration measured was 10(-13) M) using selected reaction monitoring (SRM). In total, protein copy numbers were estimated for 228 master proteins, including quantitative data on 164 proteins in plasma, 171 in the HepG2 cell line, and 186 in liver tissue. Most proteins were present in plasma at 10(8) copies/μL, while the median abundance was 10(4) and 10(5) protein copies per cell in HepG2 cells and liver tissue, respectively. In summary, for liver tissue and HepG2 cells a "transcriptoproteome" was produced that reflects the relationship between transcript and protein copy numbers of the genes on Chr 18. The quantitative data acquired by RNaseq, PCR, and SRM were uploaded into the "Update_2013" data set of our knowledgebase (www.kb18.ru) and investigated for linear correlations.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.