The genome sequencing of H37Rv strain of Mycobacterium tuberculosis was completed in 1998 followed by the whole genome sequencing of a clinical isolate, CDC1551 in 2002. Since then, the genomic sequences of a number of other strains have become available making it one of the better studied pathogenic bacterial species at the genomic level. However, annotation of its genome remains challenging because of high GC content and dissimilarity to other model prokaryotes. To this end, we carried out an in-depth proteogenomic analysis of the M. tuberculosis H37Rv strain using Fourier transform mass spectrometry with high resolution at both MS and tandem MS levels. In all, we identified 3176 proteins from Mycobacterium tuberculosis representing ϳ80% of its total predicted gene count. In addition to protein database search, we carried out a genome database search, which led to identification of ϳ250 novel peptides. Based on these novel genome search-specific peptides, we discovered 41 novel protein coding genes in the H37Rv genome. Using peptide evidence and alternative gene prediction tools, we also corrected 79 gene models. Finally, mass spectrometric data from N terminus-derived peptides confirmed 727 existing annotations for translational start sites while correcting those for 33 proteins. We report creation of a high confidence set of protein coding regions in Mycobacterium tuberculosis genome obtained by high resolution tandem mass-spectrometry at both precursor and fragment detection steps for the first time. This proteogenomic approach should be generally applicable to other organisms whose genomes have already been sequenced for obtaining a more accurate catalogue of protein-coding genes.
Plasma is the most easily accessible source for biomarker discovery in clinical proteomics. However, identifying potential biomarkers from plasma is a challenge given the large dynamic range of proteins. The potential biomarkers in plasma are generally present at very low abundance levels and hence identification of these low abundance proteins necessitates the depletion of highly abundant proteins. Sample pre-fractionation using immuno-depletion of high abundance proteins using multi-affinity removal system (MARS) has been a popular method to deplete multiple high abundance proteins. However, depletion of these abundant proteins can result in concomitant removal of low abundant proteins. Although there are some reports suggesting the removal of non-targeted proteins, the predominant view is that number of such proteins is small. In this study, we identified proteins that are removed along with the targeted high abundant proteins. Three plasma samples were depleted using each of the three MARS (Hu-6, Hu-14 and Proteoprep 20) cartridges. The affinity bound fractions were subjected to gelC-MS using an LTQ-Orbitrap instrument. Using four database search algorithms including MassWiz (developed in house), we selected the peptides identified at <1% FDR. Peptides identified by at least two algorithms were selected for protein identification. After this rigorous bioinformatics analysis, we identified 101 proteins with high confidence. Thus, we believe that for biomarker discovery and proper quantitation of proteins, it might be better to study both bound and depleted fractions from any MARS depleted plasma sample.
Severe acute respiratory syndrome coronavirus (SARS-CoV)-2 infection in the Golden Syrian hamster causes lung pathology that resembles human coronavirus disease (COVID-19). However, extra-pulmonary pathologies associated with SARS-CoV-2 infection and post COVID sequelae remain to be understood. Here we show, using a hamster model, that the early phase of SARS-CoV-2 infection leads to an acute inflammatory response and lung pathologies, while the late phase of infection causes cardiovascular complications (CVC) characterized by ventricular wall thickening associated with increased ventricular mass/ body mass ratio and interstitial coronary fibrosis. Molecular profiling further substantiated our findings of CVC, as SARS-CoV-2-infected hamsters showed elevated levels of serum cardiac Troponin-I (cTnI), cholesterol, low-density lipoprotein and long-chain fatty acid triglycerides. Serum metabolomics profiling of SARS-CoV-2-infected hamsters identified N-acetylneuraminate, a functional metabolite found to be associated with CVC, as a metabolic marker was found to be common between SARS-CoV-2-infected hamsters and COVID-19 patients. Together, we propose hamsters as a suitable animal model to study post-COVID sequelae associated with CVC which could be extended to therapeutic interventions.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.