Efficient tools for data management and integration are essential for many aspects of high-throughput biology. In particular, annotations of genes and human genetic variants are commonly used but highly fragmented across many resources. Here, we describe MyGene.info and MyVariant.info, high-performance web services for querying gene and variant annotation information. These web services are currently accessed more than three million times permonth. They also demonstrate a generalizable cloud-based model for organizing and querying biological annotation information. MyGene.info and MyVariant.info are provided as high-performance web services, accessible at http://mygene.info and http://myvariant.info. Both are offered free of charge to the research community.Electronic supplementary materialThe online version of this article (doi:10.1186/s13059-016-0953-9) contains supplementary material, which is available to authorized users.
13C NMR has many advantages for a metabolomics study, including a large spectral dispersion, narrow singlets at natural abundance, and a direct measure of the backbone structures of metabolites. However, it has not had widespread use because of its relatively low sensitivity compounded by low natural abundance. Here we demonstrate the utility of high-quality 13C NMR spectra obtained using a custom 13C-optimized probe on metabolomic mixtures. A workflow was developed to use statistical correlations between replicate 1D 13C and 1H spectra, leading to composite spin systems that can be used to search publicly available databases for compound identification. This was developed using synthetic mixtures and then applied to two biological samples, Drosophila melanogaster extracts and mouse serum. Using the synthetic mixtures we were able to obtain useful 13C–13C statistical correlations from metabolites with as little as 60 nmol of material. The lower limit of 13C NMR detection under our experimental conditions is approximately 40 nmol, slightly lower than the requirement for statistical analysis. The 13C and 1H data together led to 15 matches in the database compared to just 7 using 1H alone, and the 13C correlated peak lists had far fewer false positives than the 1H generated lists. In addition, the 13C 1D data provided improved metabolite identification and separation of biologically distinct groups using multivariate statistical analysis in the D. melanogaster extracts and mouse serum.
Wikidata is a community-maintained knowledge base that has been assembled from repositories in the fields of genomics, proteomics, genetic variants, pathways, chemical compounds, and diseases, and that adheres to the FAIR principles of findability, accessibility, interoperability and reusability. Here we describe the breadth and depth of the biomedical knowledge contained within Wikidata, and discuss the open-source tools we have built to add information to Wikidata and to synchronize it with source databases. We also demonstrate several use cases for Wikidata, including the crowdsourced curation of biomedical ontologies, phenotype-based diagnosis of disease, and drug repurposing.
Tandem mass spectrometry based shotgun proteomics of distal gut microbiomes is exceedingly difficult due to the inherent complexity and taxonomic diversity of the samples. We introduce two new methodologies to improve metaproteomic studies of microbiome samples. These methods include the stable isotope labeling in mammals to permit protein quantitation across two mouse cohorts, as well as the application of activity-based probes to enrich and analyze both host and microbial proteins with specific functionalities. We used these technologies to study the microbiota from the adoptive T cell transfer mouse model of inflammatory bowel disease (IBD) and compare these samples to an isogenic control; thereby, limiting genetic and environmental variables that influence microbiome composition. The data and results generated highlight quantitative alterations in both host and microbial proteins due to intestinal inflammation and corroborates the observed phylogenetic changes in bacteria that accompany IBD in humans and mouse models. The combination of isotope labeling with shotgun proteomics resulted in the total identification of 4434 protein clusters expressed in the microbial proteomic environment, 276 of which demonstrated differential abundance between control and IBD mice. Notably, application of a novel cysteine-reactive probe uncovered several microbial proteases and hydrolases overrepresented in the IBD mice. Implementation of these methods demonstrates that substantial insights into the identity and dysregulation of host and microbial proteins altered in IBD can be accomplished and can be used in the interrogation of other microbiome-related diseases.
We demonstrate the global metabolic analysis of Caenorhabditis elegans stress responses using a mass spectrometry-based technique called Isotopic Ratio Outlier Analysis (IROA). In an IROA protocol, control and experimental samples are isotopically labeled with 95% and 5% 13C, and the two sample populations are mixed together for uniform extraction, sample preparation, and LC-MS analysis. This labeling strategy provides several advantages over conventional approaches: 1) compounds arising from biosynthesis are easily distinguished from artifacts, 2) errors from sample extraction and preparation are minimized because the control and experiment are combined into a single sample, 3) measurement of both the molecular weight and the exact number of carbon atoms in each molecule provides extremely accurate molecular formulae, and 4) relative concentrations of all metabolites are easily determined. A heat shock perturbation was conducted on C. elegans to demonstrate this approach. We identified many compounds that significantly changed upon heat shock, including several from the purine metabolism pathway, which we use to demonstrate the approach. The metabolomic response information by IROA may be interpreted in the context of a wealth of genetic and proteomic information available for C. elegans. Furthermore, the IROA protocol can be applied to any organism that can be isotopically labeled, making it a powerful new tool in a global metabolomics pipeline.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.