Ion mobility (IM) spectrometry provides semiorthogonal data to mass spectrometry (MS), showing promise for identifying unknown metabolites in complex non-targeted metabolomics data sets. While current literature has showcased IM−MS for identifying unknowns under near ideal circumstances, less work has been conducted to evaluate the performance of this approach in metabolomics studies involving highly complex samples with difficult matrices. Here, we present a workflow incorporating de novo molecular formula annotation and MS/MS structure elucidation using SIRIUS 4 with experimental IM collision cross-section (CCS) measurements and machine learning CCS predictions to identify differential unknown metabolites in mutant strains of Caenorhabditis elegans. For many of those ion features, this workflow enabled the successful filtering of candidate structures generated by in silico MS/MS predictions, though in some cases, annotations were challenged by significant hurdles in instrumentation performance and data analysis. While for 37% of differential features we were able to successfully collect both MS/MS and CCS data, fewer than half of these features benefited from a reduction in the number of possible candidate structures using CCS filtering due to poor matching of the machine learning training sets, limited accuracy of experimental and predicted CCS values, and lack of candidate structures resulting from the MS/MS data. When using a CCS error cutoff of ±3%, on average, 28% of candidate structures could be successfully filtered. Herein, we identify and describe the bottlenecks and limitations associated with the identification of unknowns in non-targeted metabolomics using IM−MS to focus and provide insights into areas requiring further improvement.
The use of quality control samples in metabolomics ensures data quality, reproducibility, and comparability between studies, analytical platforms, and laboratories. Long-term, stable, and sustainable reference materials (RMs) are a critical component of the quality assurance/quality control (QA/QC) system; however, the limited selection of currently available matrix-matched RMs reduces their applicability for widespread use. To produce an RM in any context, for any matrix that is robust to changes over the course of time, we developed iterative batch averaging method (IBAT). To illustrate this method, we generated 11 independently grown Escherichia coli batches and made an RM over the course of 10 IBAT iterations. We measured the variance of these materials by nuclear magnetic resonance (NMR) and showed that IBAT produces a stable and sustainable RM over time. This E. coli RM was then used as a food source to produce a Caenorhabditis elegans RM for a metabolomics experiment. The metabolite extraction of this material, alongside 41 independently grown individual C. elegans samples of the same genotype, allowed us to estimate the proportion of sample variation in preanalytical steps. From the NMR data, we found that 40% of the metabolite variance is due to the metabolite extraction process and analysis and 60% is due to sample-to-sample variance. The availability of RMs in untargeted metabolomics is one of the predominant needs of the metabolomics community that reach beyond quality control practices. IBAT addresses this need by facilitating the production of biologically relevant RMs and increasing their widespread use.
Large-scale untargeted metabolomics studies suffer from individual variation, batch effects and instrument variability, making comparisons of common spectral features across studies difficult. One solution is to compare studies after compound identification. However, compound identification is expensive and time consuming. We successfully identify common spectral features across multiple studies, with a generalizable experimental design approach. First, we included an anchor strain, PD1074, during sample and data collection. Second, we collected data in blocks with multiple controls. These anchors enabled us to successfully integrate three studies of Caenorhabditis elegans for nuclear magnetic resonance (NMR) spectroscopy and liquid chromatography-mass spectrometry (LC-MS) data from five different assays. We found 34% and 14% of features to be significant in LC-MS and NMR, respectively. Between 20-50% of spectral features differ in a mutant and among a set of genetically diverse natural strains, suggesting this reduced set of spectral features are excellent targets for compound identification.
Ion mobility (IM) spectrometry provides semi-orthogonal data to mass spectrometry (MS), showing promise for identifying unknown metabolites in complex non-targeted metabolomics datasets. While current literature has showcased IM-MS for identifying unknowns under near ideal circumstances, less work has been conducted to evaluate the performance of this approach in metabolomics studies involving highly complex samples with difficult matrices. Here, we present a workflow incorporating de novo molecular formula annotation and MS/MS structure elucidation using SIRIUS 4 with experimental IM collision cross-section (CCS) measurements and machine learning CCS predictions to identify differential unknown metabolites in mutant strains of Caenorhabditis elegans. For many of those ion features this workflow enabled the successful filtering of candidate structures generated by in silico MS/MS predictions, though in some cases annotations were challenged by significant hurdles in instrumentation performance and data analysis. While for 37% of differential features we were able to successfully collect both MS/MS and CCS data, fewer than half of these features benefited from a reduction in the number of possible candidate structures using CCS filtering due to poor matching of the machine learning training sets, limited accuracy of experimental and predicted CCS values, and lack of candidate structures resulting from the MS/MS data. When using a CCS error cutoff of +/-3%, an average 28% of candidate structures could be successfully filtered. Herein, we identify and describe the bottlenecks and limitations associated with the identification of unknowns in nontargeted metabolomics using IM-MS to focus and provide insight on areas requiring further improvement.
The use of quality control samples in metabolomics ensures data quality, reproducibility and comparability between studies, analytical platforms and laboratories. Long-term, stable and sustainable reference materials (RMs) are a critical component of the QA/QC system, however, the limited selection of currently available matrix matched RMs reduce their applicability for widespread use. To produce a RM in any context, for any matrix that is robust to changes over the course of time we developed IBAT (Iterative Batch Averaging meThod). To illustrate this method, we generated 11 independently grown E. coli batches and made a RM over the course of 10 IBAT iterations. We measured the variance of these materials by NMR and showed that IBAT produces a stable and sustainable RM over time. This E. coli RM was then used as food source to produce a C. elegans RM for a metabolomics experiment. The metabolite extraction of this material alongside 41 independently grown individual C. elegans samples of the same genotype, allowed to estimate the proportion of sample variation in pre-analytical steps. From the NMR data, we found that 40% of the metabolite variance is due to the metabolite extraction process and analysis and 60% is due to sample-to-sample variance. The availability of RMs in untargeted metabolomics is one of the predominant needs of the metabolomics community that reach beyond quality control practices. IBAT addresses this need by facilitating the production of biologically relevant RMs and increasing their widespread use.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.