BACKGROUND
Untargeted multiomics datasets are obtained for samples in systems, synthetic, and chemical biology by integrating chromatographic separations with ion mobility-mass spectrometry (IM-MS) analysis. The datasets are interrogated using bioinformatics strategies to organize the data for identification prioritization.
CONTENT
The use of big data approaches for data mining of massive datasets in systems-wide analyses is presented. Untargeted biological data across multiomics dimensions are obtained using a variety of chromatography strategies with structural mass spectrometry. Separation timescales for different techniques and the resulting data deluge when combined with IM-MS is presented. Data mining self-organizing map (SOM) approaches are used to rapidly filter the data highlighting those features describing uniqueness to the query. Examples are provided in longitudinal analyses in synthetic biology, human liver exposure to acetaminophen, and in chemical biology, natural product discovery from bacterial biomes.
CONCLUSIONS
Matching separation timescales of different forms of chromatography with IM-MS provides sufficient multiomics selectivity to perform untargeted systems-wide analyses. New data mining strategies provide a means for rapidly interrogating these data sets for feature prioritization and discovery in a range of applications in systems, synthetic, and chemical biology.