Pathway analysis in metabolomics: Recommendations for the use of over-representation analysis

Wieder, Cecilia; Frainay, Clément; Poupin, Nathalie; Rodríguez-Mier, Pablo; Vinson, Florence; Cooke, Juliette; Lai, Rachel; Bundy, Jacob G.; Jourdan, Fabien; Ebbels, Timothy M. D.

doi:10.1371/journal.pcbi.1009105

Cited by 92 publications

(75 citation statements)

References 57 publications

“…A powerful method to describe peculiar features of the cell metabolism is pathway analysis (PA), which provides a graphical representation of the relationships among the actors (mainly enzymes and metabolites) of precise catalyzed reactions. Therefore, PA is highly employed for the interpretation of high-dimensional molecular data [74]. In fact, taking advantage of the already acquired knowledge of biological pathways, proteins, metabolites and also genes can be mapped onto newly developed pathways with the objective to draw their collective functions and interactions in that specific biological environment [75].…”

Section: Pathway Analysis Methods; citation type: mentioning

confidence: 99%

“…Three are the necessary inputs in ORA analysis: (i) a set of pathways (or metabolite collections); (ii) a catalog of investigating metabolites and, (iii) a background collection of compounds. The list of investigating metabolites usually comes from experimental data after applying a statistical test to determine those metabolites whose signals can be associated with a precise result by choosing a threshold value usually associated to the p-values [74]. The background collection includes all metabolites that can be revealed in the considered measurement.…”

Section: Over-representation Analysis (ORA); citation type: mentioning

confidence: 99%
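
The three ORA inputs named in the statement above (pathway set, metabolites of interest, background set) can be turned into a p-value with a short sketch. The quoted text does not name a statistical test; the right-tailed hypergeometric test (equivalent to a one-sided Fisher's exact test) is assumed here because it is the usual choice for ORA. The function name `ora_p_value` and the compound identifiers are hypothetical and chosen only for illustration.

```python
from math import comb


def ora_p_value(pathway, metabolites_of_interest, background):
    """Right-tailed hypergeometric p-value for one pathway (assumed test)."""
    background = set(background)
    # Restrict both sets to the background: compounds the assay
    # cannot detect carry no information for the test.
    pathway_in_bg = set(pathway) & background
    hits = set(metabolites_of_interest) & background

    N = len(background)            # detectable compounds
    K = len(pathway_in_bg)         # pathway members among them
    n = len(hits)                  # metabolites of interest
    k = len(pathway_in_bg & hits)  # overlap between pathway and hits

    # P(X >= k) for X ~ Hypergeometric(N, K, n)
    tail = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, min(K, n) + 1))
    return tail / comb(N, n)


# Hypothetical toy data: 10 detectable compounds, a 3-compound pathway,
# and 3 metabolites of interest, 2 of which fall in the pathway.
background = [f"c{i}" for i in range(10)]
pathway = {"c0", "c1", "c2"}
interesting = {"c0", "c1", "c5"}
p = ora_p_value(pathway, interesting, background)
```

In practice the function would be applied to every pathway in the collection and the resulting p-values corrected for multiple testing; that step is omitted from this sketch.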

NMR in Metabolomics: From Conventional Statistics to Machine Learning and Neural Network Approaches

et al. 2022

NMR measurements combined with chemometrics allow achieving a great amount of information for the identification of potential biomarkers responsible for a precise metabolic pathway. These kinds of data are useful in different fields, ranging from food to biomedical fields, including health science. The investigation of the whole set of metabolites in a sample, representing its fingerprint in the considered condition, is known as metabolomics and may take advantage of different statistical tools. The new frontier is to adopt self-learning techniques to enhance clustering or classification actions that can improve the predictive power over large amounts of data. Although machine learning is already employed in metabolomics, deep learning and artificial neural networks approaches were only recently successfully applied. In this work, we give an overview of the statistical approaches underlying the wide range of opportunities that machine learning and neural networks allow to perform with accurate metabolites assignment and quantification. Various actual challenges are discussed, such as proper metabolomics, deep learning architectures and model accuracy.

“…Furthermore, the authors argued that the choice of gene set collections should not be made arbitrarily as certain gene sets may be more or less suitable for a particular dataset than others. In a recent study on best practices for the popular ORA method on metabolomics data [65], the authors also found that the results of pathway analysis substantially differed based on the choice of pathway database (i.e. KEGG, Reactome and BioCyc [66]).…”

Section: Impact Of Pathway Database and Gene Set Size; citation type: mentioning

confidence: 99%

On the influence of several factors on pathway enrichment analysis

Mubeen, Kodamullil, Hofmann-Apitius et al. 2022

Briefings in Bioinformatics

Pathway enrichment analysis has become a widely used knowledge-based approach for the interpretation of biomedical data. Its popularity has led to an explosion of both enrichment methods and pathway databases. While the elegance of pathway enrichment lies in its simplicity, multiple factors can impact the results of such an analysis, which may not be accounted for. Researchers may fail to give influential aspects their due, resorting instead to popular methods and gene set collections, or default settings. Despite ongoing efforts to establish set guidelines, meaningful results are still hampered by a lack of consensus or gold standards around how enrichment analysis should be conducted. Nonetheless, such concerns have prompted a series of benchmark studies specifically focused on evaluating the influence of various factors on pathway enrichment results. In this review, we organize and summarize the findings of these benchmarks to provide a comprehensive overview on the influence of these factors. Our work covers a broad spectrum of factors, spanning from methodological assumptions to those related to prior biological knowledge, such as pathway definitions and database choice. In doing so, we aim to shed light on how these aspects can lead to insignificant, uninteresting or even contradictory results. Finally, we conclude the review by proposing future benchmarks as well as solutions to overcome some of the challenges, which originate from the outlined factors.

“…Details on various possibilities of handling NMR-based metabolomics data can be consulted elsewhere (Blaise et al, 2021; Debik et al, 2022). Beyond statistical treatment, web-based tools like MetaboAnalyst (Chong et al, 2018) allow to visualise metabolomics data in an user-friendly way, and are able to perform additional tasks, as for example pathway enrichment analysis (Wieder et al, 2021).…”

Section: Computational Tools and Resources; citation type: mentioning

confidence: 99%

Studying Metabolism by NMR-Based Metabolomics

Moco 2022

Front. Mol. Biosci.

During the past few decades, the direct analysis of metabolic intermediates in biological samples has greatly improved the understanding of metabolic processes. The most used technologies for these advances have been mass spectrometry (MS) and nuclear magnetic resonance (NMR) spectroscopy. NMR is traditionally used to elucidate molecular structures and has now been extended to the analysis of complex mixtures, as biological samples: NMR-based metabolomics. There are however other areas of small molecule biochemistry for which NMR is equally powerful. These include the quantification of metabolites (qNMR); the use of stable isotope tracers to determine the metabolic fate of drugs or nutrients, unravelling of new metabolic pathways, and flux through pathways; and metabolite-protein interactions for understanding metabolic regulation and pharmacological effects. Computational tools and resources for automating analysis of spectra and extracting meaningful biochemical information has developed in tandem and contributes to a more detailed understanding of systems biochemistry. In this review, we highlight the contribution of NMR in small molecule biochemistry, specifically in metabolic studies by reviewing the state-of-the-art methodologies of NMR spectroscopy and future directions.
