2018
DOI: 10.3390/metabo9010005
|View full text |Cite
|
Sign up to set email alerts
|

Random Forest Analysis of Untargeted Metabolomics Data Suggests Increased Use of Omega Fatty Acid Oxidation Pathway in Drosophila Melanogaster Larvae Fed a Medium Chain Fatty Acid Rich High-Fat Diet

Abstract: Obesity is a complex disease, shaped by both genetic and environmental factors such as diet. In this study, we use untargeted metabolomics and Drosophila melanogaster to model how diet and genotype shape the metabolome of obese phenotypes. We used 16 distinct outbred genotypes of Drosophila larvae raised on normal (ND) and high-fat (HFD) diets, to produce three distinct phenotypic classes; genotypes that stored more triglycerides on a ND relative to the HFD, genotypes that stored more triglycerides on a HFD re… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
12
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 11 publications
(12 citation statements)
references
References 63 publications
(85 reference statements)
0
12
0
Order By: Relevance
“…Contrary to yield analysis which is univariate, the first hurdle encountered in metabolomic studies is the large number of variables, which requires sorting and prioritization to better study effects on involved compounds. Recognized thanks to its predictive accuracy as a classification method for complex traits genome-wide prediction [41], we used Random forest algorithm which is particularly adapted in metabolomic for biomarker identification [42][43][44]. The use of several contrasted growing locations allows to introduce environmental source of variation which is essential to assess variety stability for a given trait but may degrade algorithm predictive accuracy used for varietal marker characterization.…”
Section: Stability and Adaptability Concepts Applied To Metabolic Promentioning
confidence: 99%
“…Contrary to yield analysis which is univariate, the first hurdle encountered in metabolomic studies is the large number of variables, which requires sorting and prioritization to better study effects on involved compounds. Recognized thanks to its predictive accuracy as a classification method for complex traits genome-wide prediction [41], we used Random forest algorithm which is particularly adapted in metabolomic for biomarker identification [42][43][44]. The use of several contrasted growing locations allows to introduce environmental source of variation which is essential to assess variety stability for a given trait but may degrade algorithm predictive accuracy used for varietal marker characterization.…”
Section: Stability and Adaptability Concepts Applied To Metabolic Promentioning
confidence: 99%
“…In order to ensure robustness in our feature selection process, we further developed three different pairwise models to identify metabolite markers for each group: a significance analysis of microarray (SAM) [11], PLS-DA variable importance in projection (VIP) [12], and a random forest (RF) [13] classification model. SAM identified 73 out of 234 metabolites in the Tet-On Mfn2 induced fusion group, 71 out of 245 metabolites in the indirect fusion group, and 74 out of 233 metabolites in the Leflunomide treated group as significantly altered based on an FDR < 0.05 and a corresponding delta of 0.39, 0.38, and 0.32 for the Tet-On Mfn2, sgDrp1, and Leflunomide groups respectively (Figure 4B).…”
Section: Resultsmentioning
confidence: 99%
“…As a machine learning method, the random forest model is a regression tree technique that uses bootstrap aggregation and randomization of predictors to achieve a high degree of predictive accuracy [ 27 ]. The random forest model has been widely used to analyze complex metabolomics data [ 28 , 29 , 30 , 31 , 32 , 33 ] and has its unique advantages. First, it is relatively robust to outliers and noise.…”
Section: Resultsmentioning
confidence: 99%