2019
DOI: 10.1101/599704
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Machine learning to predict microbial community functions: An analysis of dissolved organic carbon from litter decomposition

Abstract: Microbial communities are ubiquitous and often influence macroscopic properties of the ecosystems they inhabit. However, deciphering the functional relationship between specific microbes and ecosystem properties is an ongoing challenge owing to the complexity of the communities. This challenge can be addressed, in part, by integrating the advances in DNA sequencing technology with computational approaches like machine learning. Although machine learning techniques have been applied to microbiome data, use of t… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
15
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 23 publications
(15 citation statements)
references
References 21 publications
0
15
0
Order By: Relevance
“…Machine learning algorithms, such as random forests (RF), are excellent tools for classifying microbiome features into various classes or categories, and which also allow us to dissect the relationships between microbial features and environmental attributes. RF is less sensitive to the sample size of the training data set and more accurate for prediction performance [27]. The 20 most important genera selected by RF provided an accuracy classification between the KO and SO groups (Fig.…”
Section: Krill Oil Partially Restored Trichuris Suis-induced Gut Micrmentioning
confidence: 99%
“…Machine learning algorithms, such as random forests (RF), are excellent tools for classifying microbiome features into various classes or categories, and which also allow us to dissect the relationships between microbial features and environmental attributes. RF is less sensitive to the sample size of the training data set and more accurate for prediction performance [27]. The 20 most important genera selected by RF provided an accuracy classification between the KO and SO groups (Fig.…”
Section: Krill Oil Partially Restored Trichuris Suis-induced Gut Micrmentioning
confidence: 99%
“…The Random Forest Indicator species analysis Neural Network (RFINN) platform (Thompson et al ., 2019) was used to identify subsets of genera that were found to be important features for prediction of either CO 2 or DOC. The data were pre‐processed with genera abundances computed as the average abundance of all OTUs within each genus.…”
Section: Methodsmentioning
confidence: 99%
“…Samples were clustered using the proximity matrix from an unsupervised random forest in the R package randomForest using 5000 trees and Ward method for hierarchical clustering. Random forest is a robust non-parametric, statistical learning method for the identification of clusters and variables of importance in complex multi-omics data [54,55]. The effect of different antibiotic regimes (clindamycin, no antibiotics and the combination of colistin, piperacillin-tazobactam, ceftazidime-avibactam, and vancomycin) on was tested using a supervised permutational random Forest (5000 trees 1000 permutations) where the three antibiotic regimes were used as supervising variable in the package rfPermute.…”
Section: Methodsmentioning
confidence: 99%