2018
DOI: 10.1186/s40168-018-0568-3
|View full text |Cite
|
Sign up to set email alerts
|

Fecal source identification using random forest

Abstract: BackgroundClostridiales and Bacteroidales are uniquely adapted to the gut environment and have co-evolved with their hosts resulting in convergent microbiome patterns within mammalian species. As a result, members of Clostridiales and Bacteroidales are particularly suitable for identifying sources of fecal contamination in environmental samples. However, a comprehensive evaluation of their predictive power and development of computational approaches is lacking. Given the global public health concern for waterb… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

2
72
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
6
2
1
1

Relationship

2
8

Authors

Journals

citations
Cited by 105 publications
(74 citation statements)
references
References 63 publications
2
72
0
Order By: Relevance
“…We used spearman algorithm to analyze the relationship among microbiota, predicted pathways and SLE activity index. The Random Forest models were trained by "randomForest" package with default parameters in R, then the performance of the model was assessed with a ten-fold cross-validation approach and measured by area under the receiver-operating characteristic (ROC) (33). All tests were performed using GraphPad Prism (v6.0) (GraphPad Software, Inc., CA, USA), SPSS Statistics (V.24.0.0.0) (SPSS Inc., Chicago, USA) or R software (Version 3.4.4).…”
Section: ) Thementioning
confidence: 99%
“…We used spearman algorithm to analyze the relationship among microbiota, predicted pathways and SLE activity index. The Random Forest models were trained by "randomForest" package with default parameters in R, then the performance of the model was assessed with a ten-fold cross-validation approach and measured by area under the receiver-operating characteristic (ROC) (33). All tests were performed using GraphPad Prism (v6.0) (GraphPad Software, Inc., CA, USA), SPSS Statistics (V.24.0.0.0) (SPSS Inc., Chicago, USA) or R software (Version 3.4.4).…”
Section: ) Thementioning
confidence: 99%
“…Computational or machine learning approaches such as Source Tracker (53) or random forest (54) can use sequence abundance patterns of the whole community, or of taxonomic groups, to identify pollution signals within a water sample. These methods rely on signatures of sequences that include their relative abundance patterns within the community, and sequences shared between sources generally do not also share overall relative abundance patterns within the signature (54). Furthermore, fecal bacterium sequences within these data sets that do not match a characterized source could be used to indicate extraneous sources that may be contributing fecal indicator bacteria but are not considered a significant human health risk (i.e., bird or pet waste and urban wildlife).…”
Section: Discussionmentioning
confidence: 99%
“…Machine learning is widely used in several completely different fields; detection of regions of interests, image editing, texture analysis, visual aesthetic and quality assessment (Datta et al, 2006;Marchesotti et al, 2011;Romero et al, 2012;Fernandez-Lozano et al, 2015;Mata et al, 2018) and more recently in Carballal et al (2019b) or for microbiome analysis (Liu et al, 2017;Roguet et al, 2018), authentication of tequilas (Pérez-Caballero et al, 2017;Andrade et al, 2017), pathogenic point mutations (Rogers et al, 2018) or forensic identification (Gómez et al, 2018). Finally, with regard to the extraction of characteristics from images, some recently published works have been revised (Xu et al, 2018;Ali et al, 2016b;Wang et al, 2018;Sun et al, 2018;Zafar et al, 2018b).…”
Section: /20mentioning
confidence: 99%