2010
DOI: 10.1016/j.watres.2010.05.019
|View full text |Cite
|
Sign up to set email alerts
|

Novel application of a statistical technique, Random Forests, in a bacterial source tracking study

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
34
1
2

Year Published

2011
2011
2023
2023

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 68 publications
(40 citation statements)
references
References 26 publications
0
34
1
2
Order By: Relevance
“…Equivalent or even superior performances have been reported for Linear Discriminant Analysis and Random Forests when compared with Neural Networks, Classification Trees and Support Vector Machines [see e.g. [34,47,58,67,68]]. However, controversy still prevails regarding the effects on classifiers' performance of different combinations of predictors, data assumptions, sample sizes and parameters tuning [16,17,31,58,69,70].…”
Section: Discussionmentioning
confidence: 99%
“…Equivalent or even superior performances have been reported for Linear Discriminant Analysis and Random Forests when compared with Neural Networks, Classification Trees and Support Vector Machines [see e.g. [34,47,58,67,68]]. However, controversy still prevails regarding the effects on classifiers' performance of different combinations of predictors, data assumptions, sample sizes and parameters tuning [16,17,31,58,69,70].…”
Section: Discussionmentioning
confidence: 99%
“…Furthermore, there have been studies with fairly small samples in which new classification methods such as RF and NN have been applied with high accuracy [8,13,28]. Some studies have reported equivalent or even superior performance of LR and LDA in comparison with NN, SVM, RF, and FCM [9,13,20,29,30]. Since the performance of NN and SVM depends on tuning parameters, these parameters were optimally determined by grid search.…”
Section: Discussionmentioning
confidence: 99%
“…There are numerous machine learning algorithms, including support vector machines [71], artificial neural networks [72], and random forests [73] that can be trained with known microbial community structures to classify environmental samples of unknown origin. Random forest have successfully been used in the context of source tracking using phenotypic data of E. coli [74] and classifying hosts according to Blautia profiles [70], and its applicability to microbiome data is shown in a recent review by Statnikov et al [75]. The random forest algorithm can handle very large input datasets, tolerate outliers in the input data, and generate an unbiased estimate of the classification error.…”
Section: Host Microbiomes and Specificity Of Commensal Organismsmentioning
confidence: 99%