2009
DOI: 10.1038/nmeth.1361
|View full text |Cite
|
Sign up to set email alerts
|

Accurate determination of microbial diversity from 454 pyrosequencing data

Abstract: We present an algorithm, PyroNoise, that clusters the flowgrams of 454 pyrosequencing reads using a distance measure that models sequencing noise. This infers the true sequences in a collection of amplicons. We pyrosequenced a known mixture of microbial 16S rDNA sequences extracted from a lake and found that without noise reduction the number of operational taxonomic units is overestimated but using PyroNoise it can be accurately calculated.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

15
807
1
2

Year Published

2011
2011
2017
2017

Publication Types

Select...
5
5

Relationship

0
10

Authors

Journals

citations
Cited by 896 publications
(825 citation statements)
references
References 152 publications
(192 reference statements)
15
807
1
2
Order By: Relevance
“…We did not discard entire sequence reads containing low‐quality score sections, which would have resulted in too few remaining sequences; instead, those sequences were truncated to exclude the poor‐quality section. After initial quality check, we then employed QIIME's algorithms that approximate PyroNoise (Quince et al., 2009; Reeder & Knight, 2010) to cluster the flowgram (analogous to the trace data in Sanger sequencing) and denoise the data. Next, we removed forward primer sequences from the end of reads, allowing at most one mismatch to the primer sequence, again to minimize the number of discards, where reads with more than one mismatch to the forward primer sequences were removed.…”
Section: Methodsmentioning
confidence: 99%
“…We did not discard entire sequence reads containing low‐quality score sections, which would have resulted in too few remaining sequences; instead, those sequences were truncated to exclude the poor‐quality section. After initial quality check, we then employed QIIME's algorithms that approximate PyroNoise (Quince et al., 2009; Reeder & Knight, 2010) to cluster the flowgram (analogous to the trace data in Sanger sequencing) and denoise the data. Next, we removed forward primer sequences from the end of reads, allowing at most one mismatch to the primer sequence, again to minimize the number of discards, where reads with more than one mismatch to the forward primer sequences were removed.…”
Section: Methodsmentioning
confidence: 99%
“…Processing of the resulting sequences, i.e. trimming and quality control, was performed with the MOTHUR software (v 1.30) (Schloss et al, 2009) including denoising of the flowgrams using PyroNoise (Quince et al 2009) and data normalization to the smallest number of sequences in the resulting libraries. Sequences ≥250 bp with no ambiguous base assignments and no homopolymers ≥8 bp were included in downstream analyses.…”
Section: Methodsmentioning
confidence: 99%
“…Other aspects, more on the technical side, are that primer selection may affect amplification of the community differently and the error attributed to the pyrosequencing technique itself that can lead to an overestimation of taxon abundance. If not corrected, this can be as much as 35% of the sequences (Gomez-Alvarez et al, 2009;Quince et al, 2009). Recently, the overestimation of taxa was illustrated when a single genome generated hundreds of different sequence types, leading to recommendations of strict quality filtering and careful application of sequence difference cut-offs for grouping sequences into operational taxa (Pukall et al, 2009;Purushe et al, 2010).…”
Section: Rumen Microbial Diversitymentioning
confidence: 99%