2019
DOI: 10.7554/elife.46923
|View full text |Cite
|
Sign up to set email alerts
|

Consistent and correctable bias in metagenomic sequencing experiments

Abstract: Marker-gene and metagenomic sequencing have profoundly expanded our ability to measure biological communities. But the measurements they provide differ from the truth, often dramatically, because these experiments are biased toward detecting some taxa over others. This experimental bias makes the taxon or gene abundances measured by different protocols quantitatively incomparable and can lead to spurious biological conclusions. We propose a mathematical model for how bias distorts community measurements based … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

9
370
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 327 publications
(379 citation statements)
references
References 65 publications
(156 reference statements)
9
370
0
Order By: Relevance
“…Given that the collection of qPCR data involves calibration (via a "standard curve") and 16S relative abundance data does not usually involve any calibration, we modeled the efficiency of the 16S data compared to the qPCR data, rather than the efficiency of the qPCR data compared to the 16S data. This is consistent with recent literature highlighting the lack of calibration and resulting bias of 16S data for estimating relative abundance (McLaren et al 2019). Our method can also be used with other technologies for obtaining absolute and relative abundance data.…”
Section: Discussionsupporting
confidence: 90%
See 1 more Smart Citation
“…Given that the collection of qPCR data involves calibration (via a "standard curve") and 16S relative abundance data does not usually involve any calibration, we modeled the efficiency of the 16S data compared to the qPCR data, rather than the efficiency of the qPCR data compared to the 16S data. This is consistent with recent literature highlighting the lack of calibration and resulting bias of 16S data for estimating relative abundance (McLaren et al 2019). Our method can also be used with other technologies for obtaining absolute and relative abundance data.…”
Section: Discussionsupporting
confidence: 90%
“…, e q ), and e j is the efficiency of taxon j for being observed by the relative abundance technology compared to the absolute abundance technology. Our efficiency vector e plays the role of the "total protocol bias" parameter of McLaren et al (2019). We now discuss estimation of the parameters of this model, including the identifiability of the efficiencies e.…”
Section: A Model Linking Absolute and Relative Abundancesmentioning
confidence: 99%
“…Nucleic acids from a sample undergo virtually unbiased sequencing, i.e., with minimum prejudice towards specific organisms [47,48]; in theory, the method can be used to analyze a potentially unlimited range of targets [49,50]. Nevertheless, there is evidence that metagenomic sequencing is restricted by multiple pitfalls and biases, based on the pathogen's structure, extraction method, GC content, and other factors [51][52][53]. Therefore, there are five major limitations: (1) sequences of interest should share at least low identity with the analogous sequences in a reference genome, ensuring correct mapping; (2) the analytical complexity of obtained data often requires employment of a qualified bioinformatician and the use of a specialized computational infrastructure (including both hardware and software; (3) hidden experimental and methodological prejudices towards certain taxa; (4) defining the method's sensitivity and properly measuring it against a relevant reference; (5) dealing with the abundance of host-cell, bacterial and fungal nucleic acids.…”
Section: Metagenomic Approachmentioning
confidence: 99%
“…Each species was present at each concentration in 3 mock communities and all 9 mock communities had 3 species at each concentration. The resulting Sequencing read counts and the known proportional abundances of each species in each mock community were then used to estimate species-specific biases following the method of (McLaren et al, 2019). We found significant bias among the members of our mock communities (Figs.…”
Section: Estimating Sequencing Biasmentioning
confidence: 99%
“…A tractable approach to overcoming these limitations is to assemble synthetic microbial communities, where immigration history can be precisely controlled (Vorholt et al, 2017;Carlström et al, 2019). In addition, by using a simplified microbial community, with known membership, it is also possible to quantify and correct sequencing biases to more accurately assess community assembly outcomes (McLaren et al, 2019).…”
Section: Introductionmentioning
confidence: 99%