2019
DOI: 10.1038/s41598-019-39108-2
|View full text |Cite
|
Sign up to set email alerts
|

Comparative analysis of whole-genome sequencing pipelines to minimize false negative findings

Abstract: Comprehensive and accurate detection of variants from whole-genome sequencing (WGS) is a strong prerequisite for translational genomic medicine; however, low concordance between analytic pipelines is an outstanding challenge. We processed a European and an African WGS samples with 70 analytic pipelines comprising the combination of 7 short-read aligners and 10 variant calling algorithms (VCAs), and observed remarkable differences in the number of variants called by different pipelines (max/min ratio: 1.3~3.4).… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

1
65
2
1

Year Published

2019
2019
2024
2024

Publication Types

Select...
8
2

Relationship

0
10

Authors

Journals

citations
Cited by 74 publications
(69 citation statements)
references
References 46 publications
1
65
2
1
Order By: Relevance
“…Many underlying algorithms of variant calling pipelines were developed for the analysis of variants in the human genome, e.g., to investigate genetic disorders or to study tumor samples [20,[41][42][43][44]. Although the applications in biomedical research and plant sciences differ substantially, plant scientists have largely followed benchmarking studies derived from research on human samples assuming similar performances.…”
Section: Introductionmentioning
confidence: 99%
“…Many underlying algorithms of variant calling pipelines were developed for the analysis of variants in the human genome, e.g., to investigate genetic disorders or to study tumor samples [20,[41][42][43][44]. Although the applications in biomedical research and plant sciences differ substantially, plant scientists have largely followed benchmarking studies derived from research on human samples assuming similar performances.…”
Section: Introductionmentioning
confidence: 99%
“…Overall, despite the enormous potential of WES and, mostly, WGS, data management, data analysis and biological interpretation are critical to achieve optimal results [37,38]. NGS sequencing produces millions/billions of massive sequencing reads; thus, a huge amount of raw data is generated, especially when high sequencing depth is required.…”
Section: Ngs Technologiesmentioning
confidence: 99%
“…Hwang et al, compared both the European NA12878 and the African NA19240 samples from the 1000 Genomes Project. They found a pipeline consisting of BWA-MEM and subsequently the Genome Analysis ToolKit with the Haplotype Caller to be su cient to reliably detect variants in most regions, except for rare variants and di cult regions, as for example simple repeats [9].…”
Section: Introductionmentioning
confidence: 99%