2017
DOI: 10.1073/pnas.1618065114
|View full text |Cite
|
Sign up to set email alerts
|

PEMapper and PECaller provide a simplified approach to whole-genome sequencing

Abstract: The analysis of human whole-genome sequencing data presents significant computational challenges. The sheer size of datasets places an enormous burden on computational, disk array, and network resources. Here, we present an integrated computational package, PEMapper/PECaller, that was designed specifically to minimize the burden on networks and disk arrays, create output files that are minimal in size, and run in a highly computationally efficient way, with the single goal of enabling whole-genome sequencing a… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
31
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
6
1
1

Relationship

6
2

Authors

Journals

citations
Cited by 30 publications
(31 citation statements)
references
References 43 publications
(42 reference statements)
0
31
0
Order By: Relevance
“…69 Cleaned and annotated (reference genome assembly GRCh38) sequence data are placed into variant call format and annotated using ANNOVAR, 70 enabling standard analytic approaches to be applied, including variant classes (that is, exonic, intronic), rarity and functional effects. 7173 Annotation of SNVs from coding regions uses standard pipelines and bioinformatic filters for functional impact.…”
Section: Genomic Approachmentioning
confidence: 99%
“…69 Cleaned and annotated (reference genome assembly GRCh38) sequence data are placed into variant call format and annotated using ANNOVAR, 70 enabling standard analytic approaches to be applied, including variant classes (that is, exonic, intronic), rarity and functional effects. 7173 Annotation of SNVs from coding regions uses standard pipelines and bioinformatic filters for functional impact.…”
Section: Genomic Approachmentioning
confidence: 99%
“…, sites that change the primary coding sequence), and 10,409 silent SNVs. Both cases had ~1:1 silent-to-replacement sites and a transition-to-transversion ratio of ~2.3 for the human whole exome, indicating good quality sequencing 33 . A PPD was constructed using GenPro for each sample (Figure 1) using the variant base calls.…”
Section: Resultsmentioning
confidence: 98%
“…Whole-exome sequencing was performed using the SeqCap EZ Exome v2 Human Exome Capture Kit (Roche, Switzerland) according to the manufacturer’s protocol, with 100-bp paired-end sequencing on an Illumina HiSeq 2000 instrument to an average read depth of 100X. Raw sequencing reads were mapped to the whole human genome (GRCh38) using the PEMapper and variant sites were identified using PECaller with a theta of 0.001 and probability to call a variant site of 95% within the approximately 32 million bases that constitute the consensus coding domain 33 .…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Creating an annotation online is as simple as selecting the genome and assembly used to make the variant call format (VCF) [9] or SNP [10] format files, and uploading these files from a computer or Amazon S3 bucket, which can be easily linked to the web application. Annotation occurs in the cloud, where distributed instances of the Bystro annotation engine process the data and send the results back to the web application for storage and display (Figure 1).…”
Section: Discussionmentioning
confidence: 99%