An Evaluation of Copy Number Variation Detection Tools from Whole-Exome Sequencing Data

Tan, Renjie; Wang, Yadong; Kleinstein, Sarah E.; Liu, Yongzhuang; Zhu, Xiaolin; Guo, Hongzhe; Jiang, Qinghua; Allen, Andrew S.; Zhu, Mingyuan

doi:10.1002/humu.22537

Cited by 201 publications

(231 citation statements)

References 46 publications

Supporting

Mentioning: 210

Contrasting

Unclassified


“…While XHMM utilizes the principal component analysis followed by the Hidden Markov model to identify CNVs, CoNIFER uses a singular value decomposition technique to correct systematic biases and identifies a CNV call if the corrected signal reaches a predefined threshold at no less than three subsequent exons (Fromer et al, 2012; Krumm et al, 2012). Breakpoint detection is an advantage of the XHMM (Fromer et al, 2012; Tan et al, 2014). According to our data, XHMM showed more reliable results for the size of the OTOA deletion.…”

Section: Discussion (mentioning)

confidence: 73%
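
The thresholding rule described in the statement above (a call is made when the corrected signal reaches a threshold over at least three consecutive exons) can be illustrated with a minimal sketch. This is not CoNIFER's implementation: the function name `call_cnv_runs`, the symmetric threshold of 1.5, and the example signal are assumptions chosen for illustration, and the bias-correction step (singular value decomposition) is assumed to have already been applied to the input.

```python
from typing import List, Optional, Tuple


def call_cnv_runs(
    signal: List[float],
    threshold: float = 1.5,   # hypothetical cutoff, not taken from the source
    min_exons: int = 3,       # "no less than three subsequent exons"
) -> List[Tuple[int, int, str]]:
    """Return (start, end, kind) for each run of at least `min_exons`
    consecutive exons whose bias-corrected signal is at or beyond
    +threshold ("dup") or -threshold ("del"). Indices are inclusive."""
    calls: List[Tuple[int, int, str]] = []
    run_start = 0
    run_kind: Optional[str] = None
    for i, value in enumerate(signal):
        if value >= threshold:
            kind: Optional[str] = "dup"
        elif value <= -threshold:
            kind = "del"
        else:
            kind = None
        if kind != run_kind:
            # The previous run ended at exon i - 1; keep it if long enough.
            if run_kind is not None and i - run_start >= min_exons:
                calls.append((run_start, i - 1, run_kind))
            run_start, run_kind = i, kind
    # Flush a run that extends to the last exon.
    if run_kind is not None and len(signal) - run_start >= min_exons:
        calls.append((run_start, len(signal) - 1, run_kind))
    return calls


# Made-up per-exon corrected signal for one sample.
example = [0.1, -1.8, -2.0, -1.6, 0.2, 1.7, 1.9, 0.0, 2.1, 2.2, 2.5]
print(call_cnv_runs(example))
```

In this example the two-exon excursion at indices 5 and 6 is not reported because it is shorter than the minimum run length. A conservative threshold or minimum length of this kind trades sensitivity for specificity, which is consistent with the missed heterozygous deletions discussed in the next statement.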

“…We also noted that CoNIFER CNV analysis missed two heterozygous deletions used as positive controls. Recent CNV comparison studies in WES showed that by using the previously described heterozygosity check method (Zhu et al, 2012), CoNIFER detects less (40%) heterozygous false-positive deletions (for regions > 1 kb) compared to XHMM (64%) (Tan et al, 2014), suggesting that it might be missing some true positives to increase specificity. Conservative predefined thresholds in default settings of the CoNIFER might be the reason for missing heterozygous deletions in positive controls in our data set.…”

Section: Discussion (mentioning)

confidence: 99%

“…The main applications of WES in most laboratories are identifying single-nucleotide variants (SNVs) and small deletion/insertions (INDELs). While various practical algorithms for the detection of copy number variants (CNVs) have recently been developed (Fromer et al, 2012; Krumm et al, 2012; Tan et al, 2014), their usage in general is not common and their application in deafness has not been reported.…”

Section: Introduction (mentioning)

confidence: 99%


Identification of Copy Number Variants Through Whole-Exome Sequencing in Autosomal Recessive Nonsyndromic Hearing Loss

Bademci, Diaz‐Horta, Guo, et al. 2014. Genetic Testing and Molecular Biomarkers


Genetic variants account for more than half of the cases with congenital or prelingual onset hearing loss. Autosomal recessive nonsyndromic hearing loss (ARNSHL) is the most common subgroup. Whole-exome sequencing (WES) has been shown to be effective in detecting deafness-causing single-nucleotide variants (SNVs) and insertion/deletions (INDELs). After analyzing the WES data for causative SNVs or INDELs involving previously reported deafness genes in 78 families with ARNSHL, we searched for copy number variants (CNVs) through two different tools in 24 families that remained unresolved. We detected large homozygous deletions in STRC and OTOA in single families. Thus, causative CNVs in known deafness genes explain 2 out of 78 (2.6%) families in our sample set. We conclude that CNVs can be reliably detected through WES and should be part of pipelines used to clarify the genetic basis of hearing loss.


“…Diagnostic and research laboratories, whether public or private, therefore tend to search for coding variants, most of which can be detected by WES, first. Such variants can also be detected by WGS, and several studies previously compared WES and WGS for different types of variations and/or in different contexts (9, 11-16), but none of them in a really comprehensive manner. Here, we compared WES and WGS, in terms of detection rates and quality, for single-nucleotide variants (SNVs), small insertions/deletions (indels), and copy-number variants (CNVs) within the regions of the human genome covered by WES, using the most recent next-generation sequencing (NGS) technologies.…”

mentioning

confidence: 99%

Whole-genome sequencing is more powerful than whole-exome sequencing for detecting exome variants

Belkadi, Bolze, Itan, et al. 2015. Proc. Natl. Acad. Sci. U.S.A.


We compared whole-exome sequencing (WES) and whole-genome sequencing (WGS) in six unrelated individuals. In the regions targeted by WES capture (81.5% of the consensus coding genome), the mean numbers of single-nucleotide variants (SNVs) and small insertions/deletions (indels) detected per sample were 84,192 and 13,325, respectively, for WES, and 84,968 and 12,702, respectively, for WGS. For both SNVs and indels, the distributions of coverage depth, genotype quality, and minor read ratio were more uniform for WGS than for WES. After filtering, a mean of 74,398 (95.3%) high-quality (HQ) SNVs and 9,033 (70.6%) HQ indels were called by both platforms. A mean of 105 coding HQ SNVs and 32 indels was identified exclusively by WES whereas 692 HQ SNVs and 105 indels were identified exclusively by WGS. We Sanger-sequenced a random selection of these exclusive variants. For SNVs, the proportion of false-positive variants was higher for WES (78%) than for WGS (17%). The estimated mean number of real coding SNVs (656 variants, ∼3% of all coding HQ SNVs) identified by WGS and missed by WES was greater than the number of SNVs identified by WES and missed by WGS (26 variants). For indels, the proportions of false-positive variants were similar for WES (44%) and WGS (46%). Finally, WES was not reliable for the detection of copy-number variations, almost all of which extended beyond the targeted regions. Although currently more expensive, WGS is more powerful than WES for detecting potential disease-causing mutations within WES regions, particularly those due to SNVs.

Whole-exome sequencing (WES) is routinely used and is gradually being optimized for the detection of rare and common genetic variants in humans (1-8). However, whole-genome sequencing (WGS) is becoming increasingly attractive as an alternative, due to its broader coverage and decreasing cost (9-11). It remains difficult to interpret variants lying outside the protein-coding regions of the genome. Diagnostic and research laboratories, whether public or private, therefore tend to search for coding variants, most of which can be detected by WES, first. Such variants can also be detected by WGS, and several studies previously compared WES and WGS for different types of variations and/or in different contexts (9, 11-16), but none of them in a really comprehensive manner. Here, we compared WES and WGS, in terms of detection rates and quality, for single-nucleotide variants (SNVs), small insertions/deletions (indels), and copy-number variants (CNVs) within the regions of the human genome covered by WES, using the most recent next-generation sequencing (NGS) technologies. We aimed to identify the most efficient and reliable approach for identifying these variants in coding regions of the genome, to define the optimal analytical filters for decreasing the frequency of false-positive variants, and to characterize the genes that were either hard to sequence by either approach or were poorly covered by WES kits.

Results

We compared the two NGS techniques, perform...


“…Identification of somatic and germline mosaic events or copy number variants remain complicated despite the progress made in the field [31,32]. Copy number variants have an important contribution to many Mendelian disorders [33-35] and are often overlooked or disregarded in NGS studies. The pathogenic variant can also be unsequenced due to lack of targeting, capturing, or bad mapping quality.…”

Section: Gene Identification Studies In Mendelian Disorders (mentioning)

confidence: 99%

Lessons learned from gene identification studies in Mendelian epilepsy disorders

et al. 2015


Next-generation sequencing (NGS) technologies are now routinely used for gene identification in Mendelian disorders. Setting up cost-efficient NGS projects and managing the large number of variants remains, however, a challenging job. Here we provide insights into the decision-making processes before and after the use of NGS in gene identification studies. Genetic factors are thought to have a role in ~70% of all epilepsies, and a variety of inheritance patterns have been described for seizure-associated gene defects. We therefore chose epilepsy as disease model and selected 35 NGS studies that focused on patients with a Mendelian epilepsy disorder. The strategies used for gene identification and their respective outcomes were reviewed. High-throughput NGS strategies have led to the identification of several new epilepsy-causing genes, enlarging our knowledge on both known and novel pathomechanisms. NGS findings have furthermore extended the awareness of phenotypical and genetic heterogeneity. By discussing recent studies we illustrate: (I) the power of NGS for gene identification in Mendelian disorders, (II) the accelerating pace at which this field evolves, and (III) the considerations that have to be made when performing NGS studies. Despite the enormous rise in gene discovery over the last decade, many patients and families included in gene identification studies still remain without a molecular diagnosis; hence, further genetic research is warranted. On the basis of successful NGS studies in epilepsy, we discuss general approaches to guide human geneticists and clinicians in setting up cost-efficient gene identification NGS studies.

