2022
DOI: 10.1038/s41587-021-01158-1
|View full text |Cite
|
Sign up to set email alerts
|

Curated variation benchmarks for challenging medically relevant autosomal genes

Abstract: The repetitive nature and complexity of some medically relevant genes poses a challenge for their accurate analysis in a clinical setting. The Genome in a Bottle Consortium has provided variant benchmark sets, but these exclude nearly four hundred medically relevant genes due to their repetitiveness or polymorphic complexity. Here we characterize 273 of these 395 challenging autosomal genes using a haplotype-resolved whole-genome assembly. This curated benchmark reports over 17,000 single nucleotide variations… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

4
202
0

Year Published

2022
2022
2025
2025

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 143 publications
(206 citation statements)
references
References 72 publications
4
202
0
Order By: Relevance
“…We then manually investigated the SVs that were exclusively called by , discovering that all them exhibited two alleles, one per haplotype (i.e., heterozygous SVs with two non-reference alleles). This result confirms previous findings [52] that heterozygous insertions in tandem repeats are among the most challenging classes of SVs to discover with current methods.…”
Section: Resultssupporting
confidence: 92%
See 4 more Smart Citations
“…We then manually investigated the SVs that were exclusively called by , discovering that all them exhibited two alleles, one per haplotype (i.e., heterozygous SVs with two non-reference alleles). This result confirms previous findings [52] that heterozygous insertions in tandem repeats are among the most challenging classes of SVs to discover with current methods.…”
Section: Resultssupporting
confidence: 92%
“…To perform a more thorough analysis of the HG002 individual, we considered the CMRG (Challenging Medically Relevant Genes) callset provided in [52] and we evaluated callers’ accuracy against it. The CMRG callset consists of 250 SVs falling in 126 challenging and medically relevant genes that were excluded from the previously published GIAB benchmark [65] due to their complexity: compound heterozygous insertions, complex variants in segmental duplications, and long tandem repeats.…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations