2017
DOI: 10.12688/f1000research.11254.1
|View full text |Cite
|
Sign up to set email alerts
|

DangerTrack: A scoring system to detect difficult-to-assess regions

Abstract: Over recent years, multiple groups have shown that a large number of structural variants, repeats, or problems with the underlying genome assembly have dramatic effects on the mapping, calling, and overall reliability of single nucleotide polymorphism calls. This project endeavored to develop an easy-to-use track for looking at structural variant and repeat regions. This track, DangerTrack, can be displayed alongside the existing Genome Reference Consortium assembly tracks to warn clinicians and biologists whe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 12 publications
(10 citation statements)
references
References 8 publications
0
10
0
Order By: Relevance
“…S4). The short reads showed a few regions of extreme amplification (>100-fold) that were not detected by the long reads, although subsequent analysis showed these regions were highly enriched for low mappability regions (Dolgalev et al 2017) in the genome and therefore most likely to be mapping artifacts (Supplemental Fig. S5).…”
Section: Read Mapping and Copy Number Analysismentioning
confidence: 99%
“…S4). The short reads showed a few regions of extreme amplification (>100-fold) that were not detected by the long reads, although subsequent analysis showed these regions were highly enriched for low mappability regions (Dolgalev et al 2017) in the genome and therefore most likely to be mapping artifacts (Supplemental Fig. S5).…”
Section: Read Mapping and Copy Number Analysismentioning
confidence: 99%
“…Figures 3a-c show alignment rate and correctness stratified by the number of nonreference SNVs overlapped by a read. To obtain these subsets, we first removed reads originating from reference-genome regions deemed repetitive by DangerTrack [34] (score over 250). We did this after finding that these regions had a combination of low SNV density and repetitive content that caused the 0-SNV stratum to behave very differently from the others.…”
Section: Stratification By Variant Density Variant Rarity and Repetmentioning
confidence: 99%
“…In the "constrained" model, to mimic the observed "geographical clustering" of breakpoints, a random percentage between 50 and 100% of breakpoints had to occur in the same chromosome while the remaining ones were free. As we know that some genomic regions are poorly analyzed for structural variants, we used the "DangerTrack" bed file (Dolgalev et al 2017) to exclude all breakpoints simulated in these regions and be comparable with the observed dataset. A single breakpoint of our in-house dataset overlaps the DangerTrack.…”
Section: Breakpoint Simulationmentioning
confidence: 99%