2019
DOI: 10.1101/654566
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Genotyping structural variants in pangenome graphs using the vg toolkit

Abstract: Structural variants (SVs) remain challenging to represent and study relative to point mutations despite their demonstrated importance. We show that variation graphs, as implemented in the vg toolkit, provide an e ective means for leveraging SV catalogs for short-read SV genotyping experiments. We benchmarked vg against state-of-the-art SV genotypers using three sequence-resolved SV catalogs generated by recent long-read sequencing studies. In addition, we use assemblies from 12 yeast strains to show that graph… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

5
114
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 70 publications
(119 citation statements)
references
References 39 publications
5
114
0
Order By: Relevance
“…We constructed bovine variation-aware reference graphs using a Hereford-based linear reference sequence as backbone and variants that were filtered for allele frequency in four cattle breeds. Using both simulated and real short read data, our findings corroborate that a variation-aware reference facilitates accurate read mapping and unbiased sequence variant genotyping [22,23,[32][33][34].…”
Section: Discussionsupporting
confidence: 65%
See 2 more Smart Citations
“…We constructed bovine variation-aware reference graphs using a Hereford-based linear reference sequence as backbone and variants that were filtered for allele frequency in four cattle breeds. Using both simulated and real short read data, our findings corroborate that a variation-aware reference facilitates accurate read mapping and unbiased sequence variant genotyping [22,23,[32][33][34].…”
Section: Discussionsupporting
confidence: 65%
“…Sequence variant genotyping from graph-based alignments using vg call is currently somewhat limited to structural variants [32]. In order to detect SNPs and Indels from the variation-aware reference graph using widely-used sequence variant genotyping methods, we had to make the graph-based alignments compatible with linear coordinates.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Paragraph works by aligning and genotyping reads on a local sequence graph constructed for each targeted SV. This approach is different from other proposed and most existing graph methods that create a single whole-genome graph and align all reads to this large graph 18,39 . A whole-genome graph may be able to rescue reads from novel insertions that are misaligned to other parts of the genome in the original linear reference, however, the computational cost of building such a graph and performing alignment against this graph is very high.…”
Section: Discussionmentioning
confidence: 91%
“…Since SV genotyping tools can be applied to TE genotyping with additional TE annotation, we introduce some of the tools in this review to facilitate the development of better tools using this approach. Paragraph (Chen et al., 2019) and GraphTyper2 (Eggertsson et al., 2019) genotype SVs; vg toolkit (Hickey et al., 2020) genotypes SVs, SNVs, and indels; and SVJedi (Lecompte, Peterlongo, Lavenier, & Lemaitre, 2019) genotypes SVs for long‐read sequencing data. These graph alignment tools construct a genome graph based on detected SVs and align reads to it (Fig.…”
Section: Te Insertion Genotypingmentioning
confidence: 99%