2017
DOI: 10.1101/101493
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Superbubbles, Ultrabubbles and Cacti

Abstract: Abstract. A superbubble is a type of directed acyclic subgraph with single distinct source and sink vertices. In genome assembly and genetics, the possible paths through a superbubble can be considered to represent the set of possible sequences at a location in a genome. Bidirected and biedged graphs are a generalization of digraphs that are increasingly being used to more fully represent genome assembly and variation problems. Here we define snarls and ultrabubbles, generalizations of superbubbles for bidirec… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
4
2

Relationship

3
3

Authors

Journals

citations
Cited by 9 publications
(4 citation statements)
references
References 16 publications
0
4
0
Order By: Relevance
“…Both superbubbles and ultrabubbles can also identify nesting and overlapping relationships involving structural variants, which makes them a more expressive definition of a site than the reference coordinate. Paten et al (2017) show how this site nesting is naturally described by a cactus graph, a structure that globally organizes the sites without the need for any existing reference genome. However, this approach is incomplete-not all variation is nicely partitioned into these bubbles.…”
Section: Allelism In Graphsmentioning
confidence: 99%
“…Both superbubbles and ultrabubbles can also identify nesting and overlapping relationships involving structural variants, which makes them a more expressive definition of a site than the reference coordinate. Paten et al (2017) show how this site nesting is naturally described by a cactus graph, a structure that globally organizes the sites without the need for any existing reference genome. However, this approach is incomplete-not all variation is nicely partitioned into these bubbles.…”
Section: Allelism In Graphsmentioning
confidence: 99%
“…To account for heterozygosity in a diploid genome, we perform bubble detection. The notion of bubble we use is closely based on the ultrabubble concept as defined by Paten et al (2017) . Briefly, bubbles have the following properties: Two-node-connectivity .…”
Section: Diploid Assembly Pipelinementioning
confidence: 99%
“…In the next stage, we use VG’s snarl decomposition algorithm ( Paten et al , 2017 ) to detect the regions of heterozygosity, or snarls , in the sequence graph. This results in 29 071 bubbles.…”
Section: Datasets and Experimental Setupmentioning
confidence: 99%
“…In order to represent these more challenging types of variation, we use a variation graph . This is a type of sequence graph —a mathematical graph in which nodes represent elements of sequence, augmented with 5′ and 3′sides, and edges are drawn between sides if the adjacency of sequence is observed in the population cohort (see Paten et al , 2017 ). Haplotypes are embedded as paths through oriented nodes in the graph.…”
Section: Methodsmentioning
confidence: 99%