2014
DOI: 10.1371/journal.pone.0109384
|View full text |Cite|
|
Sign up to set email alerts
|

Indexes of Large Genome Collections on a PC

Abstract: The availability of thousands of individual genomes of one species should boost rapid progress in personalized medicine or understanding of the interaction between genotype and phenotype, to name a few applications. A key operation useful in such analyses is aligning sequencing reads against a collection of genomes, which is costly with the use of existing algorithms due to their large memory requirements. We present MuGI, Multiple Genome Index, which reports all occurrences of a given pattern, in exact and ap… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
37
0

Year Published

2016
2016
2019
2019

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 38 publications
(37 citation statements)
references
References 34 publications
0
37
0
Order By: Relevance
“…Past efforts that evaluated graph aligners have been selective about what variants to include in the graph, but without a clear rationale. Some included all variants from a defined subset of strains or haplotypes [6,23,27] or from a database such as the 1000 Genomes Project callset [2] or dbSNP [32]. In some cases, variants were filtered according to ethnicity, e.g.…”
Section: Variant Selection and Evaluationmentioning
confidence: 99%
See 1 more Smart Citation
“…Past efforts that evaluated graph aligners have been selective about what variants to include in the graph, but without a clear rationale. Some included all variants from a defined subset of strains or haplotypes [6,23,27] or from a database such as the 1000 Genomes Project callset [2] or dbSNP [32]. In some cases, variants were filtered according to ethnicity, e.g.…”
Section: Variant Selection and Evaluationmentioning
confidence: 99%
“…GCSA2 [10] indexes paths in arbitrary graphs and is implemented in the VG software tool [11] which can align reads to such indexes. MuGI [27] and GraphTyper [21] use k-mer-based indexes.…”
Section: Introductionmentioning
confidence: 99%
“…Most existing indexing schemes for sequence graphs attempt to index k-mers in the graph, and they can broadly be categorized as being either hashing-based or BWTbased. The first hashing-based approach was introduced by Schneeberger et al (2009), and several related approaches based on hashing k-mers have been put forward since then (Danek et al, 2014;Limasset et al, 2016;Eggertsson et al, 2017;Petrov et al, 2018).…”
Section: Related Workmentioning
confidence: 99%
“…After finding seed occurrences for a read in this graph, the alignment was refined locally using dynamic programming. Similar k-mer indexing on sequence graphs has since been used and extended in several read mapping tools such as MuGI [128], BGREAT [129] and VG 8 .…”
Section: Read Mappingmentioning
confidence: 99%