2019
DOI: 10.1101/572545
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

ExpansionHunter: A sequence-graph based tool to analyze variation in short tandem repeat regions

Abstract: We describe a novel computational method for genotyping repeats using sequence graphs. This method addresses the long-standing need to accurately genotype medically important loci containing repeats adjacent to other variants or imperfect DNA repeats such as polyalanine repeats. Here we introduce a new version of our repeat genotyping software, ExpansionHunter, that uses this method to perform targeted genotyping of a broad class of such loci. Availability and implementation:ExpansionHunter is implemented in C… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

3
65
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 46 publications
(68 citation statements)
references
References 21 publications
3
65
0
Order By: Relevance
“…Graphs provide a natural way of expressing variation or uncertainty in a genome [ 1 , 2 ]. They have been used for diverse applications such as genome assembly [ 3 5 ], error correction [ 6 8 ], short tandem repeat genotyping [ 9 ], structural variation genotyping [ 10 ], and reference-free haplotype reconstruction [ 11 ]. With the growing usage of graphs, methods for handling graphs efficiently are becoming a crucial requirement for many applications.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Graphs provide a natural way of expressing variation or uncertainty in a genome [ 1 , 2 ]. They have been used for diverse applications such as genome assembly [ 3 5 ], error correction [ 6 8 ], short tandem repeat genotyping [ 9 ], structural variation genotyping [ 10 ], and reference-free haplotype reconstruction [ 11 ]. With the growing usage of graphs, methods for handling graphs efficiently are becoming a crucial requirement for many applications.…”
Section: Introductionmentioning
confidence: 99%
“…In 2002, partial order alignment [ 19 ] (POA), a special case of Navarro’s algorithm for acyclic graphs, was published for multiple sequence alignment. Although POA is defined only for acyclic graphs, it can be extended to cyclic graphs by unfolding cyclic components, which is the approach taken by the VG toolkit [ 16 ] and ExpansionHunter [ 9 ]. The practical efficiency of this unfolding depends on the read length, and the graph topology and complex cyclic areas can lead to very large unfolded graphs [ 20 ].…”
Section: Introductionmentioning
confidence: 99%
“…VNTRs are associated with a wide spectrum of complex traits and diseases including attention-deficit disorder, Type 1 Diabetes and schizophrenia (Hannan 2018) . While STR variation has been profiled in human populations (Mallick et al 2016) and to find expression quantitative trait loci (eQTL) (Fotsing et al 2019;Gymrek et al 2016) , and variation at VNTR sequences may be detected for targeted loci (Bakhtiari et al 2018;Dolzhenko et al 2019) , the landscape of VNTR variation in populations and effects on human phenotypes are not yet examined genome-wide. Large scale sequencing studies including the 1000 Genomes Project (1000Genomes Project Consortium et al 2015 , TOPMed (Taliun et al 2019) and DNA sequencing by the Genotype-Tissue Expression (GTEx) project (G. Consortium and GTEx Consortium 2017) rely on high-throughput sequencing (SRS) characterized by SRS reads up to 150 bases.…”
Section: Introductionmentioning
confidence: 99%
“…Another alternative, less described, could be the use of dedicated tools for each type of insertion, instead of using only general-purpose SV callers. Among them, Expansion Hunter has been designed to detect tandem repeats, Pamir and Popins for novel insertions and TARDIS for large duplications [ 28 31 ].…”
Section: Discussionmentioning
confidence: 99%