2018
DOI: 10.1101/439026
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Accurate characterization of expanded tandem repeat length and sequence through whole genome long-read sequencing on PromethION

Abstract: De Roeck et al., PromethION sequencing and tandem repeat characterization 2 Abstract Tandem repeats (TRs) can cause disease through their length, sequence motif interruptions, and nucleotide modifications. For many TRs, however, these features are very difficult -if not impossible -to assess, requiring low-throughput and labor-intensive assays. One example is a VNTR in ABCA7 for which we recently discovered that expanded alleles strongly increase risk of Alzheimer's disease. Here, we investigated the potential… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
14
0

Year Published

2018
2018
2020
2020

Publication Types

Select...
6
3

Relationship

5
4

Authors

Journals

citations
Cited by 13 publications
(14 citation statements)
references
References 39 publications
0
14
0
Order By: Relevance
“…Finally, we also intend to add an optional dynamic time warping (DTW) step to UNCALLED, making it a full-scale signal-to-basepair aligner. This could aid in raw signal applications outside of ReadUntil, such as assembly polishing, identifying nucleotide modifications 3 , and classifying variable number tandem repeats 37 . UNCALLED could improve the sensitivity of such analyses by eliminating the need for basecalling, which can be error prone around such features.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Finally, we also intend to add an optional dynamic time warping (DTW) step to UNCALLED, making it a full-scale signal-to-basepair aligner. This could aid in raw signal applications outside of ReadUntil, such as assembly polishing, identifying nucleotide modifications 3 , and classifying variable number tandem repeats 37 . UNCALLED could improve the sensitivity of such analyses by eliminating the need for basecalling, which can be error prone around such features.…”
Section: Discussionmentioning
confidence: 99%
“…Deletions were characterized by first intersecting their reference coordinates with all RepeatMasker 28 entries downloaded from the UCSC Genome Browser 30 using bedtools 45 . If no overlap was found with RepeatMasker, we searched for overlaps with the "Simple Repeat" track from UCSC Genome Browser, which is based on Tandem Repeat Finder annotations 37 . Insertions were classified by extracting the insert sequence output by Sniffles for the PacBio HiFi SV calls, aligning this sequence to GRcH38 using BWA MEM 26 , and intersecting the alignments with RepeatMasker and Simple Repeats.…”
Section: Structural Variant Analysismentioning
confidence: 99%
“…Routine human genome sequencing has become possible on the recently commercially available PromethION sequencer. A PromethION flow cell has 3000 sensors and 12,000 pores, which generate on average 70 Gb of data in our hands, with a considerable variability (De Roeck et al 2018), allowing for the sequencing of a 20× covered human genome per flow cell. On PromethION devices, either 24 or 48 flow cells can be run simultaneously on the machine.…”
mentioning
confidence: 99%
“…There have been few reports on the use of long-read sequencing for the analysis of specific TRs implicated in diseases [2022]. Genotyping tools utilizing long-read sequencing data, such as Nanosatellite[21], RepeatHMM [23] and Tandem-genotypes [24] have been reported in the recent years with varying performance across different length of repeat units and repeat length. We reported VNTRTyper in Ganesamoorthy et.…”
Section: Introductionmentioning
confidence: 99%