2017
DOI: 10.1186/s13100-017-0088-x
|View full text |Cite
|
Sign up to set email alerts
|

LoRTE: Detecting transposon-induced genomic variants using low coverage PacBio long read sequences

Abstract: BackgroundPopulation genomic analysis of transposable elements has greatly benefited from recent advances of sequencing technologies. However, the short size of the reads and the propensity of transposable elements to nest in highly repeated regions of genomes limits the efficiency of bioinformatic tools when Illumina or 454 technologies are used. Fortunately, long read sequencing technologies generating read length that may span the entire length of full transposons are now available. However, existing TE pop… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
20
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 29 publications
(20 citation statements)
references
References 23 publications
0
20
0
Order By: Relevance
“…Their repetitive nature and large size (e.g., the Ty1 is approximately 6 kb) present major challenges to genome assembly, and traditional alignment pipelines will miss new insertions due to their absence in the reference genome. There have been many advances in the computational detection of TEs using short read sequencing data ( Ewing 2015 ; Rishishwar et al 2017 ), and long-read sequencing will likely represent the new gold standard for TE annotation ( Disdero and Filée 2017 ; Bergman 2018 ; Kutter et al 2018 ; Shahid and Slotkin 2020 ). However, there is still a wide range of false positives and false negatives associated with computational methods, and long-read sequencing is currently more expensive and less high-throughput than short read methods.…”
Section: Resultsmentioning
confidence: 99%
“…Their repetitive nature and large size (e.g., the Ty1 is approximately 6 kb) present major challenges to genome assembly, and traditional alignment pipelines will miss new insertions due to their absence in the reference genome. There have been many advances in the computational detection of TEs using short read sequencing data ( Ewing 2015 ; Rishishwar et al 2017 ), and long-read sequencing will likely represent the new gold standard for TE annotation ( Disdero and Filée 2017 ; Bergman 2018 ; Kutter et al 2018 ; Shahid and Slotkin 2020 ). However, there is still a wide range of false positives and false negatives associated with computational methods, and long-read sequencing is currently more expensive and less high-throughput than short read methods.…”
Section: Resultsmentioning
confidence: 99%
“…For example, the short length of the next-generation sequencing reads causes drastic ambiguity and imprecision in mapping the TE reads. Recent advance of the long read sequencing of PacBio ( Disdero and Filée, 2017 ) and Oxford Nanopore ( Debladis et al, 2017 ) is expected to overcome this shortcoming. In addition, due to the multiplicity of TEs in the genome and possible redundancy between them, the genetic analyses of TEs have been challenging.…”
Section: Discussionmentioning
confidence: 99%
“…Others on the contrary map first to the consensus sequence of previously identified TEs and map afterwards the paired read onto the genome, such as TRACKPOSON [108]. A more recent approach using long Pacbio reads has also been reported, but it still relies on the good annotation of reference TE sequences [109]. Another approach infers insertion events using the misalignment and split alignment of short reads without relying on a database of previously identified TEs.…”
Section: Bioinformatic Analysis Of Te Sequences In Genomesmentioning
confidence: 99%