2019
DOI: 10.1101/612176
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

deSALT: fast and accurate long transcriptomic read alignment with de Bruijn graph-based index

Abstract: Long-read RNA sequencing (RNA-seq) is promising to transcriptomics studies, however, the alignment of long RNA-seq reads is still non-trivial due to high sequencing errors and complicated gene structures. Herein, we propose deSALT, a tailored two-pass alignment approach, which constructs graph-based alignment skeletons to infer exons and uses them to generate spliced reference sequences to produce refined alignments. deSALT addresses several difficult technical issues, such as small exons and sequencing errors… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
13
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 11 publications
(13 citation statements)
references
References 38 publications
0
13
0
Order By: Relevance
“…289 ), NanoBLASTer 290 , mapAlign 291 , GraphAligner 292 , smsmap 293 , lordFAST 294 , S-conLSH 295 , QAlign 296 Splice-aware minimap2, Graphmap2 (ref. 102 ), GmAP 100 , STAR 101 , deSALT 103 , magic-BLAST 297 , Deep-Long 298 , uLTRA 299 Genome assembly Canu, miniasm 107 , Flye 110 , Redbean/wtdbg2 (ref. 111 ), Falcon-Unzip 300 , Shasta 164 , Raven 301 , NextDenovo (https://github.com/Nextomics/NextDenovo), Peregrine 302 , HINGe 303 , TULIP 304 , NeCAT 305 metagenome tailored metaFlye 306 , OPeRA-mS (hybrid) 106 Haplotype-aware Hifiasm 307 Genome polishing Nanopolish, Racon 308 , medaka (https://github.com/nanoporetech/medaka), NeuralPolish 309 , PePPeR-margin-DeepVariant 310 , NextPolish (https://github.com/Nextomics/NextPolish), POLCA 311 , HomoPolish 38 SV detection Sniffles 98 , SVIm 312 , NanoSV 112 , Picky 33 , NanoVar 113 , Dysgu 313 , SeNSV 314 , cuteSV 315 SNV detection LongShot 116 , DeepVariant 310 , iGDA 316 , Nanopanel2 (ref.…”
Section: Box 1 | Ont Devicesmentioning
confidence: 99%
See 1 more Smart Citation
“…289 ), NanoBLASTer 290 , mapAlign 291 , GraphAligner 292 , smsmap 293 , lordFAST 294 , S-conLSH 295 , QAlign 296 Splice-aware minimap2, Graphmap2 (ref. 102 ), GmAP 100 , STAR 101 , deSALT 103 , magic-BLAST 297 , Deep-Long 298 , uLTRA 299 Genome assembly Canu, miniasm 107 , Flye 110 , Redbean/wtdbg2 (ref. 111 ), Falcon-Unzip 300 , Shasta 164 , Raven 301 , NextDenovo (https://github.com/Nextomics/NextDenovo), Peregrine 302 , HINGe 303 , TULIP 304 , NeCAT 305 metagenome tailored metaFlye 306 , OPeRA-mS (hybrid) 106 Haplotype-aware Hifiasm 307 Genome polishing Nanopolish, Racon 308 , medaka (https://github.com/nanoporetech/medaka), NeuralPolish 309 , PePPeR-margin-DeepVariant 310 , NextPolish (https://github.com/Nextomics/NextPolish), POLCA 311 , HomoPolish 38 SV detection Sniffles 98 , SVIm 312 , NanoSV 112 , Picky 33 , NanoVar 113 , Dysgu 313 , SeNSV 314 , cuteSV 315 SNV detection LongShot 116 , DeepVariant 310 , iGDA 316 , Nanopanel2 (ref.…”
Section: Box 1 | Ont Devicesmentioning
confidence: 99%
“…Other aligners have also been developed, such as Graphmap2 (ref. 102 ) and deSALT 103 , for ONT transcriptome data. Especially for ONT direct RNA-sequencing reads with dense base modifications, Graphmap2 has a higher alignment rate than minimap2 (ref.…”
Section: Continuedmentioning
confidence: 99%
“…Sequence comparison using reference-based and reference-free alignments. Our further analysis mostly relies on read mappings, so at first we aligned reads using different tools: widely popular minimap2 (Li 2018) and specialized transcriptome aligners deSALT (Liu et al 2019) and GraphMap2 (Marić et al 2019) to exclude a possible bias. Although all three aligners yield largely similar results, GraphMap2 produced slightly shorter alignments than minimap2 and deSALT with more prominent differences between aligned lengths of PacBio and ONT reads.…”
Section: Resultsmentioning
confidence: 99%
“…Alignment to the genome using deSALT. PacBio reads were mapped to GRCm38 mouse reference genome with deSALT v1.5.6 (Liu et al 2019) using -t 16 -x ccs options. ONT reads were mapped with -t 16 -x ont1d options.…”
Section: Methodsmentioning
confidence: 99%
“…In many cases these higher error rates can prevent the correct identification of isoforms (11)(12)(13). Although several alignment software (14)(15)(16)(17)(18) are optimized to handle these errors, their shortcomings confound transcript identification and annotation. Many reads cannot be aligned and regions where the sequencing error rates are higher such as UTRs frequently produce ambiguous alignments.…”
Section: Introductionmentioning
confidence: 99%