2020
DOI: 10.1093/bioinformatics/btaa634
|View full text |Cite
|
Sign up to set email alerts
|

TALC: Transcript-level Aware Long-read Correction

Abstract: Motivation Long-read sequencing technologies are invaluable for determining complex RNA transcript architectures but are error-prone. Numerous “hybrid correction” algorithms have been developed for genomic data that correct long reads by exploiting the accuracy and depth of short reads sequenced from the same sample. These algorithms are not suited for correcting more complex transcriptome sequencing data. Results We have cre… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
15
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 15 publications
(15 citation statements)
references
References 61 publications
0
15
0
Order By: Relevance
“…Raw sequence data were then base-called using Guppy v3.6.1. Once the raw signal from the MinION fast5 files was converted into fastq files, the sequencing errors were corrected using TALC v1.01 (74)(https://gitlab.igh.cnrs.fr/lbroseus/TALC) by using the Illumina short reads sequence of kuruma shrimp hemocytes (DDBJ Sequence Read Archive (DRA) accession number DRA004781). The corrected long-read sequences from MinION and short-read sequences from Illumina Miseq were hybrid de novo assembled using rnaSPAdes v3.14.1 (75)(https://cab.spbu.ru/software/rnaspades/) and Trinity 2.10.0 (76)(https://github.com/trinityrnaseq/trinityrnaseq/wiki).…”
Section: Methodsmentioning
confidence: 99%
“…Raw sequence data were then base-called using Guppy v3.6.1. Once the raw signal from the MinION fast5 files was converted into fastq files, the sequencing errors were corrected using TALC v1.01 (74)(https://gitlab.igh.cnrs.fr/lbroseus/TALC) by using the Illumina short reads sequence of kuruma shrimp hemocytes (DDBJ Sequence Read Archive (DRA) accession number DRA004781). The corrected long-read sequences from MinION and short-read sequences from Illumina Miseq were hybrid de novo assembled using rnaSPAdes v3.14.1 (75)(https://cab.spbu.ru/software/rnaspades/) and Trinity 2.10.0 (76)(https://github.com/trinityrnaseq/trinityrnaseq/wiki).…”
Section: Methodsmentioning
confidence: 99%
“…Direct RNA reads were corrected using TALC (version 1.0) [ 36 ] with default parameters (except -k = 21) and Illumina reads from leaf and root samples of B. napus downloaded from the EBI database (ERX397788 and ERX397800). Before the correction step, raw reads were masked using DustMasker (version 1.0.0 from the BLAST 2.10.0 package) [ 37 ], and we kept reads with ≥150 unmasked positions and ≥75% of unmasked bases.…”
Section: Introductionmentioning
confidence: 99%
“…However, a correction step results in more mapped reads and a better definition of the intronic boundaries. The most efficient approach for correcting long reads in transcriptomic data is to use short (second generation) sequencing reads of the same samples (Amarasinghe et al, 2020; Hackl, Hedrich, Schultz, & Förster, 2014). For reasonable runtimes, ease of implementation and good correction results, we recommend using either TALC (Broseus et al, 2020), LoRDEC (Salmela & Rivals, 2014) or CoLoRMap (Haghshenas, Hach, Sahinalp, & Chauve, 2016). It is worth noting that hybrid correction gives excellent results even with data sourced from different laboratories (Broseus et al, 2020).…”
Section: Computational Detection Of Irmentioning
confidence: 99%