2023
DOI: 10.1038/s41467-023-38553-y
|View full text |Cite
|
Sign up to set email alerts
|

Reference-free assembly of long-read transcriptome sequencing data with RNA-Bloom2

Abstract: Long-read sequencing technologies have improved significantly since their emergence. Their read lengths, potentially spanning entire transcripts, is advantageous for reconstructing transcriptomes. Existing long-read transcriptome assembly methods are primarily reference-based and to date, there is little focus on reference-free transcriptome assembly. We introduce “RNA-Bloom2 [https://github.com/bcgsc/RNA-Bloom]”, a reference-free assembly method for long-read transcriptome sequencing data. Using simulated dat… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
10
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
7
1
1

Relationship

2
7

Authors

Journals

citations
Cited by 15 publications
(10 citation statements)
references
References 52 publications
0
10
0
Order By: Relevance
“…For example, Minstrobes have been used for long-read overlap detection ( Firtina et al 2023 ) and alternating strobe lengths have also been explored ( Maier and Sahlin 2023 ). However, randstrobes were shown to be more sensitive for sequence matching than other methods using fixed strobe lengths (minstrobes and hybridstrobes) ( Sahlin 2021a ), and simpler to construct than alternating strobe lengths (altstrobes and multistrobes) ( Maier and Sahlin 2023 ), and is so far most commonly implemented in practice ( Sahlin 2022 , Nip et al 2023 , Xu et al 2023 ). Therefore, we will consider only the randstrobes method in this study.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…For example, Minstrobes have been used for long-read overlap detection ( Firtina et al 2023 ) and alternating strobe lengths have also been explored ( Maier and Sahlin 2023 ). However, randstrobes were shown to be more sensitive for sequence matching than other methods using fixed strobe lengths (minstrobes and hybridstrobes) ( Sahlin 2021a ), and simpler to construct than alternating strobe lengths (altstrobes and multistrobes) ( Maier and Sahlin 2023 ), and is so far most commonly implemented in practice ( Sahlin 2022 , Nip et al 2023 , Xu et al 2023 ). Therefore, we will consider only the randstrobes method in this study.…”
Section: Methodsmentioning
confidence: 99%
“…Randstrobes have been used, e.g. in for short-read mapping ( Sahlin 2022 ), transcriptomic long-read normalization ( Nip et al 2023 ), and read classification ( Xu et al 2023 ). Our recent study also demonstrates that randstrobes provide accurate sequence similarity ranking using the Jaccard distance ( Maier and Sahlin 2023 ).…”
Section: Introductionmentioning
confidence: 99%
“…As recommended for For RNA-Bloom2 [ 68 ], Pychopper-processed reads were used in the default mode (java -jar RNA-Bloom.jar -long processed.fq -outdir RNA-bloom_out/). We also tested short read-based correction of RNA-bloom2 assembly (java -jar RNA-Bloom.jar -long processed.fq -sef short-read.fastq -outdir RNA-bloom_out/).…”
Section: Comparison Of Transcript Assembly Programsmentioning
confidence: 99%
“…RNA-Bloom (Nip et al 2020) is a Java based assembly algorithm originally designed for single-cell short-read transcriptome assembly combining bloom filters with De Bruijn graph assembly. This approach to ab initio transcriptome assembly was improved upon and applied to long-read bulk sequencing data with RNA-Bloom2 (Nip et al 2023). The latter uses digital normalization with strobemers-a strategy reported to be less sensitive to mutations than k-mers-before assembling and polishing unitigs (Sahlin 2021).…”
Section: Introductionmentioning
confidence: 99%