2020
DOI: 10.1186/s12859-020-03562-x
|View full text |Cite
|
Sign up to set email alerts
|

HPC-REDItools: a novel HPC-aware tool for improved large scale RNA-editing analysis

Abstract: Background: RNA editing is a widespread co-/post-transcriptional mechanism that alters primary RNA sequences through the modification of specific nucleotides and it can increase both the transcriptome and proteome diversity. The automatic detection of RNA-editing from RNA-seq data is computational intensive and limited to small data sets, thus preventing a reliable genome-wide characterisation of such process. Results: In this work we introduce HPC-REDItools, an upgraded tool for accurate RNA-editing events di… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
33
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 38 publications
(35 citation statements)
references
References 18 publications
0
33
0
Order By: Relevance
“…The de novo detection of RNA editing events was performed at the CINECA HPC Data Center (Italy) consuming ∼30 millions of CPU hours running an optimized version of our REDItools package whose algorithm scales almost linearly with the number of available cores ( 25 ). Indeed, the editing identification in massive transcriptome sequencing data is computationally intensive and time consuming, requiring the screening of the entire human genome, position by position, in order to look at nucleotide differences between RNA reads and the corresponding reference genomic site.…”
Section: Data Collection and Processingmentioning
confidence: 99%
See 1 more Smart Citation
“…The de novo detection of RNA editing events was performed at the CINECA HPC Data Center (Italy) consuming ∼30 millions of CPU hours running an optimized version of our REDItools package whose algorithm scales almost linearly with the number of available cores ( 25 ). Indeed, the editing identification in massive transcriptome sequencing data is computationally intensive and time consuming, requiring the screening of the entire human genome, position by position, in order to look at nucleotide differences between RNA reads and the corresponding reference genomic site.…”
Section: Data Collection and Processingmentioning
confidence: 99%
“…To speed up the browsing and traversing of aligned RNAseq reads, the editing detection was distributed over multiple computing nodes, each working on a given genomic interval. Moreover, the editing identification at nucleotide level was improved by a novel routine developed to increase the data loading efficiency, raising the algorithm performances of 8–10 times ( 25 ).…”
Section: Data Collection and Processingmentioning
confidence: 99%
“…Taking Huntington disease (HD) as an example, it also introduces how to compare the differences in RNA editing levels in different tissues. Moreover, they also developed high-performance HPC-REDItools for large-scale samples, which greatly improves the speed of operation ( 100 ). For the identification of RNA editing in tumor samples with only RNA-seq, we recommend HISAT2 to handle RNA-seq data and REDITools to conduct RNA editing calling.…”
Section: Rna Editing Site Identificationmentioning
confidence: 99%
“…REDItools 2[19, 20] and JACUSA[21] were used together to call the SNVs in the transcriptome of SARS-CoV-2. With REDItools 2, all SNVs within 15 nucleotides from the beginning or the end of the reads were removed to avoid artifacts due to misalignments.…”
Section: Methodsmentioning
confidence: 99%