2023
DOI: 10.1038/s41467-023-37266-6
|View full text |Cite
|
Sign up to set email alerts
|

Integrated analysis of genomic and transcriptomic data for the discovery of splice-associated variants in cancer

Abstract: Somatic mutations within non-coding regions and even exons may have unidentified regulatory consequences that are often overlooked in analysis workflows. Here we present RegTools (www.regtools.org), a computationally efficient, free, and open-source software package designed to integrate somatic variants from genomic data with splice junctions from bulk or single cell transcriptomic data to identify variants that may cause aberrant splicing. We apply RegTools to over 9000 tumor samples with both tumor DNA and … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
29
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 47 publications
(29 citation statements)
references
References 89 publications
(129 reference statements)
0
29
0
Order By: Relevance
“…IRIS [24] is a modular neoantigen prediction pipeline based on RNA-seq and supports customized pipelines. Regtools [28] provides functionalities to integrate DNA sequencing and RNA-seq data to identify potential splice-associated variants. DICAST [44] integrates several RNA-seq splicing tools for unified junction analysis.…”
Section: Discussionmentioning
confidence: 99%
“…IRIS [24] is a modular neoantigen prediction pipeline based on RNA-seq and supports customized pipelines. Regtools [28] provides functionalities to integrate DNA sequencing and RNA-seq data to identify potential splice-associated variants. DICAST [44] integrates several RNA-seq splicing tools for unified junction analysis.…”
Section: Discussionmentioning
confidence: 99%
“…I. Li et al 2017): reads were aligned to hg38 using STAR v2.7.1 with parameters ‘--twopassMode’ and ‘--outSAMstrandField intronMotif’ (Dobin et al 2013). Sequencing reads that overlap exon-exon junctions were counted using ‘regtools junction extract’ v0.5.2 with parameters ‘-a 8 -m 50 -M 500000 -s 1’ (Cotto et al 2023). Next, to define intron clusters using LeafCutter, we ran the ‘leafcutter_cluster_regtools.py’ python script with ‘-m 30 -p 0.01 -l 500000 parameters’ (Y.…”
Section: Methodsmentioning
confidence: 99%
“…A375 gene expression, percent gene nuclear, and percent junction nuclear were calculated from A375 RNA-seq data generated in this paper (see ‘Methods: Illumina short read sequencing analysis’ and Supplemental Table S3 for additional information). Briefly, gene counts were obtained using featureCounts (Liao, Smyth, and Shi 2013) and were normalized to RPKM (Reads Per Kilobase Million) and junction counts were obtained using RegTools (Cotto et al 2023). Percent nuclear values were obtained by dividing the nuclear read counts by the sum of the nuclear and cytoplasmic read counts for a gene or junction.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…To quantify splicing in the RNA-seq datasets, we extracted junction reads by running regtools version 0.5.2 80 on the filtered BAM files, requiring a minimum intron length of 20 bp. We merged the resulting .junc files into a single database of observed splice junctions and splice junction read counts across all samples.…”
Section: Methodsmentioning
confidence: 99%