2024
DOI: 10.1101/2024.02.27.581927
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Exploring a large cancer cell line RNA-sequencing dataset with k-mers

Chloé Bessière,
Haoliang Xue,
Benoit Guibert
et al.

Abstract: Analyzing the immense diversity of RNA isoforms in large RNA-seq repositories requires laborious data processing using specialized tools. Indexing techniques based on k-mers have previously been effective at searching for RNA sequences across thousands of RNA-seq libraries but falling short of enabling direct RNA quantification. We show here that RNAs queried in the form of k-mer sets can be quantified in seconds, with a precision akin to that of conventional RNA quantification methods. We showcase several app… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
3
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
2
2

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 34 publications
0
3
0
Order By: Relevance
“…However, recent advances in the indexing of large-scale biological sequence datasets have brought solutions leading to different large scale queryable indexes. Examples include the entirety of assembled bacterial genomes [5], or creating a searchable, quantitative index for RNA-seq data in cancer research [1]. Various methodologies, all of them based on k -mers, have been introduced to this extent.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…However, recent advances in the indexing of large-scale biological sequence datasets have brought solutions leading to different large scale queryable indexes. Examples include the entirety of assembled bacterial genomes [5], or creating a searchable, quantitative index for RNA-seq data in cancer research [1]. Various methodologies, all of them based on k -mers, have been introduced to this extent.…”
Section: Introductionmentioning
confidence: 99%
“…Examples include the entirety of assembled bacterial genomes [5], or creating a searchable, quantitative index for RNA-seq data in cancer research [1]. Various methodologies, all of them based on k-mers, have been introduced to this extent.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation