2018
DOI: 10.1261/rna.062802.117
|View full text |Cite
|
Sign up to set email alerts
|

Biological classification with RNA-seq data: Can alternatively spliced transcript expression enhance machine learning classifiers?

Abstract: RNA sequencing (RNA-seq) is becoming a prevalent approach to quantify gene expression and is expected to gain better insights into a number of biological and biomedical questions compared to DNA microarrays. Most importantly, RNA-seq allows us to quantify expression at the gene or transcript levels. However, leveraging the RNA-seq data requires development of new data mining and analytics methods. Supervised learning methods are commonly used approaches for biological data analysis that have recently gained at… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

2
31
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 41 publications
(33 citation statements)
references
References 70 publications
2
31
0
Order By: Relevance
“…S1 A and B). This result is consistent with prior observations that the exon-level splicing analysis is more robust against batch effects and other confounding factors in large-scale RNA-Seq datasets (35)(36)(37).…”
Section: Resultssupporting
confidence: 92%
See 1 more Smart Citation
“…S1 A and B). This result is consistent with prior observations that the exon-level splicing analysis is more robust against batch effects and other confounding factors in large-scale RNA-Seq datasets (35)(36)(37).…”
Section: Resultssupporting
confidence: 92%
“…Metaanalyses of RNA-Seq data with gene-or isoform-level counts are subject to confounding batch effects and rely on existing isoform annotation (33). Exon-level analysis, however, uses a ratio-based methodology to estimate exon incorporation, which may be more robust against batch effects and confounding factors in large-scale RNA-Seq datasets (34)(35)(36)(37). In addition, exon-level analysis can detect novel exon-exon junctions and is thus independent of previous annotation.…”
Section: Resultsmentioning
confidence: 99%
“…Gene expression can be simply defined as a function of one or more factors of the environment, lifestyle, and genetics. RNA-Seq technology has become a prevalent approach to quantify gene expression that is expected to gain better insights to a number of biological and biomedical questions, compared to DNA microarray technology (Johnson, Dhroso, Hughes, & Korkin, 2018). The processing of RNA-Seq gene expression includes many stages to obtain data matrix (RNASeqV2 level 3 expression data) (MIT and Harvard, 2016).…”
Section: Introductionmentioning
confidence: 99%
“…The emergence of high throughput sequencing of large disease cohorts [19][20][21] , and the remarkable efforts to aggregate and annotate these mutations in an accessible infrastructure such as ClinVar 22 , now provides an unprecedented opportunity to apply novel deep learning approaches to predict mutations that affect pre-mRNA splicing 23 . The potential of developing such models will continue to increase as next generation transcriptome sequencing (RNASeq) data are amassed and curation of the associated mutational processes matures [23][24][25][26] .…”
mentioning
confidence: 99%