2021
DOI: 10.1101/2021.03.15.435283
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Deep learning-based pan-cancer classification model reveals cancer-specific gene expression signatures

Abstract: The identification of cancer-specific biomarkers and therapeutic targets is one of the primary goals of cancer genomics. Thousands of cancer genomes, exomes, and transcriptomes have been sequenced to date. In this study, we conducted a pan-cancer analysis of transcriptome datasets from 37 cancer types provided by The Cancer Genome Atlas (TCGA) in an effort to identify cancer-specific gene expression signatures. We employed deep neural networks to train a model on the transcriptome profile datasets for all canc… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
9
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(9 citation statements)
references
References 37 publications
0
9
0
Order By: Relevance
“…There are some limitations associated with our pan-cancer analysis-based assessment of cancer tissue-of-origin specific gene expression signatures [ 31 ]. For example, we only considered genes that were expressed at sufficiently high levels (≥5 FPKM) in at least 50% of samples within a cancer type.…”
Section: Discussionmentioning
confidence: 99%
“…There are some limitations associated with our pan-cancer analysis-based assessment of cancer tissue-of-origin specific gene expression signatures [ 31 ]. For example, we only considered genes that were expressed at sufficiently high levels (≥5 FPKM) in at least 50% of samples within a cancer type.…”
Section: Discussionmentioning
confidence: 99%
“…The prediction accuracy varies by tumor type, with some tumor types being more frequently mispredicted. Patterns of more frequent misclassifications among groups of cancers arising from the same organ (e.g., kidney renal clear cell carcinoma, kidney renal papillary cell carcinoma and kidney chromophobe carcinoma, or lung adenocarcinoma and lung squamous cell carcinoma), and/or among cancers represented by a small number of samples in the training set (e.g., cholangiocarcinoma, which is frequently predicted as liver hepatocellular carcinoma and vice versa), as noted in multiple studies (Bagge et al, 2018;Lyu and Haque, 2018;Bavafaye Haghighi et al, 2019;De Guia et al, 2019;Grewal et al, 2019;Zhao et al, 2020;Vibert et al, 2021;Divate et al, 2022;Jones et al, 2022;Moiso et al, 2022). All of this implies that the distribution of different cancer types in the training set is one of the key factors contributing to the prediction accuracy of the model.…”
Section: Performance Of Models For Tissue-oforigin Predictionmentioning
confidence: 99%
“…The majority of studies were based on deep learning, using neural networks of different architectures (Lyu and Haque, 2018;Azarkhalili et al, 2019;De Guia et al, 2019;He et al, 2020b;Mostavi et al, 2020;Zhao et al, 2020;Vibert et al, 2021;Divate et al, 2022;Hong et al, 2022;Jones et al, 2022;Moiso et al, 2022). Several studies utilized ensemble learning methods, in which the final prediction is a combination of multiple predictors (Grewal et al, 2019;He et al, 2020a;Ramroach et al, 2020;Chen et al, 2021;Liu et al, 2021).…”
Section: Machine Learning In Cancer Of Unknown Primary Classification...mentioning
confidence: 99%
See 2 more Smart Citations