2020
DOI: 10.1093/gigascience/giaa076
|View full text |Cite
|
Sign up to set email alerts
|

Imputing missing RNA-sequencing data from DNA methylation by using a transfer learning–based neural network

Abstract: Abstract Background Gene expression plays a key intermediate role in linking molecular features at the DNA level and phenotype. However, owing to various limitations in experiments, the RNA-seq data are missing in many samples while there exist high-quality of DNA methylation data. Because DNA methylation is an important epigenetic modification to regulate gene expression, it can be used to p… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
43
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
2
1
1

Relationship

1
8

Authors

Journals

citations
Cited by 43 publications
(43 citation statements)
references
References 37 publications
0
43
0
Order By: Relevance
“…For instance, TDimpute aims at generating missing RNA-seq data by training a DNN on methylation data. They used TCGA and TARGET (https://ocg.cancer.gov/programs/target/data-matrix, accessed on 20 May 2021) data as proof of concept of the applicability of DNN for data imputation in a multi-omics integration study [120]. Because this integrative model can exploit information in different levels of regulatory mechanisms, it can build a more detailed model and achieve better performance than a model build on a single-omics dataset [117,121].…”
Section: Challenges For Deep Learning In Cancer Researchmentioning
confidence: 99%
“…For instance, TDimpute aims at generating missing RNA-seq data by training a DNN on methylation data. They used TCGA and TARGET (https://ocg.cancer.gov/programs/target/data-matrix, accessed on 20 May 2021) data as proof of concept of the applicability of DNN for data imputation in a multi-omics integration study [120]. Because this integrative model can exploit information in different levels of regulatory mechanisms, it can build a more detailed model and achieve better performance than a model build on a single-omics dataset [117,121].…”
Section: Challenges For Deep Learning In Cancer Researchmentioning
confidence: 99%
“…Yet, these methods are limited due to cancer data from small sample sizes (11,12). One common strategy to solve this problem is transfer learning, where models are pre-trained based on similar cancer types and then fine-tuned on the target cancer type (13,14). For example, Vanacker used the messenger RNA (mRNA) expression data collected from The Cancer Genome Atlas (TCGA) to train the initial deep neural network and fine-tune it for patient risk estimation (15).…”
Section: Introductionmentioning
confidence: 99%
“…However, they came up with the conclusion that DNA methylation data had limited prediction power for gene expression, which is mainly because of the rather low capacity of their network according to our experiments. TDimpute [Zhou et al, 2020] is a more recent work in this field that applied a quite straightforward neural network to impute missing RNA-Seq data from DNA methylation data. They claimed that TDimpute significantly outperformed other stateof-the-art methods including singular value decomposition imputation (SVD), trans-omics block missing data imputation (TOBMI) [Dong et al, 2019], and LASSO [Tibshirani, 1996].…”
Section: Introductionmentioning
confidence: 99%
“…They claimed that TDimpute significantly outperformed other stateof-the-art methods including singular value decomposition imputation (SVD), trans-omics block missing data imputation (TOBMI) [Dong et al, 2019], and LASSO [Tibshirani, 1996]. TDimpute [Zhou et al, 2020] is currently the most state-ofthe-art method that is able to predicting gene expression from DNA methylation, and it is our major comparing method.…”
Section: Introductionmentioning
confidence: 99%