2019
DOI: 10.1101/803692
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Imputing missing RNA-seq data from DNA methylation by using transfer learning based neural network

Abstract: Multi-omics integrative analysis can capture the associations of different omics and thus provides a comprehensive view of the complex mechanisms in cancers. However, it is common that one portion of samples miss one type of omics data due to various limitations in experiments, which can be an obstacle for downstream analysis where complete dataset is needed. Current imputation methods mainly focus on single cancer dataset, which are limited by their ability to capture information from large pan-cancer dataset… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2019
2019
2020
2020

Publication Types

Select...
2
2
2

Relationship

2
4

Authors

Journals

citations
Cited by 6 publications
(6 citation statements)
references
References 35 publications
0
6
0
Order By: Relevance
“…It reliably imputes RNA-seq data by making use of external data from DNA methylation probe datasets. TDimpute (Zhou X. et al, 2019) is a deep neural network (DNN)-based transfer learning approach that imputes missing gene expression data using DNA methylation datasets. It employs a DNN model to recover missing gene expression data by constructing a non-linear mapping between DNA methylation data and gene expression data.…”
Section: Integrating Epigenomic and Transcriptomic Datamentioning
confidence: 99%
See 1 more Smart Citation
“…It reliably imputes RNA-seq data by making use of external data from DNA methylation probe datasets. TDimpute (Zhou X. et al, 2019) is a deep neural network (DNN)-based transfer learning approach that imputes missing gene expression data using DNA methylation datasets. It employs a DNN model to recover missing gene expression data by constructing a non-linear mapping between DNA methylation data and gene expression data.…”
Section: Integrating Epigenomic and Transcriptomic Datamentioning
confidence: 99%
“…Since gene-trait associations are mostly detected in strongly relevant tissues, it is recommended to use trait-relevant tissues in order to boost the correlation between GReX of related tissues (Zhang et al, 2019). For the TDimpute model, it can be further improved by integrating prior biological knowledge regarding the gene-gene interaction factors in order to reduce the parameters of the DNN model (Zhou X. et al, 2019).…”
Section: Integrating Epigenomic and Transcriptomic Datamentioning
confidence: 99%
“…The datasets and pretrained pan-cancer models supporting the results of this article are available in the Synapse with ID: syn21438134 [ 44 ]. Snapshots of our code and data further supporting this work are openly available in the GigaScience repository, GigaDB [ 45 ].…”
Section: Availability Of Supporting Data and Materialsmentioning
confidence: 99%
“…Furthermore, to impute entirely missing expression profiles, one must incorporate additional domain information, such as data from nearby time points or functional relationships between genes' expression patterns. In another example, transfer learning has been used to impute entire bulk RNA-sequencing profiles when methylation profiles for the same samples are available (Zhou et al, 2020). Here, we use expression profiles of related drugs and cells.…”
Section: Introductionmentioning
confidence: 99%