2020
DOI: 10.1186/s12859-020-3516-8
|View full text |Cite
|
Sign up to set email alerts
|

methCancer-gen: a DNA methylome dataset generator for user-specified cancer type based on conditional variational autoencoder

Abstract: Background: Recently, DNA methylation has drawn great attention due to its strong correlation with abnormal gene activities and informative representation of the cancer status. As a number of studies focus on DNA methylation signatures in cancer, demand for utilizing publicly available methylome dataset has been increased. To satisfy this, large-scale projects were launched to discover biological insights into cancer, providing a collection of the dataset. However, public cancer data, especially for certain ca… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
17
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 11 publications
(17 citation statements)
references
References 27 publications
0
17
0
Order By: Relevance
“…erefore, the cost function of SDAE needs to be determined. At present, cost functions used in deep learning models are mainly cross entropy cost functions, as shown in formula (17) [20]. In addition, the mean square cost function is shown in formula (18) [21].…”
Section: Cost Function Selectionmentioning
confidence: 99%
“…erefore, the cost function of SDAE needs to be determined. At present, cost functions used in deep learning models are mainly cross entropy cost functions, as shown in formula (17) [20]. In addition, the mean square cost function is shown in formula (18) [21].…”
Section: Cost Function Selectionmentioning
confidence: 99%
“…There are many FS techniques; they can be divided into three categories such as filter, wrapper, and embedded; are different in the way each technique copes with a higher dimension to form a subset of features. Most of the DNA methylation-based cancer studies used variance-based filtering FS techniques to select the most variable CpG sites across several samples before performing VAE and classification algorithms [26]- [31]. The advantages of filter techniques are simple and fewer computations compared to the other two categories.…”
Section: A Feature Selectionmentioning
confidence: 99%
“…However, public cancer data is rapidly increasing, there is also a lack of samples for specific cancer types in research. To alleviate this issue, methCancergen [31] was presented to generate a user-specified cancer type dataset by employing conditional VAE and a neural network-based generative model. It estimates the conditional probability distribution with latent variables and data and produces samples for specific cancer types.…”
Section: B Cancer Classificationmentioning
confidence: 99%
See 1 more Smart Citation
“…Our framework for making VAEs interpretable is generalizable to other VAE-based frameworks. Given that VAEs have been applied to a wide range of genomics data modalities (epigenomics [16][17][18] and miRNA 19 ) and analysis (visualization 20,21 , trajectory inference 22 , data imputation 23 , and perturbation response prediction [24][25][26] ), our work can therefore enable interpretability in a wide range of downstream applications of VAEs.…”
Section: Introductionmentioning
confidence: 99%