AbstractBackgroundGene expression plays a key intermediate role in linking molecular features at the DNA level and phenotype. However, owing to various limitations in experiments, the RNA-seq data are missing in many samples while there exist high-quality of DNA methylation data. Because DNA methylation is an important epigenetic modification to regulate gene expression, it can be used to predict RNA-seq data. For this purpose, many methods have been developed. A common limitation of these methods is that they mainly focus on a single cancer dataset and do not fully utilize information from large pan-cancer datasets.ResultsHere, we have developed a novel method to impute missing gene expression data from DNA methylation data through a transfer learning–based neural network, namely, TDimpute. In the method, the pan-cancer dataset from The Cancer Genome Atlas (TCGA) was utilized for training a general model, which was then fine-tuned on the specific cancer dataset. By testing on 16 cancer datasets, we found that our method significantly outperforms other state-of-the-art methods in imputation accuracy with a 7–11% improvement under different missing rates. The imputed gene expression was further proved to be useful for downstream analyses, including the identification of both methylation–driving and prognosis-related genes, clustering analysis, and survival analysis on the TCGA dataset. More importantly, our method was indicated to be useful for general purposes by an independent test on the Wilms tumor dataset from the Therapeutically Applicable Research to Generate Effective Treatments (TARGET) project.ConclusionsTDimpute is an effective method for RNA-seq imputation with limited training samples.
Total mortality and sudden cardiac death is highly prevalent in patients with chronic kidney disease (CKD). In CKD patients, the protein-bound uremic retention solute indoxyl sulfate (IS) is independently associated with cardiovascular disease. However, the underlying mechanisms of this association have yet to be elucidated. The relationship between IS and cardiac electrocardiographic parameters was investigated in a prospective observational study among early CKD patients. IS arrhythmogenic effect was evaluated by in vitro cardiomyocyte electrophysiological study and mathematical computer simulation. In a cohort of 100 early CKD patients, patients with corrected QT (QTc) prolongation had higher IS levels. Furthermore, serum IS level was independently associated with prolonged QTc interval. In vitro, the delay rectifier potassium current (IK) was found to be significantly decreased after the treatment of IS in a dose-dependent manner. The modulation of IS to the IK was through the regulation of the major potassium ion channel protein Kv 2.1 phosphorylation. In a computer simulation, the decrease of IK by IS could prolong the action potential duration (APD) and induce early afterdepolarization, which is known to be a trigger mechanism of lethal ventricular arrhythmias. In conclusion, serum IS level is independently associated with the prolonged QTc interval in early CKD patients. IS down-regulated IK channel protein phosphorylation and the IK current activity that in turn increased the cardiomyocyte APD and QTc interval in vitro and in the computer ORd model. These findings suggest that IS may play a role in the development of arrhythmogenesis in CKD patients.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.