Background
Major depressive disorder (MDD) is a leading cause of disability worldwide, and is commonly treated with antidepressant drugs (AD). Although effective, many patients fail to respond to AD treatment, and accordingly identifying factors that can predict AD response would greatly improve treatment outcomes. In this study, we developed a machine learning tool to integrate multi-omic datasets (gene expression, DNA methylation, and genotyping) to identify biomarker profiles associated with AD response in a cohort of individuals with MDD.
Materials and methods
Individuals with MDD (N = 111) were treated for 8 weeks with antidepressants and were separated into responders and non-responders based on the Montgomery–Åsberg Depression Rating Scale (MADRS). Using peripheral blood samples, we performed RNA-sequencing, assessed DNA methylation using the Illumina EPIC array, and performed genotyping using the Illumina PsychArray. To address this rich multi-omic dataset with high dimensional features, we developed integrative Geneset-Embedded non-negative Matrix factorization (iGEM), a non-negative matrix factorization (NMF) based model, supplemented with auxiliary information regarding gene sets and gene-methylation relationships. In particular, we factorize the subjects by features (i.e., gene expression or DNA methylation) into subjects-by-factors and factors-by-features. We define the factors as the meta-phenotypes as they represent integrated composite scores of the molecular measurements for each subject.
Results
Using our model, we identified a number of meta-phenotypes which were related to AD response. By integrating geneset information into the model, we were able to relate these meta-phenotypes to biological processes, including a meta-phenotype related to immune and inflammatory functions as well as other genes related to depression or AD response. The meta-phenotype identified several genes including immune interleukin 1 receptor like 1 (IL1RL1) and interleukin 5 receptor (IL5) subunit alpha (IL5RA), AKT/PIK3 pathway related phosphoinositide-3-kinase regulatory subunit 6 (PIK3R6), and sphingomyelin phosphodiesterase 3 (SMPD3), which has been identified as a target of AD treatment.
Conclusions
The derived meta-phenotypes and associated biological functions represent both biomarkers to predict response, as well as potential new treatment targets. Our method is applicable to other diseases with multi-omic data, and the software is open source and available on Github (https://github.com/li-lab-mcgill/iGEM).