Background There has been a steady increase in the number of studies aiming to identify DNA methylation differences associated with complex phenotypes. Many of the challenges of epigenetic epidemiology regarding study design and interpretation have been discussed in detail, however there are analytical concerns that are outstanding and require further exploration. In this study we seek to address three analytical issues. First, we quantify the multiple testing burden and propose a standard statistical significance threshold for identifying DNA methylation sites that are associated with an outcome. Second, we establish whether linear regression, the chosen statistical tool for the majority of studies, is appropriate and whether it is biased by the underlying distribution of DNA methylation data. Finally, we assess the sample size required for adequately powered DNA methylation association studies. Results We quantified DNA methylation in the Understanding Society cohort ( n = 1175), a large population based study, using the Illumina EPIC array to assess the statistical properties of DNA methylation association analyses. By simulating null DNA methylation studies, we generated the distribution of p -values expected by chance and calculated the 5% family-wise error for EPIC array studies to be 9 × 10 − 8 . Next, we tested whether the assumptions of linear regression are violated by DNA methylation data and found that the majority of sites do not satisfy the assumption of normal residuals. Nevertheless, we found no evidence that this bias influences analyses by increasing the likelihood of affected sites to be false positives. Finally, we performed power calculations for EPIC based DNA methylation studies, demonstrating that existing studies with data on ~ 1000 samples are adequately powered to detect small differences at the majority of sites. Conclusion We propose that a significance threshold of P < 9 × 10 − 8 adequately controls the false positive rate for EPIC array DNA methylation studies. Moreover, our results indicate that linear regression is a valid statistical methodology for DNA methylation studies, despite the fact that the data do not always satisfy the assumptions of this test. These findings have implications for epidemiological-based studies of DNA methylation and provide a framework for the interpretation of findings from current and future studies. Electronic supplementary material The online version of this article (10.1186/s12864-019-5761-7) contains supplementary material, which is available to authorized users.
We performed a systematic analysis of blood DNA methylation profiles from 4,483 participants from seven independent cohorts identifying differentially methylated positions (DMPs) associated with psychosis, schizophrenia and treatment-resistant schizophrenia. Psychosis cases were characterized by significant differences in measures of blood cell proportions and elevated smoking exposure derived from the DNA methylation data, with the largest differences seen in treatment-resistant schizophrenia patients. We implemented a stringent pipeline to meta-analyze epigenome-wide association study (EWAS) results across datasets, identifying 95 DMPs associated with psychosis and 1,048 DMPs associated with schizophrenia, with evidence of colocalization to regions nominated by genetic association studies of disease. Many schizophrenia-associated DNA methylation differences were only present in patients with treatment-resistant schizophrenia, potentially reflecting exposure to the atypical antipsychotic clozapine. Our results highlight how DNA methylation data can be leveraged to identify physiological (e.g., differential cell counts) and environmental (e.g., smoking) factors associated with psychosis and molecular biomarkers of treatment-resistant schizophrenia.
Most epigenome-wide association studies (EWAS) quantify DNA methylation (DNAm) in peripheral tissues such as whole blood to identify positions in the genome where variation is statistically associated with a trait or exposure. As whole blood comprises a mix of cell types, it is unclear whether trait-associated DNAm variation is specific to an individual cellular population. We collected three peripheral tissues (whole blood, buccal epithelial and nasal epithelial cells) from thirty individuals. Whole blood samples were subsequently processed using fluorescence-activated cell sorting (FACS) to purify five constituent cell-types (monocytes, granulocytes, CD4+ T cells, CD8+ T cells, and B cells). DNAm was profiled in all eight sample-types from each individual using the Illumina EPIC array. We identified significant differences in both the level and variability of DNAm between different sample types, and DNAm data-derived estimates of age and smoking were found to differ dramatically across sample types from the same individual. We found that for the majority of loci variation in DNAm in individual blood cell types was only weakly predictive of variance in DNAm measured in whole blood, although the proportion of variance explained was greater than that explained by either buccal or nasal epithelial samples. Covariation across sample types was much higher for DNAm sites influenced by genetic factors. Overall, we observe that DNAm variation in whole blood is additively influenced by a combination of the major blood cell types. For a subset of sites, however, variable DNAm detected in whole blood can be attributed to variation in a single blood cell type providing potential mechanistic insight about EWAS findings. Our results suggest that associations between whole blood DNAm and traits or exposures reflect differences in multiple cell types and our data will facilitate the interpretation of findings in epigenetic epidemiology.
Depression is a common and disabling disorder, representing a major social and economic health issue. Moreover, depression is associated with the progression of diseases with an inflammatory etiology including many inflammatory-related disorders. At the molecular level, the mechanisms by which depression might promote the onset of these diseases and associated immune-dysfunction are not well understood. In this study we assessed genome-wide patterns of DNA methylation in whole blood-derived DNA obtained from individuals with a self-reported history of depression (n = 100) and individuals without a history of depression (n = 100) using the Illumina 450K microarray. Our analysis identified six significant (Šidák corrected P < 0.05) depression-associated differentially methylated regions (DMRs); the top-ranked DMR was located in exon 1 of the LTB4R2 gene (Šidák corrected P = 1.27 × 10-14). Polygenic risk scores (PRS) for depression were generated and known biological markers of inflammation, telomere length (TL) and IL-6, were measured in DNA and serum samples, respectively. Next, we employed a systems-level approach to identify networks of co-methylated loci associated with a history of depression, in addition to depression PRS, TL and IL-6 levels. Our analysis identified one depression-associated co-methylation module (P = 0.04). Interestingly, the depression-associated module was highly enriched for pathways related to immune function and was also associated with TL and IL-6 cytokine levels. In summary, our genome-wide DNA methylation analysis of individuals with and without a self-reported history of depression identified several candidate DMRs of potential relevance to the pathogenesis of depression and its associated immune-dysfunction phenotype.
Accumulating evidence suggests that individuals exposed to victimization at key developmental stages may have different epigenetic fingerprints compared to those exposed to no/minimal stressful events, however results are inconclusive. This study aimed to strengthen causal inference regarding the impact of adolescent victimization on the epigenome by controlling for genetic variation, age, gender, and shared environmental exposures. We conducted longitudinal epigenome-wide association analyses (EWAS) on DNA methylation (DNAm) profiles of 118 monozygotic (MZ) twin pairs from the Environmental Risk study with and without severe adolescent victimization generated using buccal DNA collected at ages 5, 10 and 18, and the Illumina EPIC array. Additionally, we performed cross-sectional EWAS on age-18 blood and buccal DNA from the same individuals to elucidate tissue-specific signatures of severe adolescent victimization. Our analyses identified 20 suggestive differentially methylated positions (DMPs) (P < 5e-05), with altered DNAm trajectories between ages 10-18 associated with severe adolescent victimization (∆Beta range = −5.5%−5.3%). Age-18 cross-sectional analyses revealed 72 blood (∆Beta range = −2.2% −3.4%) and 42 buccal (∆Beta range = −3.6%−4.6%) suggestive severe adolescent victimizationassociated DMPs, with some evidence of convergent signals between these two tissue types. Downstream regional analysis identified significant differentially methylated regions (DMRs) in LGR6 and ANK3 (Šidák P = 5e-09 and 4.07e-06), and one upstream of CCL27 (Šidák P = 2.80e-06) in age-18 blood and buccal EWAS, respectively. Our study represents the first longitudinal MZ twin analysis of DNAm and severe adolescent victimization, providing initial evidence for altered DNA methylomic signatures in individuals exposed to adolescent victimization.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.