A Bayesian hierarchical model to detect differentially methylated loci from single nucleotide resolution sequencing data

Feng, Hao; Conneely, Karen N.; Wu, Hao

doi:10.1093/nar/gku154

Cited by 439 publications

(396 citation statements)

References 42 publications

Supporting

Mentioning

394

Contrasting

Order By: Relevance

“…The random set of regions was then annotated the same way as the original DMRs. Differential methylation analysis of CpG methylation among the datasets were further assessed using a Bayesian hierarchical model to detect differences among methylation at 3 CpG sites 36 .…”

Section: Methodsmentioning

confidence: 99%

Effector CD8 T cells dedifferentiate into long-lived memory cells

Youngblood

Hale

Kissick

et al. 2017

Nature

402

359

View full text Add to dashboard Cite

Memory CD8 T cells that circulate in the blood and are present in lymphoid organs are an essential component of long-lived T cell immunity. These resting memory CD8 T cells remain poised to rapidly elaborate effector functions upon re-exposure to pathogen, but also have many properties in common with naïve cells, including the ability to migrate to lymph nodes and spleen, and their pluri-potency. Thus, memory cells embody features of both naïve and effector cells, fueling a long-standing debate centered on whether memory T cells develop from effector cells or directly from naïve cells1–4. To better define the developmental path of memory CD8 T cells we investigated changes in DNA methylation programming at naïve and effector genes in virus specific CD8 T cells during acute LCMV infection of mice. Methylation profiling of effector CD8 T cell subsets at day 4 and 8 after infection showed that, rather than retaining a naïve epigenetic state, the subset of cells that gives rise to memory cells acquired de novo DNA methylation programs at naïve-associated genes and became demethylated at loci of classically defined effector molecules. Conditional deletion of the de novo methyltransferase, Dnmt3a, at an early stage of effector differentiation strikingly reduced methylation of naïve-associated genes and resulted in faster re-expression of these naïve genes, accelerating memory cell development. Longitudinal phenotypic and epigenetic characterization of virus-specific memory-precursor CD8 T cells transferred into antigen-free mice revealed that their differentiation into memory cells was coupled to cell-division independent erasure of de novo methylation programs and re-expression of naïve-associated genes. These data provide evidence that epigenetic repression of naïve-associated genes in effector CD8 T cells can be reversed in cells that develop into long-lived memory CD8 T cells supporting a differentiation model where memory T cells arise from a subset of fate-permissive effector T cells.

show abstract

Section: Methodsmentioning

confidence: 99%

Effector CD8 T cells dedifferentiate into long-lived memory cells

Youngblood

Hale

Kissick

et al. 2017

Nature

402

359

View full text Add to dashboard Cite

show abstract

“…Effectively, our model improves and generalizes the beta-binomial model by introducing this extra g term to model individual relatedness due to population structure or stratification. In the absence of g, our model becomes similar to other beta-binomial models previously developed for modeling count data [31,33,47,62].…”

Section: The Binomial Mixed Model and The Macau Algorithmmentioning

confidence: 94%

“…As a result, DNA methylation levels will frequently covary with kinship or population structure, and failure to account for this covariance could lead to spurious associations or reduced power to detect true effects. This phenomenon has been extensively documented for genotypephenotype association studies [35,36,[40][41][42], and controlling for genetic covariance between No DSS [31], MOABS [32], RadMeth [33] Linear mixed model No Yes Yes GEMMA [34], EMMA [35], EMMAX [36], FaST-LMM [37] Binomial mixed model Yes Yes Yes MACAU samples is now a basic requirement for genome-wide association studies. Similar logic applies to analyses of gene regulatory phenotypes and studies of gene expression variation often do take genetic structure into account by using mixed model approaches [43][44][45].…”

Section: Introductionmentioning

confidence: 99%

“…To address this problem, several recently introduced methods for differential DNA methylation analysis implement a beta-binomial model (e.g., 'DSS: Dispersion Shrinkage for Sequencing data' [31], 'RADMeth: Regression Analysis of Differential Methylation' [33], and 'MOABS: Model Based Analysis of Bisulfite Sequencing data' [32]). These methods model the binomial nature of bisulfite sequencing data, while taking into account the well-known problem of overdispersion in sequencing reads.…”

Section: Introductionmentioning

confidence: 99%

“…Because these methods work directly on count data, they can reliably account for variation in read coverage across sites and individuals. Consequently, betabinomial methods consistently provide increased power to detect true associations between genetic or environmental sources of variance and DNA methylation levels [31][32][33].…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

A flexible, efficient binomial mixed model for identifying differential DNA methylation in bisulfite sequencing data

Lea

Alberts

Tung

et al. 2015

Preprint

View full text Add to dashboard Cite

Identifying sources of variation in DNA methylation levels is important for understanding gene regulation. Recently, bisulfite sequencing has become a popular tool for investigating DNA methylation levels. However, modeling bisulfite sequencing data is complicated by dramatic variation in coverage across sites and individual samples, and because of the computational challenges of controlling for genetic covariance in count data. To address these challenges, we present a binomial mixed model and an efficient, sampling-based algorithm (MACAU: Mixed model association for count data via data augmentation) for approximate parameter estimation and p-value computation. This framework allows us to simultaneously account for both the over-dispersed, count-based nature of bisulfite sequencing data, as well as genetic relatedness among individuals. Using simulations and two real data sets (whole genome bisulfite sequencing (WGBS) data from Arabidopsis thaliana and reduced representation bisulfite sequencing (RRBS) data from baboons), we show that our method provides well-calibrated test statistics in the presence of population structure. Further, it improves power to detect differentially methylated sites: in the RRBS data set, MACAU detected 1.6-fold more age-associated CpG sites than a beta-binomial model (the next best approach). Changes in these sites are consistent with known age-related shifts in DNA methylation levels, and are enriched near genes that are differentially expressed with age in the same population. Taken together, our results indicate that MACAU is an efficient, effective tool for analyzing bisulfite sequencing data, with particular salience to analyses of structured populations. MACAU is freely available at www.xzlab. org/software.html. Author SummaryDNA methylation is an important epigenetic modification involved in regulating gene expression. It can be measured at base-pair resolution, on a genome-wide scale, by coupling sodium bisulfite conversion with high-throughput sequencing (a technique known as 'bisulfite sequencing'). However, the data generated by such methods present several challenges for statistical analysis. In particular, while the raw data generated from bisulfite sequencing experiments are read counts, they are often converted to proportions for ease of modeling, resulting in loss of information. Furthermore, although DNA methylation levels are known to be heritable-and are thus affected by kinship and population structure-existing approaches for modeling bisulfite sequencing data fail to account for this covariance. Such failure can lead to spurious associations and reduced power. Here, we present a new approach that models bisulfite sequencing data using raw read counts, while also taking into account population structure and other sources of data over-dispersion. Using simulations and two real data sets (publicly available data from Arabidopsis thaliana and newly generated data from Papio cynocephalus), we demonstrate that our model provides well-calibrated p-values and i...

show abstract

Circulating cell‐free DNA methylation‐based multi‐omics analysis allows early diagnosis of pancreatic ductal adenocarcinoma

Zhao,

Jiang,

Shi

et al. 2024

Molecular Oncology

View full text Add to dashboard Cite

Pancreatic ductal adenocarcinoma (PDAC) is a highly aggressive cancer with a 5‐year survival rate of 7.2% in China. However, effective approaches for diagnosis of PDAC are limited. Tumor‐originating genomic and epigenomic aberration in circulating free DNA (cfDNA) have potential as liquid biopsy biomarkers for cancer diagnosis. Our study aims to assess the feasibility of cfDNA‐based liquid biopsy assay for PDAC diagnosis. In this study, we performed parallel genomic and epigenomic profiling of plasma cfDNA from Chinese PDAC patients and healthy individuals. Diagnostic models were built to distinguish PDAC patients from healthy individuals. Cancer‐specific changes in cfDNA methylation landscape were identified, and a diagnostic model based on six methylation markers achieved high sensitivity (88.7% for overall cases and 78.0% for stage I patients) and specificity (96.8%), outperforming the mutation‐based model significantly. Moreover, the combination of the methylation‐based model with carbohydrate antigen 19‐9 (CA19‐9) levels further improved the performance (sensitivity: 95.7% for overall cases and 95.5% for stage I patients; specificity: 93.3%). In conclusion, our findings suggest that both methylation‐based and integrated liquid biopsy assays hold promise as non‐invasive tools for detection of PDAC.

show abstract

A Bayesian hierarchical model to detect differentially methylated loci from single nucleotide resolution sequencing data

Cited by 439 publications

References 42 publications

Effector CD8 T cells dedifferentiate into long-lived memory cells

Effector CD8 T cells dedifferentiate into long-lived memory cells

A flexible, efficient binomial mixed model for identifying differential DNA methylation in bisulfite sequencing data

Circulating cell‐free DNA methylation‐based multi‐omics analysis allows early diagnosis of pancreatic ductal adenocarcinoma

Contact Info

Product

Resources

About