DNA methylation is an epigenetic mark implicated in crucial biological processes. Most of the knowledge about DNA methylation and its regulatory role is based on bulk experiments, in which DNA methylation of genomic regions is reported as average methylation, i.e. the fraction of methylated cytosines across a pool of cells. However, average methylation does not inform on how methylated cytosines are distributed in each single DNA molecule.
In this study we propose Methylation Class (MC) profiling as a genome-wide approach to the study of DNA methylation heterogeneity from bulk bisulfite sequencing experiments. The proposed approach is built on the concept of MCs, i.e. groups of DNA molecules sharing the same number of methylated cytosines in a sample. The analysis of the relative abundances of MCs from sequencing reads incorporates the information on the overall average methylation, and directly informs on the methylation level of each molecule.
By applying our approach to several publicly available Reduced Representation Bisulfite Sequencing (RRBS) datasets, we individuated cell-to-cell differences as the prevalent contributor to DNA methylation heterogeneity captured by MC profiles. Moreover, we individuated signatures of loci undergoing genomic imprinting and X inactivation, and highlighted differences between the two processes. When applying MC profiling to compare different conditions, we identified DNA methylation changes occurring in regions with almost constant overall average methylation.
Altogether, our results indicate that MC profiling can provide useful insights on the epigenetic status and its evolution at multiple genomic regions.