2023
DOI: 10.1038/s41467-023-38782-1
|View full text |Cite
|
Sign up to set email alerts
|

Direct haplotype-resolved 5-base HiFi sequencing for genome-wide profiling of hypermethylation outliers in a rare disease cohort

Abstract: Long-read HiFi genome sequencing allows for accurate detection and direct phasing of single nucleotide variants, indels, and structural variants. Recent algorithmic development enables simultaneous detection of CpG methylation for analysis of regulatory element activity directly in HiFi reads. We present a comprehensive haplotype resolved 5-base HiFi genome sequencing dataset from a rare disease cohort of 276 samples in 152 families to identify rare (~0.5%) hypermethylation events. We find that 80% of these ev… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
3

Relationship

2
6

Authors

Journals

citations
Cited by 16 publications
(6 citation statements)
references
References 43 publications
0
6
0
Order By: Relevance
“…We generated or analyzed data from testis tissue of chimpanzee, gorilla, bonobo, Sumatran and Bornean orangutan (Makova et al 2023) and from pooled human fetal brain tissue (Supplementary Table S3). Additionally, we analyzed a very deep pool of ∼500 million human FLNC recently generated from induced pluripotent stem cells (iPSCs) (Cheung et al 2023). We mapped FLNC reads to both haplotypes of the respective species of origin genome assemblies, allowing only high-quality mappings, and tracking all best map assignments versus multiple mappings among the paralogous copies for each species (Fig.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…We generated or analyzed data from testis tissue of chimpanzee, gorilla, bonobo, Sumatran and Bornean orangutan (Makova et al 2023) and from pooled human fetal brain tissue (Supplementary Table S3). Additionally, we analyzed a very deep pool of ∼500 million human FLNC recently generated from induced pluripotent stem cells (iPSCs) (Cheung et al 2023). We mapped FLNC reads to both haplotypes of the respective species of origin genome assemblies, allowing only high-quality mappings, and tracking all best map assignments versus multiple mappings among the paralogous copies for each species (Fig.…”
Section: Resultsmentioning
confidence: 99%
“…Sequencing and assembly data are available in NCBI under the following BioProject names: owl monkey: PRJNA941350; macaque: PRJNA877605; marmoset: PRJNA941358; gorilla: PRJNA916732 and PRJNA916733; bonobo: PRJNA916735 and PRJNA916734; chimpanzee: PRJNA916736 and PRJNA916737. Human iPSC Iso-Seq libraries were generated in Cheung et al 2023 and made available in the dbGAP database under accession code phs002206.v4.p1. Fetal brain tissue sequenced is from BioSample SAMN09459150, and sequencing data made available under SRA SUB14282898.…”
Section: Data Accessmentioning
confidence: 99%
“…Recent work has shown that long-read sequencing can haplotype-resolve epigenetic variation ( e.g. CpG methylation, protein-DNA interactions) in complex genomic regions [60,61]. To assess the potential for long-read epigenetic assays to characterize IGH in a haplotype-resolved manner through use of an IGH-personalized reference, we determined the mappability of k-mers ranging from 1,000 - 25,000 bp for NA12878 and NA19240 ( Figure 5 ).…”
Section: Resultsmentioning
confidence: 99%
“…We also confirm the properties of SVs linking to QTLs impacting at greater distances and at a higher frequency. Hypermethylation in regulatory elements canonically leads to loss of activity, which we previously exploited in rare variant characterization of long-read sequences (Cheung et al 2023). The hypermethylating impact we observed for leading SV-mQTLs suggests an important role for studying SVs in gene silencing.…”
Section: Discussionmentioning
confidence: 99%
“…Expanding the non-reference results to a larger number of human genomes and epigenomes can expose population variation with potential new insights on trait variation and disease. The ability to survey epigenomes was recently augmented by long-read technologies that simultaneously characterize the sequence of personal genomes, resolving polymorphic SVs, together with their epigenomic status (Yue et al 2022; Cheung et al 2023; Sigurpalsdottir et al 2024).…”
Section: Introductionmentioning
confidence: 99%