2023
DOI: 10.1101/2023.09.01.555813
Preprint

A pan-MHC reference graph with 246 fully contiguous phased sequences

Liza Huijse,
Solomon M. Adams,
Joshua N. Burton
et al.

Abstract: The major histocompatibility complex (MHC) is a region of the human genome that is key to immune system function but sometimes refractory to genomic analyses due to extreme polymorphism and structural variation. We performed targeted long-read sequencing and de novo assembly of MHC to create 246 highly accurate, fully contiguous, and phased full-length sequences, mostly from data provided by the Human Pangenome Reference Consortium (HPRC). We identified alleles at high resolution across 39 loci including the c…

Cited by 2 publications (2 citation statements)
References 77 publications (144 reference statements)
“…The nucleotide length of each error was summed for each size category and a final percent error was calculated for each assembly by dividing the number of erroneous base pairs in each category by the full length of assembled sequence. To evaluate the impact of heterozygous, diploid sequence data on assembly accuracy, we used three 1000GenomesProject 30X WGS samples with long read data that has recently been assembled into phased MHC haplotypes (Huijse et al. 2023).…”
Section: Methods
Citation type: mentioning
confidence: 99%
“…rate. To further validate this method and ensure that it can successfully handle the heterozygous, diploid sequence data found in human populations, we used MHConstructor to re-assemble phased, diploid MHC reference sequences (Huijse et al 2023) from corresponding 1000GenomesProject 30X WGS reads. We generated the assemblies using the long-read sequenced, phased haplotype sequence as the guide sequence.…”
Section: Use Of Phased Long-read MHC Sequences To Determine Diploid D...
Citation type: mentioning
confidence: 99%