We have updated the catalogue of common and well-documented (CWD) HLA alleles to reflect current understanding of the prevalence of specific allele sequences. The original CWD catalogue designated 721 alleles at the HLA-A, -B, -C, -DRB1, -DRB3/4/5, -DQA1, -DQB1, and –DPB1 loci in IMGT/HLA Database release 2.15.0 as being CWD. The updated CWD catalogue designates 1122 alleles at the HLA-A, -B, -C, -DRB1, -DRB3/4/5, -DQA1, -DQB1, -DPA1 and –DPB1 loci as being CWD, and represents 14.3% of the HLA alleles in IMGT/HLA Database release 3.9.0. In particular, we identified 415 of these alleles as being “common” (having known frequencies) and 707 as being “well-documented” on the basis of ~140,000 sequence-based typing observations and available HLA haplotype data. Using these allele prevalence data, we have also assigned CWD status to specific G and P designations. We identified 147/151 G groups and 290/415 P groups as being CWD. The CWD catalogue will be updated on a regular basis moving forward, and will incorporate changes to the IMGT/HLA Database as well as empirical data from the histocompatibility and immunogenetics community. This version 2.0.0 of the CWD catalogue is available online at cwd.immunogenomics.org, and will be integrated into the Allele Frequencies Net Database, the IMGT/HLA Database and National Marrow Donor Program’s bioinformatics web pages.
The human leukocyte antigen (HLA) class I and class II loci are the most polymorphic genes in the human genome. Hematopoietic stem cell transplantation requires allele-level HLA typing at multiple loci to select the best matched unrelated donors for recipient patients. In current methods for HLA typing, both alleles of a heterozygote are amplified and typed or sequenced simultaneously, often making it difficult to unambiguously determine the sequence of the two alleles. Next-generation sequencing methods clonally propagate in parallel millions of single DNA molecules, which are then also sequenced in parallel. Recently, the read lengths obtainable by one such next-generation sequencing method (454 Life Sciences, Inc.) have increased to >250 nucleotides. These clonal read lengths make possible setting the phase of the linked polymorphisms within an exon and thus the unambiguous determination of the sequence of each HLA allele. Here we demonstrate this capacity as well as show that the throughput of the system is sufficiently high to enable a complete, 7-locus HLA class I and II typing for 24 or 48 individual DNAs in a single GS FLX sequencing run. Highly multiplexed amplicon sequencing is facilitated by the use of sample-specific internal sequence tags (multiplex identification tags or MIDs) in the primers that allow pooling of samples yet maintain the ability to assign sequences to specific individuals. We have incorporated an HLA typing software application developed by Conexio Genomics (Freemantle, Australia) that assigns HLA genotypes for these 7 loci (HLA-A, -B, -C, DRB1, DQA1, DQB1, DPB1), as well as for DRB3, DRB4, and DRB5 from 454 sequence data. The potential of this HLA sequencing system to analyze chimeric mixtures is demonstrated here by the detection of a rare HLA-B allele in a mixture of two homozygous cell lines (1/100), as well as by the detection of the rare nontransmitted maternal allele present in the blood of a severe combined immunodeficiency disease syndrome (SCIDS) patient.
The human leukocyte antigen (HLA) class I and class II loci are the most polymorphic genes in the human genome; distinguishing the thousands of HLA alleles is challenging. Next generation sequencing of exonic amplicons with the 454 genome sequence (GS) FLX System and Conexio Assign ATF 454 software provides high resolution, high throughput HLA genotyping for eight class I and class II loci. HLA typing of potential donors for unrelated bone marrow donor registries typically uses a subset of these loci at high sample throughput and low cost per sample. The Fluidigm Access Array System enables the incorporation of 48 different multiplex identifiers (MIDs) corresponding to 48 genomic DNA samples with up to 48 different primer pairs in a microfluidic device generating 2304 parallel polymerase chain reactions (PCRs). Minimal volumes of reagents are used. During genomic PCR, in this 4-primer system, the outer set of primers containing the MID and the 454 adaptor sequences are incorporated into an amplicon generated by the inner HLA target-specific primers each containing a common sequence tag at the 5' end of the forward and reverse primers. Pools of the resulting amplicons are used for emulsion PCR and clonal sequencing on the 454 Life Sciences GS FLX System, followed by genotyping with Conexio software. We have genotyped 192 samples with 100% concordance to known genotypes using 8 primer pairs (covering exons 2 and 3 of HLA-A, B and C, and exon 2 of DRB1, 3/4/5 and DQB1) and 96 MIDs in a single GS FLX run. An average of 166 reads per amplicon was obtained. We have also genotyped 96 samples at high resolution (14 primer pairs covering exons 2, 3, and 4 of the class I loci and exons 2 of DRB1, 3/4/5, DQA1, DQB1, DPB1, and exon 3 of DQB1), recovering an average of 173 sequence reads per amplicon.
The high degree of polymorphism at HLA class I and class II loci makes high resolution HLA typing challenging. Current typing methods, including Sanger sequencing, yield ambiguous typing results due to incomplete genomic coverage and inability to set phase for HLA haplotype determination. The 454 Life Sciences GS FLX next generation sequencing system coupled with Conexio ATF software can provide very high resolution HLA genotyping. High throughput genotyping can be achieved by use of primers with multiplex identifier (MID) tags to allow pooling of the amplicons generated from different individuals prior to sequencing. We have conducted a double blind study in which eight laboratory sites performed amplicon sequencing using GS FLX standard chemistry and genotyped the same 20 samples for HLA-A, -B, -C, DPB1, DQA1, DQB1, DRB1, and DRB3, DRB4 and DRB5 (DRB3/4/5) in a single sequencing run. The average sequence read length was 250 base pairs (bp) and the average number of sequence reads per amplicon was 672, providing confidence in the allele assignments. Of the 1280 genotypes considered, assignment was possible in 95% of the cases. Failure to assign genotypes was the result of researcher procedural error or the presence of a novel allele rather than a failure of sequencing technology. Concordance with known genotypes, in cases where assignment was possible, ranged from 95.3% to 99.4% for the eight sites, with overall concordance of 97.2%. We conclude that clonal pyrosequencing using the GS FLX platform and Conexio ATF software allows reliable identification of HLA genotypes at high resolution.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.