Immunoglobulins or antibodies are the main effector molecules of the B-cell lineage and are encoded by hundreds of variable (V), diversity (D), and joining (J) germline genes, which recombine to generate enormous IG diversity. Recently, high-throughput adaptive immune receptor repertoire sequencing (AIRR-seq) of recombined V-(D)-J genes has offered unprecedented insights into the dynamics of IG repertoires in health and disease. Faithful biological interpretation of AIRR-seq studies depends upon the annotation of raw AIRR-seq data, using reference germline gene databases to identify the germline genes within each rearrangement. Existing reference databases are incomplete, as shown by recent AIRR-seq studies that have inferred the existence of many previously unreported polymorphisms. Completing the documentation of genetic variation in germline gene databases is therefore of crucial importance. Lymphocyte receptor genes and alleles are currently assigned by the Immunoglobulins, T cell Receptors and Major Histocompatibility Nomenclature Subcommittee of the International Union of Immunological Societies (IUIS) and managed in IMGT®, the international ImMunoGeneTics information system® (IMGT). In 2017, the IMGT Group reached agreement with a group of AIRR-seq researchers on the principles of a streamlined process for identifying and naming inferred allelic sequences, for their incorporation into IMGT®. These researchers represented the AIRR Community, a network of over 300 researchers whose objective is to promote all aspects of immunoglobulin and T-cell receptor repertoire studies, including the standardization of experimental and computational aspects of AIRR-seq data generation and analysis. The Inferred Allele Review Committee (IARC) was established by the AIRR Community to devise policies, criteria, and procedures to perform this function. Formalized evaluations of novel inferred sequences have now begun and submissions are invited via a new dedicated portal (https://ogrdb.airr-community.org). Here, we summarize recommendations developed by the IARC—focusing, to begin with, on human IGHV genes—with the goal of facilitating the acceptance of inferred allelic variants of germline IGHV genes. We believe that this initiative will improve the quality of AIRR-seq studies by facilitating the description of human IG germline gene variation, and that in time, it will expand to the documentation of TR and IG genes in many vertebrate species.
In vitro generation of antibodies often requires variable domain sequence evolution to adapt the protein in terms of affinity, specificity, or developability. Such antibodies, including those that are of interest for clinical development, may have their origins in a diversity of immunoglobulin germline genes. Others and we have previously shown that antibodies of different origins tend to evolve along different, preferred trajectories. Apart from substitutions within the complementary determining regions, evolution may also, in a germline gene-origin-defined manner, be focused to residues in the framework regions, and even to residues within the protein core, in many instances at a substantial distance from the antibody’s antigen-binding site. Examples of such germline origin-defined patterns of evolution are described. We propose that germline gene-preferred substitution patterns offer attractive alternatives that should be considered in efforts to evolve antibodies intended for therapeutic use with respect to appropriate affinity, specificity, and product developability. We also hypothesize that such germline gene-origin-defined in vitro evolution hold potential to result in products with limited immunogenicity, as similarly evolved antibodies will be parts of conventional, in vivo-generated antibody responses and thus are likely to have been seen by the immune system in the past.
Sequencing of immunoglobulin germline gene loci is a challenging process, e.g. due to their repetitiveness and complexity, hence limiting the insight in the germline gene repertoire of humans and other species. Through next generation sequencing technology, it is possible to generate immunoglobulin transcript data sets large enough to computationally infer the germline genes from which the transcripts originate. Multiple tools for such inference have been developed and they can be used for construction of individual germline gene databases, and for discovery of new immunoglobulin germline genes and alleles. However, there are challenges associated with these methods, many of them related to the biological process through which immunoglobulin coding genes are generated. The junctional diversity introduced during rearrangement of the immunoglobulin heavy chain variable (IGHV), diversity and joining genes specifically complicates the inference of the junction regions, with implications for inference of the 3'-end of IGHV genes. With the aim of coping with such diversity, an inference software package may not be able to identify novel alleles harbouring a difference in these regions compared to their closest relatives in the starting database. In this study, we were able to computationally infer one such previously uncharacterized allele, IGHV3-7*02 A318G. However, this was possible only if a strategy was used in which different variants of IGHV3-7*02 were included in the inference-initiating database. Importantly, the presence of the novel allele, but not the standard IGHV3-7*02 sequence, in the genotype was strongly supported by the actual sequences that were assigned to the allele. We thus showed that the starting database used will impact the germline gene inference process, and that difference in the 3'-end of IGHV genes may remain undetected unless specific, non-standard procedures are used to address this matter. We suggest that inferred genes/alleles should be confirmed e.g. by examination of the nucleotide composition of the 3'-bases of the inference-supporting sequence reads.
Inference of antibody gene repertoires using transcriptome data has emerged as an alternative approach to the complex process of sequencing of adaptive immune receptor germline gene loci. The diversity introduced during rearrangement of immunoglobulin heavy chain variable (IGHV), diversity, and joining genes has however been identified as potentially affecting inference specificity. In this study, we have addressed this issue by analysing the nucleotide composition of unmutated human immunoglobulin heavy chains-encoding transcripts, focusing on the 3ö most bases of 47 IGHV germline genes. Although transcripts derived from some of the germline genes predominately incorporated the germline encoded base even at position 320, the last base of most IGHV genes, transcripts originating in other genes presented other nucleotides to the same extent at this position. In transcripts derived from two of the germline genes, IGHV3-13*01 and IGHV4-30-2*01, the predominating nucleotide (G) was in fact not that of the gene (A). Hence, we suggest that inference of IGHV genes should be limited to bases preceding nucleotide 320, as inference beyond this would jeopardize the specificity of the inference process. The different degree of incorporation of the final base of the IGHV gene directly influences the distribution of amino acids of the ascending strand of the third complementarity determining region of the heavy chain. Thereby it influences the nature of this specificity-determining part of the antibody population. In addition, we also present data that indicate the existence of a common so far un-recognized allelic variant of IGHV3-7 that carries an A318G difference in relation to IGHV3-7*02.
Allergic diseases affect many individuals world-wide and are dependent on the interaction between allergens and antibodies of the IgE isotype. Allergen-specific immunotherapy (AIT) can alter the development of the disease, e.g., through induction of allergen-specific IgG that block allergen-IgE interactions. The knowledge of epitopes recognized by allergy-causing and protective antibodies are limited. Therefore, we developed an allergome-wide peptide microarray, aiming to track linear epitope binding patterns in allergic diseases and during AIT. Here, we focused on immune responses to grass pollen allergens and found that such epitopes were commonly recognized before initiation of AIT and that AIT commonly resulted in increased antibody production against additional epitopes already after 1 year of treatment. The linear epitope binding patterns were highly individual, both for subjects subjected to and for individuals not subjected to AIT. Still, antibodies against some linear epitopes were commonly developed during AIT. For example, the two rigid domains found in grass pollen group 5 allergens have previously been associated to a diversity of discontinuous epitopes. Here, we present evidence that also the flexible linker, connecting these domains, contains regions of linear epitopes against which antibodies are developed during AIT. We also describe some commonly recognized linear epitopes on Phl p 2 and suggest how antibodies against these epitopes may contribute to or prevent allergy in relation to a well-defined stereotyped/public IgE response against the same allergen. Finally, we identify epitopes that induce cross-reactive antibodies, but also antibodies that exclusively bind one of two highly similar variants of a linear epitope. Our findings highlight the complexity of antibody recognition of linear epitopes, with respect to both the studied individuals and the examined allergens. We expect that many of the findings in this study can be generalized also to discontinuous epitopes and that allergen peptide microarrays provide an important tool for enhancing the understanding of allergen-specific antibodies in allergic disease and during AIT.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.