2021
DOI: 10.3389/fimmu.2021.627813
|View full text |Cite
|
Sign up to set email alerts
|

Machine Learning Analysis of Naïve B-Cell Receptor Repertoires Stratifies Celiac Disease Patients and Controls

Abstract: Celiac disease (CeD) is a common autoimmune disorder caused by an abnormal immune response to dietary gluten proteins. The disease has high heritability. HLA is the major susceptibility factor, and the HLA effect is mediated via presentation of deamidated gluten peptides by disease-associated HLA-DQ variants to CD4+ T cells. In addition to gluten-specific CD4+ T cells the patients have antibodies to transglutaminase 2 (autoantigen) and deamidated gluten peptides. These disease-specific antibodies recognize def… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
36
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
2

Relationship

3
6

Authors

Journals

citations
Cited by 37 publications
(37 citation statements)
references
References 61 publications
(74 reference statements)
1
36
0
Order By: Relevance
“…Recent IgH repertoire studies have moved towards using machine learning and artificial intelligence in contrast to traditional statistical approaches for goals including vaccine design, immunodiagnostics and antibody discovery ( Greiff et al, 2017 ; Ostmeyer et al, 2017 ; Konishi et al, 2019 ; Shemesh et al, 2021 ). Previous work has focused on representing repertoires as sequence or subsequence-based features, that is, overlapping amino acid k-mers and their Atchley biophysiochemical properties ( Greiff et al, 2017 ; Ostmeyer et al, 2017 ).…”
Section: Discussionmentioning
confidence: 99%
“…Recent IgH repertoire studies have moved towards using machine learning and artificial intelligence in contrast to traditional statistical approaches for goals including vaccine design, immunodiagnostics and antibody discovery ( Greiff et al, 2017 ; Ostmeyer et al, 2017 ; Konishi et al, 2019 ; Shemesh et al, 2021 ). Previous work has focused on representing repertoires as sequence or subsequence-based features, that is, overlapping amino acid k-mers and their Atchley biophysiochemical properties ( Greiff et al, 2017 ; Ostmeyer et al, 2017 ).…”
Section: Discussionmentioning
confidence: 99%
“…Machine learning is increasingly used for AIRR classification both on the sequence (Akbar et al 2021;Friedensohn et al 2020;Isacchini et al 2021;Greiff et al 2017b) and repertoire-level (Shemesh et al 2021;Emerson et al 2017;Sidhom et al 2021;Pavlović et al 2021). Future studies will need to investigate whether differences in RGM also impact repertoire classification (Greiff et al 2020).…”
Section: Discussionmentioning
confidence: 99%
“…In the future, it will be interesting to study whether individual differences in RGMP lead to differences in the propensity to generate antigen-specific (for instance auto-reactive) sequences (Shemesh et al 2021) and, consequently, to the existence of individualized holes in the repertoire (Perelson and Oster 1979). These analyses will require large-scale naïve (unselected) and disease-linked AIRR-seq data (Watson, Glanville, and Marasco 2017).…”
Section: Discussionmentioning
confidence: 99%
“…CDR3 sequences were embedded into R 100 , yielding a matrix of size [number of sequences × 100] where each row r i represents a sequence i and each column C j represents the value of the embedding in dimension j(j ∈ {1…100}). To characterize whole repertoires using CDR3 embedded vectors we applied an analogous clustering approach to the one used in ( 11) and (24). Vectors were first grouped according to their V-gene, J-gene, and CDR3 length.…”
Section: Stratifying Hcv-specific B-cell Repertoires Using Immune2vecmentioning
confidence: 99%