A compact vocabulary of paratope-epitope interactions enables predictability of antibody-antigen binding

Akbar, Rahmad; Jeliazkov, Jeliazko R.; Robert, Philippe A.; Snapkov, Igor; Pavlović, Milena; Slabodkin, Andrei; Weber, Cédric R.; Safonova, Yana; Sandve, Geir Kjetil; Greiff, Victor

doi:10.1101/759498

Cited by 33 publications

(78 citation statements)

References 144 publications

(176 reference statements)

“…This necessitates relevant feature encoding. Indeed, studies have shown that feature engineering is essential for the prediction of paratope-epitope binding (22) and that it can also improve the performance of models for more general bioinformatics problems (23). Moreover, when complex machine learning methods are applied to small datasets, the dangers of overfitting and memorization of examples pose an even bigger threat.…”

Section: Introduction (mentioning)

confidence: 99%

Current challenges for epitope-agnostic TCR interaction prediction and a new perspective derived from image classification

Moris, Pauw, Postovskaya et al. 2019 (preprint)

“…from sequence data alone, is much more difficult. It would involve de novo structure and binding prediction, which are not currently practical, although much recent work focuses on these problems [56][57][58][59][60].…”

Section: Discussion (mentioning)

confidence: 99%

Using B cell receptor lineage structures to predict affinity

Ralph, Matsen 2020 (preprint)

We are frequently faced with a large collection of antibodies, and want to select those with highest affinity for their cognate antigen. When developing a first-line therapeutic for a novel pathogen, for instance, we might look for such antibodies in patients that have recovered. There exist effective experimental methods of accomplishing this, such as cell sorting and baiting; however, they are time-consuming and expensive. Next-generation sequencing of B cell receptor (BCR) repertoires offers an additional source of sequences that could be tapped if we had a reliable method of selecting those coding for the best antibodies. In this paper we introduce a method that uses evolutionary information from the family of related sequences that share a naive ancestor to predict the affinity of each resulting antibody for its antigen. When combined with information on the identity of the antigen, this method should provide a source of effective new antibodies. We also introduce a method for a related task: given an antibody of interest and its inferred ancestral lineage, which branches in the tree are likely to harbor key affinity-increasing mutations? These methods are implemented as part of continuing development of the partis BCR inference package, available at https://github.com/psathyrella/partis. Comments. Please post comments or questions on this paper as new issues at https://git.io/Jvxkn.

“…Each simulated repertoire was then randomly assigned to either the positive or negative class, with 2,500 repertoires per class. In the repertoires assigned to the positive class, we implanted motifs with an average length of 4 AAs, following the results of the experimental analysis of antigen-binding motifs in antibodies and T-cell receptor sequences by (Akbar et al, 2019). We varied the characteristics of the implanted motifs for each of the 18 datasets with respect to the following parameters: (a) ρ, the probability of a motif being implanted in a sequence of a positive repertoire, i.e.…”

Section: A31 Simulated Immunosequencing Data (mentioning)

confidence: 99%
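
The simulation described in the excerpt above can be illustrated with a minimal Python sketch. It is not the citing authors' code: the function names, the motif `GYWT`, the sequence length, the number of sequences per repertoire, and the uniform amino-acid background are all assumptions chosen for illustration. Only the 2,500 repertoires per class, the motif length of 4, and the implantation probability ρ come from the excerpt.

```python
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids


def random_sequence(rng, length):
    """Draw a sequence uniformly at random (assumed background model)."""
    return "".join(rng.choice(AMINO_ACIDS) for _ in range(length))


def implant_motif(rng, sequence, motif):
    """Overwrite a randomly placed window of `sequence` with `motif`."""
    start = rng.randrange(len(sequence) - len(motif) + 1)
    return sequence[:start] + motif + sequence[start + len(motif):]


def simulate_repertoire(rng, n_sequences, seq_length, motif=None, rho=0.0):
    """Generate one repertoire; each sequence receives the motif with probability rho."""
    sequences = []
    for _ in range(n_sequences):
        seq = random_sequence(rng, seq_length)
        if motif is not None and rng.random() < rho:
            seq = implant_motif(rng, seq, motif)
        sequences.append(seq)
    return sequences


def simulate_dataset(seed=0, n_per_class=2500, n_sequences=50,
                     seq_length=15, motif="GYWT", rho=0.1):
    """Build a balanced, shuffled list of (repertoire, label) pairs.

    Label 1 = positive class (motif may be implanted), label 0 = negative class.
    """
    rng = random.Random(seed)
    dataset = []
    for label in (1, 0):
        for _ in range(n_per_class):
            repertoire = simulate_repertoire(
                rng, n_sequences, seq_length,
                motif=motif if label == 1 else None, rho=rho,
            )
            dataset.append((repertoire, label))
    rng.shuffle(dataset)
    return dataset
```

Note that negative repertoires can still contain the motif by chance, since the background is random; the sketch makes no attempt to filter such occurrences.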