Deep Neural Networks Predict MHC-I Epitope Presentation and Transfer Learn Neoepitope Immunogenicity

Albert, Benjamin Alexander; Yang, Yunxiao; Shao, Xiaoshan; Singh, Douglas A.; Smith, Kellie N.; Anagnostou, Valsamo; Karchin, Rachel

doi:10.1101/2022.08.29.505690

Cited by 7 publications

(30 citation statements)

References 39 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…To answer (1), we compare PerceiverpMHC with recent pseudo-sequence approaches and DeepAttentionPan (the single full sequence base approach ) on 6 datasets in section 1.1.In section 1.3 we show that the learned representations of RobustpMHC can be transferred to datasets that are even significantly different than the training dataset. Finally, we present ablation studies in section 1.4 to answer the research questions (2)(3)(4). With regard to (4), we show that across 8 different datasets RobustpMHC is either state-of-the-art approach or at par with state-of-the-art approach for that dataset in section 1.2.…”

Section: Resultsmentioning

confidence: 92%

“…We initially evaluate whether neural networks can inherently learn which amino acids are crucial for binding given the full MHC sequence or if we need to design hand-crated pseudo-sequences. To this end, we evaluate the performance of PerceiverpMHC on four publicly available benchmarks: independent and external set from Anthem [33] dataset, Neoantigen [10] and HPV [8] datasets and compare with the state-of-the-art pseudo-sequences based approaches like TransPHLA [10], capsNet [21], NetMHCpan 4.1 [44] and BigMHC [2]. The independent set from Anthem dataset [33] contains 112 types of HLA alleles, whereas the external set contains five HLA alleles.…”

Section: Full Sequence Evaluationmentioning

confidence: 99%

“…Neoepitope dataset: For evaluating the transfer learning capability of our approach, we follow the BigMHC [2]. We use the training (positive = 1,407; negative = 4,778), and validation (positive = 173; negative = 515) split for transfer learning our projection block keeping the weights of Perceiver IO block frozen.…”

Section: Datasetsmentioning

confidence: 99%

See 2 more Smart Citations

IMGT/RobustpMHC: Robust Training for class-I MHC Peptide Binding Prediction

Kushwaha,

Duroux,

Giudicelli

et al. 2023

Preprint

View full text Add to dashboard Cite

The accurate prediction of peptide-MHC class I binding probabilities is a critical endeavor in immunoinformatics, with broad implications for vaccine development and immunotherapies. While recent deep neural network based approaches have showcased promise in peptide-MHC prediction, they have two shortcomings: (i) they rely on hand-crafted pseudo-sequence extraction, (ii) they do not generalise well to different datasets, which limits the practicality of these approaches. In this paper, we present PerceiverpMHC that is able to learn accurate representations on full-sequences by leveraging efficient transformer based architectures. Additionally, we propose IMGT/RobustpMHC that harnesses the potential of unlabeled data in improving the robustness of peptide-MHC binding predictions through a self-supervised learning strategy. We extensively evaluate RobustpMHC on 8 different datasets and showcase the improvements over the state-of-the-art approaches. Finally, we compile CrystalIMGT, a crystallography verified dataset that presents a challenge to existing approaches due to significantly different peptide-MHC distributions.

show abstract

Section: Resultsmentioning

confidence: 92%

Section: Full Sequence Evaluationmentioning

confidence: 99%

See 1 more Smart Citation

IMGT/RobustpMHC: Robust Training for class-I MHC Peptide Binding Prediction

Kushwaha,

Duroux,

Giudicelli

et al. 2023

Preprint

View full text Add to dashboard Cite

show abstract

“…Strikingly, the model achieved the best performance on both benchmarks when only the HLA allele was used. The performance of HLA-only model on IEDB benchmark had greatly surpassed (AUROC:+3.18% ∼ +15.95%, AUPRC:+5.13% ∼ +14.01%) all 15 models evaluated in Ref [26] (Fig. 3E).…”

Section: Resultsmentioning

confidence: 99%

“…We conducted a 10-fold cross-validation in consistency with Ref [22]. In section 4.2, to ensure the consistency and fairness necessary for bias exploration and ablation study, we adopted the same dataset curated by Ref [26].…”

Section: Methodsmentioning

confidence: 99%

Discovering and overcoming the bias in neoantigen identification by unified machine learning models

Zhang,

Wu,

Wei

et al. 2024

Preprint

View full text Add to dashboard Cite

Neoantigens, formed by genetic mutations in tumor cells, are abnormal peptides that can trigger immune responses. Precisely identifying neoantigens from vast mutations is the key to tumor immunotherapy design. There are three main steps in the neoantigen immune process, i.e., binding with MHCs, extracellular presentation, and induction of immunogenicity. Various machine learning methods have been developed to predict the probability of one of the three events, but the overall accuracy of neoantigen identification remains far from satisfactory. To gain a systematic understanding of the key factors of neoantigen identification, we developed a unified transformer-based machine learning framework ImmuBPI that comprised three tasks and achieved state-of-the-art performance. Through cross-task model interpretation, we have discovered an underestimation of data bias for immunogenicity prediction, which has led to skewed discriminatory boundaries of current machine learning models. We designed a mutual information-based debiasing strategy that performed well on mutation variants immunogenicity prediction, a task where current methods fell short. Clustering immunogenic peptides with debiased representations uncovers unique preferences for biophysical properties, such as hydrophobicity and polarity. These observations serve as an important complement to the past understanding that accurately predicting neoantigen is constrained by limited data, highlighting the necessity of bias control. We expect this study will provide novel and insightful perspectives for neoantigen prediction methods and benefit future neoantigen-mediated immunotherapy designs.

show abstract