2021
DOI: 10.1101/2021.02.22.432291
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

clusTCR: a Python interface for rapid clustering of large sets of CDR3 sequences

Abstract: The T-cell receptor (TCR) determines the specificity of a T-cell towards an epitope. As of yet, the rules for antigen recognition remain largely undetermined. Current methods for grouping TCRs according to their epitope specificity remain limited in performance and scalability. Multiple methodologies have been developed, but all of them fail to efficiently cluster large data sets exceeding 1 million sequences. To account for this limitation, we developed clusTCR, a rapid TCR clustering alternative that efficie… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(2 citation statements)
references
References 19 publications
0
2
0
Order By: Relevance
“…Second, those that seek to classify TCRs by identifying similar to known antigen-specific or disease-specific TCR clonotypes. The former algorithms such as TCRdist ( 111 ), GLIPH/GLIPH2 ( 117 , 118 ), ClusTCR ( 119 ), and GIANA ( 108 ) do not need information about T1D-specific epitopes and TCR sequences beforehand, and thus can be used to predict disease-specific TCR clonotypes that are specifically detected in T1D patients but not in non-diabetic subjects. On the other hand, machine learning-based algorithms that assess similarities to known antigen-specific TCR datasets to predict epitopes, such as DeepTCR ( 101 ), DeepCAT ( 107 ), TCRmatch ( 120 ), and TCRAI ( 100 ) need prior information about disease-specific TCR sequences.…”
Section: Use Of Tcr Clonotypes As Surrogates To Quantify Antigen-specific T Cellsmentioning
confidence: 99%
“…Second, those that seek to classify TCRs by identifying similar to known antigen-specific or disease-specific TCR clonotypes. The former algorithms such as TCRdist ( 111 ), GLIPH/GLIPH2 ( 117 , 118 ), ClusTCR ( 119 ), and GIANA ( 108 ) do not need information about T1D-specific epitopes and TCR sequences beforehand, and thus can be used to predict disease-specific TCR clonotypes that are specifically detected in T1D patients but not in non-diabetic subjects. On the other hand, machine learning-based algorithms that assess similarities to known antigen-specific TCR datasets to predict epitopes, such as DeepTCR ( 101 ), DeepCAT ( 107 ), TCRmatch ( 120 ), and TCRAI ( 100 ) need prior information about disease-specific TCR sequences.…”
Section: Use Of Tcr Clonotypes As Surrogates To Quantify Antigen-specific T Cellsmentioning
confidence: 99%
“…Building a sequence similarity network is computationally expensive. This challenge has been approached by at least two methods that allow the construction of large-scale networks from millions of AIRR sequences [47,51].…”
Section: Similarity Of Airr Sequencesmentioning
confidence: 99%