2019
DOI: 10.1101/727784
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

sumrep: a summary statistic framework for immune receptor repertoire comparison and model validation

Abstract: The adaptive immune system generates an incredible diversity of antigen receptors for B and T cells to keep dangerous pathogens at bay. The DNA sequences coding for these receptors arise by a complex recombination process followed by a series of productivity-based filters, as well as affinity maturation for B cells, giving considerable diversity to the circulating pool of receptor sequences. Although these datasets hold considerable promise for medical and public health applications, the complex structure of t… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
11
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
4
2

Relationship

2
4

Authors

Journals

citations
Cited by 10 publications
(11 citation statements)
references
References 37 publications
0
11
0
Order By: Relevance
“…We next sought to quantify the similarity of model-generated sequences to real sequences, for each of the three models in consideration. To accomplish this task, we used the sumrep package (Olson et al, 2019) (https://github.com/matsengrp/sumrep/), a collaborative effort of the AIRR (Breden et al, 2017; Rubelt et al, 2017) software working group. This package calculates many summary statistics on immune receptor sequence repertoires and provides functions for comparing these summaries.…”
Section: Resultsmentioning
confidence: 99%
“…We next sought to quantify the similarity of model-generated sequences to real sequences, for each of the three models in consideration. To accomplish this task, we used the sumrep package (Olson et al, 2019) (https://github.com/matsengrp/sumrep/), a collaborative effort of the AIRR (Breden et al, 2017; Rubelt et al, 2017) software working group. This package calculates many summary statistics on immune receptor sequence repertoires and provides functions for comparing these summaries.…”
Section: Resultsmentioning
confidence: 99%
“…[1]Two such approaches have been proposed for specific clone detection in Minimal Residual Diseases 45,46 as well as for the BCR, but not TCR, repertoire 47 , still at a very low diversity level. The construction of such gold standard repertoires is currently very costly and remains a major challenge that the Adaptive Immune Receptor Repertoire Community (AIRR-C) 48 , engaged in AIRR-seq standardization [49][50][51] , may tackle in the future. Finally, in this study some data were pre-processed using proprietary (mPCR-1, mPCR-3) or published 30,52 (RACE-1_U and RACE-2_U) tools and then aligned and error-corrected using MiXCR (v2.1.10) 37 .…”
Section: Detection Sensitivity Of Rare Tcrs Depends On the Methodsmentioning
confidence: 99%
“…For RACE-1 and RACE-2, UMI pre-processing was performed following protocols published elsewhere 29,30 . FASTQ and FASTA files were then processed for TRB and TRA sequence annotation using the MiXCR software (v2.1.10) with RNA-Seq parameters (-p rna-seq -s hsa) 50 .…”
Section: Tcr Deep Sequencing Data Processingmentioning
confidence: 99%
“…While these efforts raise concerns over the validity of DE cells as a biomarker or mechanism of T1D pathogenesis, our data provide an opportunity for further discovery. The data can be used to determine whether potential immune cell or immune repertoire motifs are associated with T1D, to compare against libraries of antibodies with known auto-specificities (Seay et al, 2016) as well as for sample normalization, motif discovery, and disease-specific immune subset analysis (Olson et al, 2019;Miho et al, 2019). Another interesting and under-explored aspect of our data is the analysis of the Aab + control subjects, who have broken tolerance to a subset of T1D-associated autoantigens (Battaglia et al, 2020).…”
Section: Discussionmentioning
confidence: 99%