for non-commercial research purposes and access will be granted upon review of a project proposal that will be evaluated by a TRACERx data access committee and entering into an appropriate data access agreement subject to any applicable ethical approvals. The TCRseq Fastq data was deposited at the short read archive (SRA) under accession code UB4501422.
The T cell receptor (TCR) repertoire can provide a personalized biomarker for infectious and non-infectious diseases. We describe a protocol for amplifying, sequencing, and analyzing TCRs which is robust, sensitive, and versatile. The key experimental step is ligation of a single-stranded oligonucleotide to the 3′ end of the TCR cDNA. This allows amplification of all possible rearrangements using a single set of primers per locus. It also introduces a unique molecular identifier to label each starting cDNA molecule. This molecular identifier is used to correct for sequence errors and for effects of differential PCR amplification efficiency, thus producing more accurate measures of the true TCR frequency within the sample. This integrated experimental and computational pipeline is applied to the analysis of human memory and naive subpopulations, and results in consistent measures of diversity and inequality. After error correction, the distribution of TCR sequence abundance in all subpopulations followed a power law over a wide range of values. The power law exponent differed between naïve and memory populations, but was consistent between individuals. The integrated experimental and analysis pipeline we describe is appropriate to studies of T cell responses in a broad range of physiological and pathological contexts.
Accurate profiling of T-cell receptor (TCR) repertoires is key to monitoring adaptive immunity.We systematically compared TCR sequences obtained with 9 methods applied to aliquots of the same T-cell sample. We observed marked differences in accuracy and intra-and intermethod reproducibility for alpha (TRA) and beta (TRB) TCR chains. Most methods showed lower ability to capture TRA than TRB diversity. Low RNA input generated non-representative repertoires. Results from 5'RACE-PCR methods were consistent among themselves, while differing from the RNA-based multiplex-PCR results. gDNA-based multiplex-PCR methods also differed from each other. Using an in silico meta-repertoire generated from 108 replicates, we found that one gDNA-based method and two non-UMI RNA-based methods were more sensitive than UMI methods in detecting rare clonotypes, despite the better clonotype quantification accuracy of the latter. This study delineates the advantages and limitations of different TCR sequencing methods, which should help the study, diagnosis and treatment of human diseases.
HIV infection profoundly affects many parameters of the immune system and ultimately leads to AIDS, yet which factors are most important for determining resistance, pathology, and response to antiretroviral treatment – and how best to monitor them – remain unclear. We develop a quantitative high-throughput sequencing pipeline to characterize the TCR repertoires of HIV-infected individuals before and after antiretroviral therapy, working from small, unfractionated samples of peripheral blood. This reveals the TCR repertoires of HIV+ individuals to be highly perturbed, with considerably reduced diversity as a small proportion of sequences are highly overrepresented. HIV also causes specific qualitative changes to the repertoire including an altered distribution of V gene usage, depletion of public TCR sequences, and disruption of TCR networks. Short-term antiretroviral therapy has little impact on most of the global damage to repertoire structure, but is accompanied by rapid changes in the abundance of many individual TCR sequences, decreases in abundance of the most common sequences, and decreases in the majority of HIV-associated CDR3 sequences. Thus, high-throughput repertoire sequencing of small blood samples that are easy to take, store, and process can shed light on various aspects of the T-cell immune compartment and stands to offer insights into patient stratification and immune reconstitution.
T-cell specificity is determined by the T-cell receptor, a heterodimeric protein coded for by an extremely diverse set of genes produced by imprecise somatic gene recombination. Massively parallel high-throughput sequencing allows millions of different T-cell receptor genes to be characterized from a single sample of blood or tissue. However, the extraordinary heterogeneity of the immune repertoire poses significant challenges for subsequent analysis of the data. We outline the major steps in processing of repertoire data, considering low-level processing of raw sequence files and high-level algorithms, which seek to extract biological or pathological information. The latest generation of bioinformatics tools allows millions of DNA sequences to be accurately and rapidly assigned to their respective variable V and J gene segments, and to reconstruct an almost error-free representation of the non-templated additions and deletions that occur. High-level processing can measure the diversity of the repertoire in different samples, quantify V and J usage and identify private and public T-cell receptors. Finally, we discuss the major challenge of linking T-cell receptor sequence to function, and specifically to antigen recognition. Sophisticated machine learning algorithms are being developed that can combine the paradoxical degeneracy and cross-reactivity of individual T-cell receptors with the specificity of the overall T-cell immune response. Computational analysis will provide the key to unlock the potential of the T-cell receptor repertoire to give insight into the fundamental biology of the adaptive immune system and to provide powerful biomarkers of disease.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.