“…characterization of CDR3 repertoires remains a great challenge. To date, several tools, such as MiXCR (Bolotin et al 2017), RTCR (Gerritsen et al 2016), IMSEQ (Kuchenbecker et al 2015), LymAnalyzer (Yu et al 2016), TraCeR (Stubbington et al 2016), and TRUST (Hu et al 2019), have been developed for to characterizing CDR3 repertoires in TCR-Seq and/or RNA-Seq data. However, these methods have some limitations and require improvements in terms of 1) discarding reads without complete CDR3s or with low frequency, which may cause massive loss of CDR3 sequences and bias toward TCR repertoires, 2) parameter sensitivity and reliance on hands-on settings for model optimization, 3) poor performance in datasets with short read length and low TCR content, and 4) excessive consumption of time and computational resource, which roadblocks the usage of these methods To overcome these limitations, we developed a sensitive and accurate method, named CATT (CharActerizing TCR reperToires, http://bioinfo.life.hust.edu.cn/CATT), for characterizing CDR3 repertoires in both bulk and single-cell TCR(RNA)-Seq datasets.…”