“…We characterized the probability to observe a clonal lineage ancestor in the periphery as post (σ) ∼ gen (σ) ]^: features`^( a) , which deviates from the inferred generation probability of the receptor gen (σ) by selection factors 5 ( ) (Isacchini et al, 2020a(Isacchini et al, , 2020b(Isacchini et al, , 2021Sethna et al, 2020). These selection factors 5 ( ) depend on sequence features, including IGHVgene and IGHJ-gene usages, HCDR3 length, and amino acid preferences at different positions in the HCDR3 (Methods) (Elhanati et al, 2014;Isacchini et al, 2020aIsacchini et al, , 2020bIsacchini et al, , 2021Marcou et al, 2018;Sethna et al, 2020). Importantly, the inferred selection models are robust to the differences in the sample size of the repertoires, as long as enough data is available to train the models (Methods and Fig.…”