“…Adaptive immune receptor repertoires represent a major target area for the application of machine learning in the hope that it may fast-track the in silico discovery and development of immunereceptor based immunotherapies and immunodiagnostics (Brown et al, 2019;Greiff et al, 2012;Mason et al, 2018Mason et al, , 2019Miho et al, 2018). The complexity of sequence dependencies that determine antigen binding (Dash et al, 2017;Glanville et al, 2017), immune receptor publicity (Greiff et al, 2017b) and immune status (immunodiagnostics) (Ostmeyer et al, 2019;Thomas et al, 2014) represent a perfect application ground for machine learning analysis (Arora et al, 2019;Cinelli et al, 2017;Greiff et al, 2017b;Liu et al, 2019;Mason et al, 2019;Sidhom et al, 2018;Sun et al, 2017). As discussed extensively in a recent literature review by us (Brown et al, 2019) the development of ML approaches for immune receptor datasets was and is still hampered by the lack of ground truth datasets.…”