Adaptive immune receptor repertoires (AIRRs) have emerged as promising biomarkers for disease diagnosis and clinical prognosis. However, their high diversity and limited sharing between donors pose unique challenges when employing them as features for machine learning based diagnostics. In this study, we investigate the commonly used approach of representing each receptor as a member of a “clonotype cluster”. We then construct a feature vector for each donor from clonotype cluster frequencies (CCFs). We find that CCFs are sparse features and that classifiers trained on them do not generalize well to new donors. To overcome this limitation, we introduce a novel approach where we transform cluster frequencies using an adjacency matrix built from pairwise similarities of all receptors. This transformation produces a new feature, termed paratope cluster occupancies (PCOs). Leveraging publicly available AIRR datasets encompassing infectious disease (COVID-19, HIV), autoimmune disease (autoimmune hepatitis, type 1 diabetes), and cancer (colorectal cancer, non-small cell lung cancer), we demonstrate that PCOs exhibit lower sparsity compared to CCFs. Furthermore, we establish that classifiers trained on PCOs exhibited improved generalizability and overall classification performance (mean ROC AUC 0.893) when compared to CCFs (mean ROC AUC 0.714) over the six diseases. Our findings highlight the potential of utilizing PCOs as a feature representation for AIRR analysis in diverse disease contexts.