We present SEQuence Weighted Alignment for Sorting and Harmonization (Seqwash), an algorithm for processing sequencing profiles with large language models (LLMs). Seqwash harmonizes immune cell sequences into a unified representation, allowing LLMs to embed meaningful patterns while discarding irrelevant information. In an evaluation on immune cell sequencing data, Seqwash standardizes profiles effectively, yielding higher-quality features and better performance on both supervised and unsupervised downstream tasks.