“…Text classification in the education domain is reportedly difficult because the tags (or labels) are hierarchical (Xu et al, 2019; Goel et al, 2022; Mohania et al, 2021), grow flexibly, and can be multi-label (Medini et al, 2019; Dekel and Shamir, 2010). Though retrieval-based methods were effective for such long-tailed and multi-label datasets (Zhang et al, 2022), they relied on vanilla BERT (Devlin et al, 2018) models, leaving room for improvement; we therefore leverage retrieval models fine-tuned for question answering (Karpukhin et al, 2020).…”