“…Distant supervision has been used to train MRC models in low-resource settings, and two main kinds of approaches have been proposed to address the mislabeling problem: (1) filtering noisy labels, and (2) modeling answer spans as latent variables. The noise filtering approaches learn to score and rank DS instances based on answer span positions Tay et al, 2018;Swayamdipta et al, 2018;Clark and Gardner, 2018;Lin et al, 2018;Joshi et al, 2017;Chen et al, 2017), question-passage similarities (Hong et al, 2022;Qin et al, 2021;Shao et al, 2021;Deng et al, 2021) and model confidences Zhu et al, 2022). The latent variable-based approaches jointly train MRC models and identify correct answer spans using hard-EM algorithms (Zhao et al, 2021;Min et al, 2019;Cheng et al, 2020).…”