“…Our work is the first to recover reasoning chains in a more general unsupervised setting, and thus falls into the line of work on denoising distantly supervised signals. From this perspective, the most relevant studies in the NLP field include Wang, Yu, Guo, Wang, Klinger, Zhang, Chang, Tesauro, Zhou, and Jiang [21] and Min, Chen, Hajishirzi, and Zettlemoyer [22] for evidence identification in open-domain QA, and Lei, Barzilay, and Jaakkola [5] and Perez, Karamcheti, Fergus, Weston, Kiela, and Cho [23] for rationale recovery.…”