“…To handle the problem, researchers (Zeng et al, 2018(Zeng et al, , 2019Nayak and Ng, 2020;Ye et al, 2021) propose generative models that view triple as a token sequence. Some works Ren et al, 2021;Shang et al, 2022) introduce methods to extract triples in one-stage but suffer from huge prediction space. Other researchers (Wei et al, 2020;Zheng et al, 2021b;Wu and Shi, 2021) decompose RTE into different subtasks, but learn the interaction between sub-tasks only by input sharing, or falling into the cascade error.…”