“…Besides, NLI systems relying on superficial syntactic properties (e.g., the lexical overlap heuristic, the subsequence heuristic, the constituent heuristic) may succeed on the majority of examples McCoy et al, 2019;Clark et al, 2019;Utama et al, 2020;Pezeshkpour et al, 2021). On quite a few NLP tasks composed of several components, it has been observed that models fed with partial input can achieve competitive performance compared with those feeding with full input, e.g., leveraging claims without evidence for fact verification (Schuster et al, 2019;Utama et al, 2020;Du et al, 2021b), choosing a plausible story ending for in the narrative cloze test without looking at the story (Cai et al, 2017), question answering based biased positional predictions on the reference document (Jia and Liang, 2017;Kaushik and Lipton, 2018), selecting the appropriate warrant with claims only (without reason) in argument reasoning comprehension (Niven and Kao, 2019;Branco et al, 2021), etc. This paper is the first work to diagnose existing entity typing models whether they have exploited spurious correlations, including the common lexical and the partial input (mention-context) bias, and the task-specific dependency bias.…”