“…10 We leave additional NLI datasets, such as the Diverse NLI Collection (Poliak et al, 2018a), for future work. 11 Many NLI models encode P and H separately (Rocktäschel et al, 2016; Mou et al, 2016; Liu et al, 2016; Cheng et al, 2016; Chen et al, 2017), although some share information between the encoders via attention (Duan et al, 2018). 12 Specifically, representations are concatenated, subtracted, and multiplied element-wise.…”
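
The combination step described in footnote 12 can be illustrated with a minimal sketch. It assumes the premise and hypothesis have already been encoded as fixed-size vectors of equal dimension. The function name `combine`, the ordering of the four parts, and the use of a signed (rather than absolute) difference are assumptions made for illustration; the excerpt does not specify them.

```python
from typing import List


def combine(p: List[float], h: List[float]) -> List[float]:
    """Combine premise and hypothesis vectors into one feature vector.

    Hypothetical helper: the layout [p; h; p - h; p * h] is an assumption,
    not something the excerpt states.
    """
    if len(p) != len(h):
        raise ValueError("p and h must have the same dimension")
    diff = [a - b for a, b in zip(p, h)]  # element-wise difference (signed)
    prod = [a * b for a, b in zip(p, h)]  # element-wise product
    return p + h + diff + prod            # concatenate the four parts


# Toy two-dimensional encodings, for illustration only.
features = combine([1.0, 2.0], [3.0, 5.0])
print(features)  # [1.0, 2.0, 3.0, 5.0, -2.0, -3.0, 3.0, 10.0]
```

Under these assumptions the combined vector has four times the dimension of each encoding, and it would typically be passed on to a classifier.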