“…This enables a more well-defined task, since determining the truthfulness of a fact w.r.t a Task # Examples Open Test Cons. Summarization -FRANK (Pagnoni et al, 2021) 671 + 33.2% -SummEval (Fabbri et al, 2021a) 1,600 -81.6% -MNBM (Maynez et al, 2020) 2,500 -10.2% -QAGS-CNNDM 235 -48.1% -QAGS-XSum 239 -48.5% Dialogue -BEGIN (Dziri et al, 2021) 836 + 33.7% -Q 2 (Honovich et al, 2021) 1,088 -57.7% -DialFact (Gupta et al, 2021) 8,689 + 38.5% Fact Verification -FEVER (Thorne et al, 2018) 18,209 -35.1% -VitaminC (Schuster et al, 2021) 63,054 + 49.9% Paraphrasing -PAWS (Zhang et al, 2019) 8,000 + 44.2% general "real world" is subjective and depends on the knowledge, values and beliefs of the subject (Heidegger, 2001). This definition follows similar strictness in Textual Entailment, Question Answering, Summarization and other tasks where comprehension is based on a given grounding text, irrespective of contradiction with other world knowledge.…”