Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing 2021
DOI: 10.18653/v1/2021.emnlp-main.307
|View full text |Cite
|
Sign up to set email alerts
|

WinoLogic: A Zero-Shot Logic-based Diagnostic Dataset for Winograd Schema Challenge

Abstract: The recent success of neural language models (NLMs) on the Winograd Schema Challenge has called for further investigation of the commonsense reasoning ability of these models. Previous diagnostic datasets rely on crowd-sourcing which fails to provide coherent commonsense crucial for solving WSC problems. To better evaluate NLMs, we propose a logic-based framework that focuses on highquality commonsense knowledge. Specifically, we identify and collect formal knowledge formulas verified by theorem provers and tr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0
1

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
5

Relationship

0
10

Authors

Journals

citations
Cited by 10 publications
(5 citation statements)
references
References 19 publications
0
4
0
1
Order By: Relevance
“…sentences Winograd Schema Pronoun resolution 285 sentences Expert construction Challenge (WSC) [82] Winogrande [115] Cloze task 44,000 Crowd sourcing. Winologic [58] Logical explanation 273 problems Expert construction of pronoun resolution Winowhy [157] Justify pronoun 276 questions Crowd sourcing resolution Winventor [66] Pronoun resolution 848 sentences Synthesized. WordCraft [68] Construct new 3147 combinations Synthesized from entity from old ones existing resource ZUCC [153] Predict next observation 14,000 examples Synthesized in interactive fiction from interactive fiction…”
Section: Taskmentioning
confidence: 99%
“…sentences Winograd Schema Pronoun resolution 285 sentences Expert construction Challenge (WSC) [82] Winogrande [115] Cloze task 44,000 Crowd sourcing. Winologic [58] Logical explanation 273 problems Expert construction of pronoun resolution Winowhy [157] Justify pronoun 276 questions Crowd sourcing resolution Winventor [66] Pronoun resolution 848 sentences Synthesized. WordCraft [68] Construct new 3147 combinations Synthesized from entity from old ones existing resource ZUCC [153] Predict next observation 14,000 examples Synthesized in interactive fiction from interactive fiction…”
Section: Taskmentioning
confidence: 99%
“…Algumas alternativas encontradas para a ampliação do volume de esquemas são o Winoflexi [Isaak;Michael, 2019], que utiliza crowdsourcing para o desenvolvimento de novas sentenças e o Winventor [Nicos;Michael, 2020] que busca automatizar a criação de esquemas. No Winologic [He et al, 2021] novas frases foram construídas utilizando teoremas lógicos.…”
Section: Winograd E a Evolução Dos Benchmarksunclassified
“…WinoWhy and WinoLogic (He et al, 2021) are collections of correct and incorrect natural language explanations of all Winograd Schemas in the Wsc273 dataset. The goal of both datasets is to determine whether systems that can correctly answer Winograd Schemas are also capable to identify the correct explanations for their choice.…”
Section: Datasets Of Explanationsmentioning
confidence: 99%