2022
DOI: 10.48550/arxiv.2209.07760
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Possible Stories: Evaluating Situated Commonsense Reasoning under Multiple Possible Scenarios

Abstract: The possible consequences for the same context may vary depending on the situation we refer to. However, current studies in natural language processing do not focus on situated commonsense reasoning under multiple possible scenarios. This study frames this task by asking multiple questions with the same set of possible endings as candidate answers, given a short story text. Our resulting dataset, Possible Stories, consists of more than 4.5K questions over 1.3K story texts in English. We discover that even curr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 38 publications
0
2
0
Order By: Relevance
“…identifying the rhyme scheme in a poem or increasing the contrast in an image. 6 In the other direction, there can be evidence of various kinds, beyond performance on a commonsense benchmark, that an AI system does or does not use commonsense reasoning in carrying out a task. In a knowledge-based system, one can often see what knowledge is being used and how; if commonsense knowledge is being used in a significant way in carrying out a particular task, then presumably this is to some extent a commonsense task.…”
Section: An Untrue Claim About Commonsense Knowledgementioning
confidence: 99%
See 1 more Smart Citation
“…identifying the rhyme scheme in a poem or increasing the contrast in an image. 6 In the other direction, there can be evidence of various kinds, beyond performance on a commonsense benchmark, that an AI system does or does not use commonsense reasoning in carrying out a task. In a knowledge-based system, one can often see what knowledge is being used and how; if commonsense knowledge is being used in a significant way in carrying out a particular task, then presumably this is to some extent a commonsense task.…”
Section: An Untrue Claim About Commonsense Knowledgementioning
confidence: 99%
“…Size Construction PIQA [12] Physical interaction QA 20,000 questions Crowd sourcing Possible Stories [6] Counterfactual 1313 texts Crowd sourcing narratives 4533 questions PROST [5] Physical reasoning 18,736 questions Expert-written cloze task template. ProtoQA [15] Reasoning about 9700 questions Crowd sourcing prototypical situations ReCoRD [159] Cloze question 120,000 questions Extracted from about news stories online news source.…”
Section: Taskmentioning
confidence: 99%