Proceedings of the 28th International Conference on Computational Linguistics 2020
DOI: 10.18653/v1/2020.coling-main.467
|View full text |Cite
|
Sign up to set email alerts
|

CosMo: Conditional Seq2Seq-based Mixture Model for Zero-Shot Commonsense Question Answering

Abstract: Commonsense reasoning refers to the ability of evaluating a social situation and acting accordingly. Identification of the implicit causes and effects of a social context is the driving capability which can enable machines to perform commonsense reasoning. The dynamic world of social interactions requires context-dependent on-demand systems to infer such underlying information. However, current approaches in this realm lack the ability to perform commonsense reasoning upon facing an unseen situation, mostly du… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
4
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(5 citation statements)
references
References 24 publications
0
4
0
Order By: Relevance
“…Recent approaches have tried to develop models with the human psychological and behavioural capabilities (Jiang et al, 2021;Botzer et al, 2022;Lourie et al, 2021). Other approaches targeted identifying implicit social paradigms by developing sequence generation models (Moghimifar et al, 2020;Bosselut et al, 2019). However, the task of socio-cultural norm discovery has been overlooked, mostly due to the lack of proper annotated data (Fung et al, 2022).…”
Section: Related Workmentioning
confidence: 99%
“…Recent approaches have tried to develop models with the human psychological and behavioural capabilities (Jiang et al, 2021;Botzer et al, 2022;Lourie et al, 2021). Other approaches targeted identifying implicit social paradigms by developing sequence generation models (Moghimifar et al, 2020;Bosselut et al, 2019). However, the task of socio-cultural norm discovery has been overlooked, mostly due to the lack of proper annotated data (Fung et al, 2022).…”
Section: Related Workmentioning
confidence: 99%
“…"xNeed" and "xIntent" are chosen for CSKG construction, E vv , since they deal with what is needed or intended for the event to occur, while "xWant" and "xEffect" for scoring the natural language actions, E va , since they deal with what the player would do following the event. We further set n hop = 1 and n gen = 2 from the observation that they are good enough for zero-shot commonsense question answering Moghimifar et al, 2020). During the online training of the agent, we freeze the parameters for COMET.…”
Section: B Experiments Setupmentioning
confidence: 99%
“…Tony may feel afraid and want to run away because of what he has done wrong. Atomic models are mainly generated by seq2seq 11 (Sequence to Sequence).…”
Section: Knowledge Base Acquisitionmentioning
confidence: 99%