2020
DOI: 10.48550/arxiv.2010.10833
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

KnowDis: Knowledge Enhanced Data Augmentation for Event Causality Detection via Distant Supervision

Abstract: Modern models of event causality detection (ECD) are mainly based on supervised learning from small hand-labeled corpora. However, hand-labeled training data is expensive to produce, low coverage of causal expressions and limited in size, which makes supervised methods hard to detect causal relations between events. To solve this data lacking problem, we investigate a data augmentation framework for ECD, dubbed as Knowledge Enhanced Distant Data Augmentation (KnowDis). Experimental results on two benchmark dat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 23 publications
0
3
0
Order By: Relevance
“…For example, the largest widely used public ECI dataset is the EventStoryLine Corpus (Caselli and Vossen 2017), which contains 258 documents consisting of 4,316 sentences, and only 1,770 out of 7,805 event pairs are annotated as causal relations. This situation poses challenges to existing data-hungry deep learning-based approaches for ECI tasks (Zuo et al 2020;Cao et al 2021;, which mainly utilize language models to model sentence context and treat the ECI task as a binary classification problem. Therefore, how to efficiently utilize limited data is one essential problem to be solved for ECI.…”
Section: Intra-sentence Causalitymentioning
confidence: 99%
See 1 more Smart Citation
“…For example, the largest widely used public ECI dataset is the EventStoryLine Corpus (Caselli and Vossen 2017), which contains 258 documents consisting of 4,316 sentences, and only 1,770 out of 7,805 event pairs are annotated as causal relations. This situation poses challenges to existing data-hungry deep learning-based approaches for ECI tasks (Zuo et al 2020;Cao et al 2021;, which mainly utilize language models to model sentence context and treat the ECI task as a binary classification problem. Therefore, how to efficiently utilize limited data is one essential problem to be solved for ECI.…”
Section: Intra-sentence Causalitymentioning
confidence: 99%
“…Such approaches rely on the domain knowledge of human. Deep learning-based approaches for ECI (Kadowaki et al 2019;Zuo et al 2020) leverage pretrained language models (e.g., BERT (Devlin et al 2018)) and commonsense knowledge sources (e.g., ConceptNet (Speer, Chin, and Havasi 2017)) to improve the performance. To deal with implicit causal relations, Cao et al (2021) conducts a descriptive graph induction module combining external knowledge and achieves promising results.…”
Section: Related Workmentioning
confidence: 99%
“…These methods involve the generation of supplementary training data through knowledge graph supervision or the direct incorporation of knowledge graph triplets associated with events into textual contexts. Additionally, knowledge bases like FrameNet [18], WordNet [19], and VerbNet [20] have also been employed to extract external causal knowledge for the Event Causality Identification (ECI) task [21,22]. These lexical knowledge bases contribute to augmenting the understanding of causal relationships in ECI through their diverse information structures.…”
Section: Introductionmentioning
confidence: 99%