“…(2) Self-training first trains a model on manually labeled data, then uses that model to automatically label unlabeled data, and finally leverages both the manually and automatically labeled data to enhance itself (Xie et al., 2019, 2020). It shows promising results in many SpanID tasks, including NER (Wang et al., 2020) and propaganda detection (Hou et al., 2021). (3) Machine reading comprehension (MRC) (Seo et al., 2016; Chen et al., 2017) was originally studied for question answering, while recent trends have shown great advantages of formulating NLP tasks as MRC problems. In this context, NER (Li et al., 2019a), event detection, and summarization (McCann et al., 2018) are also reported to benefit from the MRC paradigm.…”
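The three-step self-training loop described above (train on manual labels, pseudo-label unlabeled data, retrain on the union) can be sketched as follows. This is a minimal toy illustration, not the setup of the cited works: the nearest-centroid model, the 1-D data, and the confidence threshold are all hypothetical choices made for brevity.

```python
# Self-training sketch (toy 1-D data and nearest-centroid model are
# hypothetical, not the cited works' setup).

def train_centroids(points, labels):
    """Step 1/3: fit a 1-D nearest-centroid model (mean per class)."""
    centroids = {}
    for lab in set(labels):
        vals = [p for p, l in zip(points, labels) if l == lab]
        centroids[lab] = sum(vals) / len(vals)
    return centroids

def predict(centroids, x):
    """Return (label, confidence); confidence is the margin between the
    nearest and second-nearest centroid."""
    dists = sorted((abs(x - c), lab) for lab, c in centroids.items())
    (d1, lab), (d2, _) = dists[0], dists[1]
    return lab, d2 - d1

def self_train(labeled_x, labeled_y, unlabeled_x, threshold=1.0):
    model = train_centroids(labeled_x, labeled_y)   # step 1: supervised fit
    pseudo_x, pseudo_y = [], []
    for x in unlabeled_x:                           # step 2: auto-label
        lab, margin = predict(model, x)
        if margin >= threshold:                     # keep only confident ones
            pseudo_x.append(x)
            pseudo_y.append(lab)
    # step 3: retrain on manual + pseudo labels together
    return train_centroids(labeled_x + pseudo_x, labeled_y + pseudo_y)

model = self_train([0.0, 1.0, 9.0, 10.0], ["A", "A", "B", "B"],
                   [0.5, 9.5, 5.0])
print(predict(model, 2.0)[0])  # → A
```

Note that the ambiguous unlabeled point (5.0, equidistant from both classes) falls below the confidence threshold and is excluded, which is the standard guard against reinforcing the model's own errors.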