Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics 2019
DOI: 10.18653/v1/p19-1394
|View full text |Cite
|
Sign up to set email alerts
|

Large Dataset and Language Model Fun-Tuning for Humor Recognition

Abstract: The task of humor recognition has attracted a lot of attention recently due to the urge to process large amounts of user-generated texts and rise of conversational agents. We collected a dataset of jokes and funny dialogues in Russian from various online resources and complemented them carefully with unfunny texts with similar lexical properties. The dataset comprises of more than 300,000 short texts, which is significantly larger than any previous humor-related corpus. Manual annotation of about 2,000 items p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
16
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
5
5

Relationship

0
10

Authors

Journals

citations
Cited by 40 publications
(18 citation statements)
references
References 16 publications
0
16
0
Order By: Relevance
“…Yang et al (2015) scraped puns from the Pun of the Day website 1 and negative examples from various news websites. There is also work on the curation of non-English humor datasets (Zhang et al, 2019;Blinov et al, 2019). Hasan et al (2019) developed UR-FUNNY, a multimodal humor dataset that involves text, audio and video information extracted from TED talks.…”
Section: Related Workmentioning
confidence: 99%
“…Yang et al (2015) scraped puns from the Pun of the Day website 1 and negative examples from various news websites. There is also work on the curation of non-English humor datasets (Zhang et al, 2019;Blinov et al, 2019). Hasan et al (2019) developed UR-FUNNY, a multimodal humor dataset that involves text, audio and video information extracted from TED talks.…”
Section: Related Workmentioning
confidence: 99%
“…In this section, we briefly describe other work in this area. In this approach (Blinov et al, 2019), the authors have used universal language model fine-tuning method for humour recognition. Convolutional neural networks (CNN) have also been used for this task by (Chen and Soo, 2018) whereas (Weller and Seppi, 2019) used transformers to classify humour.…”
Section: Related Workmentioning
confidence: 99%
“…Owing to the subjective nature of humor, there have been recent efforts in collecting datasets for humor; Blinov et al, (2019) collected a dataset of jokes and funny dialogues in Russian from various online resources, and complemented them carefully with unfunny texts with similar lexical properties. They developed a fine-tuned language model for text classification with a significant gain over baseline methods.…”
Section: Related Workmentioning
confidence: 99%