2017
DOI: 10.48550/arxiv.1710.00477
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A Crowd-Annotated Spanish Corpus for Humor Analysis

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(4 citation statements)
references
References 0 publications
0
4
0
Order By: Relevance
“…Typically, these datasets come with a binary labelling, i. e., texts are annotated on whether they are humorous or not. This is different in, e. g., #HashtagWars [24] and the dataset by Castro et al [25], where texts are annotated with different degrees of humour. For task 7 of SemEval-2017 [31], English puns have been labelled for the location and interpretation of the word being responsible for the pun.…”
Section: Textual Humour Recognitionmentioning
confidence: 93%
See 2 more Smart Citations
“…Typically, these datasets come with a binary labelling, i. e., texts are annotated on whether they are humorous or not. This is different in, e. g., #HashtagWars [24] and the dataset by Castro et al [25], where texts are annotated with different degrees of humour. For task 7 of SemEval-2017 [31], English puns have been labelled for the location and interpretation of the word being responsible for the pun.…”
Section: Textual Humour Recognitionmentioning
confidence: 93%
“…Another dataset often used is the Pun of the Day dataset created by Yang et al [23], consisting of puns scraped from a website and non-humorous data acquired from, amongst others, news websites. Recently, social media data became a popular source for textual humour datasets, with datasets based on tweets [24,25,26] and Reddit posts [11] being proposed. While the majority of the available datasets are in English, corpora in Italian [27], Spanish [25], Chinese [28], and Russian [29,30] have been created.…”
Section: Textual Humour Recognitionmentioning
confidence: 99%
See 1 more Smart Citation
“…Textual Content-related features Key concepts (e.g. situation) on which the humorous story is built [8], [9], [24], [39] Incongruity Disconnection Semantic disconnection (e.g., contrast) between two content words in a sentence [8], [9], [14], [45] Intra-sentence repetition Repeating similar objects in a sentence [8], [9], [14], [39], [45] Human-centeredness Polarity Positive/negative orientation of emotion [8], [9], [12], [45], [46] Subjectivity Subjective/objective orientation [9], [45], [47], [48] Phonetic style Alliteration Occurrences of the same letter or sound at the beginning of a group or words [8], [9], [12], [20], [45] Rhyme Repetition of similar sounds in the final stressed syllables of a group of words [9], [12], [20], [45] Build-ups Inter-sentence repetition Concepts (e.g., a person) that have been previously told before a punchline [4], [6], [49] Audio Volume Volume variation Variation in volume: softer and louder [16], [25], [26], [50] Pitch Stress Vocal stress by raising pitch [15], [16], [25], [26],…”
Section: Humor-related Features Subcategory Description Referencesmentioning
confidence: 99%