Companion of the the Web Conference 2018 on the Web Conference 2018 - WWW '18 2018
DOI: 10.1145/3184558.3191640
|View full text |Cite
|
Sign up to set email alerts
|

Detecting Biased Statements in Wikipedia

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
54
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 51 publications
(55 citation statements)
references
References 8 publications
1
54
0
Order By: Relevance
“…Among those who tackle NPOV violations in Wikipedia, some rely on available datasets (Vincze, 2013), others perform manual annotation (Hube and Fetahu, 2018;Ganter and Strube, 2009;Herzig et al, 2011;Al Khatib et al, 2012), still others attempt to automatically extract labeled examples (Ganter and Strube, 2009;Recasens et al, 2013;Hube and Fetahu, 2018). Our approach is in line with the latter.…”
Section: Corporamentioning
confidence: 76%
“…Among those who tackle NPOV violations in Wikipedia, some rely on available datasets (Vincze, 2013), others perform manual annotation (Hube and Fetahu, 2018;Ganter and Strube, 2009;Herzig et al, 2011;Al Khatib et al, 2012), still others attempt to automatically extract labeled examples (Ganter and Strube, 2009;Recasens et al, 2013;Hube and Fetahu, 2018). Our approach is in line with the latter.…”
Section: Corporamentioning
confidence: 76%
“…POS tag are successfully employed in determining text genre [3]. Similarly, POS tags have shown to provide insights in determining biased statements in [15].…”
Section: Statement Representationmentioning
confidence: 99%
“…LIWC Word Functions. LIWC text analysis [20] has been successfully employed in a number of tasks that capture subjectivity of text, such as analyzing language in fake news [22], and additionally as shown in [15], LIWC features when used together with the context of the n-grams proves to provide a high improvement over existing approaches [23] in detecting biased statements.…”
Section: Statement Representationmentioning
confidence: 99%
“…Despite the domain difference, this task is related to ours because Wikipedia is also a source of information which should contain unbiased language. Systems that were employed for this task used a combination of linguistic features, such as POS n-grams and binary features representing the usage of bias words, assertive verbs, factive verbs, hedges and sentiment features (Recasens et al, 2013;Hube and Fetahu, 2018). Most of these features were derived from existing lexicons.…”
Section: Related Workmentioning
confidence: 99%
“…Nouns frequently provide thematic and topical information about a text and adverbs and adjectives can indicate a level of subjectivity. Modals such as would, could and must could additionally carry assertiveness that could be related to bias (Recasens et al, 2013;Hube and Fetahu, 2018). We tried modelling this by extracting n-grams that followed certain patterns such as a) nouns in the middle of a trigram b) particles in the middle of a trigram c) modal verbs in the middle of a trigram d) nouns and their closest preceding adjectives/adverbs e) adjectives/adverbs and words after them.…”
Section: Pos-based N-gram Filteringmentioning
confidence: 99%