What Sounds “Right” to Me? Experiential Factors in the Perception of Political Ideology

Shen, Qinlan; Rosé, Carolyn Penstein

doi:10.18653/v1/2021.eacl-main.152

Cited by 6 publications

(6 citation statements)

References 31 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Joseph et al (2017) showed that how one constructs annotation tasks can significantly impact (supervised) model performance and one's assessment of it. Further, as demonstrated by Shen and Rose (2021) on the closely related task of inferring political ideology, annotator expertise and subjectivity also play an important role in the quality of annotated data.…”

Section: Stance Detection and Annotationmentioning

confidence: 99%

“…The present work complements these prior efforts by delving into other questions of annotator disagreement and inference. Whereas prior work has considered disagreement arising from task differences (Joseph et al, 2017), or properties of the annotators (Shen and Rose, 2021), we control for both of these factors, taking a single task and a relatively homogenous set of expert annotators. Instead, extending recent work studying prediction on multiple targets (van den Berg et al, 2019;, we study how agreement varies depending on the target selected, and how even within a single task design, annotators can come to rely on distinct subsets of information.…”

Section: Stance Detection and Annotationmentioning

confidence: 99%

See 1 more Smart Citation

(Mis)alignment Between Stance Expressed in Social Media Data and Public Opinion Surveys

Joseph¹,

Shugars²,

Gallagher³

et al. 2021

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing

View full text Add to dashboard Cite

Stance detection, which aims to determine whether an individual is for or against a target concept, promises to uncover public opinion from large streams of social media data. Yet even human annotation of social media content does not always capture "stance" as measured by public opinion polls. We demonstrate this by directly comparing an individual's self-reported stance to the stance inferred from their social media data. Leveraging a longitudinal public opinion survey with respondent Twitter handles, we conducted this comparison for 1,129 individuals across four salient targets. We find that recall is high for both "Pro" and "Anti" stance classifications but precision is variable in a number of cases. We identify three factors leading to the disconnect between text and author stance: temporal inconsistencies, differences in constructs, and measurement errors from both survey respondents and annotators. By presenting a framework for assessing the limitations of stance detection models, this work provides important insight into what stance detection truly measures.

show abstract

Section: Stance Detection and Annotationmentioning

confidence: 99%

Section: Stance Detection and Annotationmentioning

confidence: 99%

(Mis)alignment Between Stance Expressed in Social Media Data and Public Opinion Surveys

Joseph¹,

Shugars²,

Gallagher³

et al. 2021

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing

View full text Add to dashboard Cite

show abstract

“…However, the contextual information is local and does not take into account global subject content under discussion. Shen and Rose (2021) investigate the influence of annotators’ political beliefs and familiarity of the source towards annotations of identifying political orientation of Reddit posts. Chung et al (2019) present a dataset that collects both hateful and toxic messages along with potential repairs that can provide counter-narrative information with fact-based information and non-offensive language to de-escalate hateful discourse.…”

Section: Background and Related Workmentioning

confidence: 99%

A study towards contextual understanding of toxicity in online conversations

Madhyastha,

Founta,

Specia

2023

Nat. Lang. Eng.

View full text Add to dashboard Cite

Identifying and annotating toxic online content on social media platforms is an extremely challenging problem. Work that studies toxicity in online content has predominantly focused on comments as independent entities. However, comments on social media are inherently conversational, and therefore, understanding and judging the comments fundamentally requires access to the context in which they are made. We introduce a study and resulting annotated dataset where we devise a number of controlled experiments on the importance of context and other observable confounders – namely gender, age and political orientation – towards the perception of toxicity in online content. Our analysis clearly shows the significance of context and the effect of observable confounders on annotations. Namely, we observe that the ratio of toxic to non-toxic judgements can be very different for each control group, and a higher proportion of samples are judged toxic in the presence of contextual information.

show abstract

“…Our findings from simulations provide directions for user experiments. Human perception -and thus human annotators' interpretation -is influenced by human factors such as preferences, cultural differences, bias, domain expertise, fatigue, time on task, or mood at annotation time (Alm, 2012;Amidei et al, 2020;Shen and Rose, 2021). Generally, experts with long-standing practice or in-depth knowledge may also not share consensus (Plank et al, 2014).…”

Section: Introductionmentioning

confidence: 99%

Proceedings of the Second Workshop on Bridging Human--Computer Interaction and Natural Language Processing

2022

View full text Add to dashboard Cite

An existing domain taxonomy for normalizing content is often assumed when discussing approaches to information extraction, yet often in real-world scenarios there is none. When one does exist, as the information needs shift, it must be continually extended. This is a slow and tedious task, and one that does not scale well. Here we propose an interactive tool that allows a taxonomy to be built or extended rapidly and with a human in the loop to control precision. We apply insights from text summarization and information extraction to reduce the search space dramatically, then leverage modern pretrained language models to perform contextualized clustering of the remaining concepts to yield candidate nodes for the user to review. We show this allows a user to consider as many as 200 taxonomy concept candidates an hour to quickly build or extend a taxonomy to better fit information needs.

show abstract

What Sounds “Right” to Me? Experiential Factors in the Perception of Political Ideology

Cited by 6 publications

References 31 publications

(Mis)alignment Between Stance Expressed in Social Media Data and Public Opinion Surveys

(Mis)alignment Between Stance Expressed in Social Media Data and Public Opinion Surveys

A study towards contextual understanding of toxicity in online conversations

Proceedings of the Second Workshop on Bridging Human--Computer Interaction and Natural Language Processing

Contact Info

Product

Resources

About