Proceedings of the 11th Knowledge Capture Conference 2021
DOI: 10.1145/3460210.3493553

Capturing Contentiousness

Abstract: Recent initiatives by cultural heritage institutions in addressing outdated and offensive language used in their collections demonstrate the need for further understanding into when terms are problematic or contentious. This paper presents an annotated dataset of 2,715 unique samples of terms in context, drawn from a historical newspaper archive, collating 21,800 annotations of contentiousness from expert and crowd workers. We describe the contents of the corpus by analysing inter-rater agreement and difference…

Cited by 2 publications (3 citation statements)
References 10 publications
“…The development of a knowledge graph in this paper extends our previous work, in which we constructed a crowdsource-annotated corpus of contentious terms in contexts taken from historical newspapers [5]. The corpus was used for machine-learning based detection of contentiousness.…”
Section: Related Work (mentioning)
confidence: 81%
“…For example, institutions provide explanations about inappropriate terminology in content warnings accompanying online collections or publish general statements on their websites. There is expert knowledge about problematic terminology that GLAM and other actors have produced, however, this knowledge is often detached from digital collections [18]. While object descriptions in collections are structured and often interconnected in knowledge organisation systems (KOS) used by heritage institutions, the domain expertise and discussions about problematic words in these collections exist in separate publications in different formats.…”
Section: Introduction (mentioning)
confidence: 99%