2021
DOI: 10.1155/2021/7115167
A General Method for Transferring Explicit Knowledge into Language Model Pretraining

Abstract: Recently, pretrained language models such as BERT and XLNet have rapidly advanced the state of the art on many NLP tasks. They model implicit semantic information between words in a text, but only at the token level, without considering background knowledge. Intuitively, background knowledge influences the efficacy of text understanding. Inspired by this, we focus on improving model pretraining by leveraging external knowledge. Different from recent research that optimizes pretraining m…

Cited by 1 publication (1 citation statement)
References 23 publications
“…Explicit integration of knowledge resources into language models can be roughly categorized into fusion-based approaches and language-modeling-based approaches. Fusion-based approaches (Peters et al., 2019; Wang et al., 2021; Yan et al., 2021) typically perform knowledge integration by combining language-model representations with representations extracted from knowledge bases. Compared to language-modeling-based approaches, such as the one we explore, they rely on aligned data and are typically applied during the pre-training stage…”
Section: Related Work
confidence: 99%
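
The fusion-based approaches named in the statement above combine language-model token representations with representations drawn from a knowledge base. Below is a minimal sketch in PyTorch of one plausible fusion layer, assuming precomputed LM token states and knowledge-base entity embeddings already aligned to token positions; the names (FusionLayer, kb_proj, the gating design) and dimensions are illustrative assumptions, not the exact mechanism of Peters et al. (2019) or the other cited works.

# Hypothetical sketch of a fusion-based knowledge-integration layer.
# Assumes token-aligned KB entity embeddings; not the method of any cited paper.
import torch
import torch.nn as nn

class FusionLayer(nn.Module):
    """Combines LM token states with aligned KB entity embeddings via a gate."""
    def __init__(self, hidden_dim: int, kb_dim: int):
        super().__init__()
        self.kb_proj = nn.Linear(kb_dim, hidden_dim)      # map KB space -> LM space
        self.gate = nn.Linear(2 * hidden_dim, hidden_dim) # per-dimension mixing gate

    def forward(self, lm_states, kb_embeds, kb_mask):
        # lm_states: (batch, seq, hidden_dim) token states from the language model
        # kb_embeds: (batch, seq, kb_dim) entity embedding per token (zeros if none)
        # kb_mask:   (batch, seq, 1) 1.0 where a token aligns to a KB entity
        kb_states = self.kb_proj(kb_embeds)
        g = torch.sigmoid(self.gate(torch.cat([lm_states, kb_states], dim=-1)))
        fused = g * kb_states + (1.0 - g) * lm_states
        # Fuse only where an entity alignment exists; pass LM states through elsewhere.
        return kb_mask * fused + (1.0 - kb_mask) * lm_states

# Toy usage with random tensors standing in for real LM/KB outputs.
batch, seq, hidden_dim, kb_dim = 2, 8, 768, 100
layer = FusionLayer(hidden_dim, kb_dim)
lm_states = torch.randn(batch, seq, hidden_dim)
kb_embeds = torch.randn(batch, seq, kb_dim)
kb_mask = (torch.rand(batch, seq, 1) > 0.5).float()
print(layer(lm_states, kb_embeds, kb_mask).shape)  # torch.Size([2, 8, 768])

The gate lets the model decide, per dimension, how much knowledge signal to inject at each entity-linked position; this also illustrates why such approaches need aligned data, since kb_embeds and kb_mask presuppose an entity-linking step over the pretraining corpus.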