Yang Trista Cao scite author profile

Yang Trista Cao

3Publications

9Citation Statements Received

59Citation Statements Given

How they've been cited

How they cite others

Affiliations

Publications

Order By: Most citations

Theory-Grounded Measurement of U.S. Social Stereotypes in English Language Models

Cao¹,

Sotnikova²,

Daumé³

et al. 2022

View full text Add to dashboard Cite

NLP models trained on text have been shown to reproduce human stereotypes, which can magnify harms to marginalized groups when systems are deployed at scale. We adapt the Agency-Belief-Communion (ABC) stereotype model of Koch et al. (2016) from social psychology as a framework for the systematic study and discovery of stereotypic group-trait associations in language models (LMs). We introduce the sensitivity test (SeT) for measuring stereotypical associations from language models. To evaluate SeT and other measures using the ABC model, we collect group-trait judgments from U.S.-based subjects to compare with English LM stereotypes. Finally, we extend this framework to measure LM stereotyping of intersectional identities.

show abstract

Theory-Grounded Measurement of U.S. Social Stereotypes in English Language Models

Cao¹,

Sotnikova²,

Daumé³

et al. 2022

Preprint

View full text Add to dashboard Cite

On the Intrinsic and Extrinsic Fairness Evaluation Metrics for Contextualized Language Representations

Cao¹,

Pruksachatkun²,

Chang³

et al. 2022

Preprint

View full text Add to dashboard Cite

Multiple metrics have been introduced to measure fairness in various natural language processing tasks. These metrics can be roughly categorized into two categories: 1) extrinsic metrics for evaluating fairness in downstream applications and 2) intrinsic metrics for estimating fairness in upstream contextualized language representation models. In this paper, we conduct an extensive correlation study between intrinsic and extrinsic metrics across bias notions using 19 contextualized language models. We find that intrinsic and extrinsic metrics do not necessarily correlate in their original setting, even when correcting for metric misalignments, noise in evaluation datasets, and confounding factors such as experiment configuration for extrinsic metrics.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Yang Trista Cao

Theory-Grounded Measurement of U.S. Social Stereotypes in English Language Models

Theory-Grounded Measurement of U.S. Social Stereotypes in English Language Models

On the Intrinsic and Extrinsic Fairness Evaluation Metrics for Contextualized Language Representations

Contact Info

Product

Resources

About