Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics 2019
DOI: 10.18653/v1/p19-1246
|View full text |Cite
|
Sign up to set email alerts
|

You Write like You Eat: Stylistic Variation as a Predictor of Social Stratification

Abstract: Inspired by Labov's seminal work on stylistic variation as a function of social stratification, we develop and compare neural models that predict a person's presumed socio-economic status, obtained through distant supervision, from their writing style on social media. The focus of our work is on identifying the most important stylistic parameters to predict socioeconomic group. In particular, we show the effectiveness of morpho-syntactic features as stylistic predictors of socio-economic group, in contrast to … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

2
2
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 36 publications
2
2
0
Order By: Relevance
“…The averaged results were reported. Results For the challenging task of SES prediction, all models attain moderate performance that is consistently above chance level (Table 13), echoing previous findings (Flekova et al, 2016;Basile et al, 2019). Compared to the fine-tuned RoBERTa, the idiolectal features have filtered out some SES-related variations, which could be related to domain-specific information.…”
Section: F Additional Analysis: Characterizing Sociolectssupporting
confidence: 79%
See 3 more Smart Citations
“…The averaged results were reported. Results For the challenging task of SES prediction, all models attain moderate performance that is consistently above chance level (Table 13), echoing previous findings (Flekova et al, 2016;Basile et al, 2019). Compared to the fine-tuned RoBERTa, the idiolectal features have filtered out some SES-related variations, which could be related to domain-specific information.…”
Section: F Additional Analysis: Characterizing Sociolectssupporting
confidence: 79%
“…Dataset compilation From the test set, we created a small subset of high socioeconomic status (SES) users and low SES users by using the prices of the reviewed products as a proxy. We verified that there is a clear distinction in readability between high SES and low SES groups, which is a reliable linguistic indicator of SES (Flekova et al, 2016;Basile et al, 2019).…”
Section: F Additional Analysis: Characterizing Sociolectssupporting
confidence: 62%
See 2 more Smart Citations