2022
DOI: 10.1126/sciadv.abg2652
|View full text |Cite
|
Sign up to set email alerts
|

How to make causal inferences using texts

Abstract: Text as data techniques offer a great promise: the ability to inductively discover measures that are useful for testing social science theories with large collections of text. Nearly all text-based causal inferences depend on a latent representation of the text, but we show that estimating this latent representation from the data creates underacknowledged risks: we may introduce an identification problem or overfit. To address these risks, we introduce a split-sample workflow for making rigorous causal inferen… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
29
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 38 publications
(29 citation statements)
references
References 57 publications
0
29
0
Order By: Relevance
“…How does showing someone an ad affect how they write about a product? Analyzing text data to answer these types of causal questions is the problem addressed in the excellent article of Egami et al ( 5 ).…”
Section: Causal Inference From Text Datamentioning
confidence: 99%
See 3 more Smart Citations
“…How does showing someone an ad affect how they write about a product? Analyzing text data to answer these types of causal questions is the problem addressed in the excellent article of Egami et al ( 5 ).…”
Section: Causal Inference From Text Datamentioning
confidence: 99%
“…What should we look for in a textual code that is used for causal inference? Inspired by the framework developed by Egami et al ( 5 ), we look further at the different relationships that a code can have with the text. These ways of thinking could lead to further research and methods for causal inference from text.…”
Section: Thinking More About the Codementioning
confidence: 99%
See 2 more Smart Citations
“…We demonstrate that widely recognized, emotion-laden events have immediate but no lasting effects on interpretations of immigration and that it takes prolonged periods of rupture to change them. This event analysis exemplifies how researchers can use the output of computational text models to trace causal claims within, for example, the traditional regression model framework (Egami et al 2018, Roberts et al 2020, Fong & Grimmer 2019, Feder et al 2021, Gencoglu & Gruber 2020.…”
Section: Introductionmentioning
confidence: 98%