For now more than four decades, quantitative protest event analysis (PEA) has routinely contributed to the testing and refinement of theories on political processes from different perspectives. However, it is commonly agreed that PEA data face serious challenges regarding their data sources. Precisely, researchers applying PEA struggle with the fact that they cannot use multiple sources for large geographical areas and long time periods. As a consequence, most of the scholarship still focuses on a narrow set of European countries or the United States and does not cover the period since the early 2000s. We are bringing PEA and computational linguistics together to suggest and evaluate an approach that will enable political scientists to extend their research designs with a more efficient and at the same time reliable data collection. The approach relies on hidden topic models, word space models, and named entity recognition to identify and code protest events.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.