Around the world in three alternations

Szmrecsanyi, Benedikt; Grafmiller, Jason; Heller, Benedikt; Röthlisberger, Melanie

doi:10.1075/eww.37.2.01szm

Cited by 173 publications

(39 citation statements)

References 34 publications

“…Even though these international languages have global speech communities, dialectology and sociolinguistics continue to focus largely on sub-national dialects, often within so-called inner-circle varieties (Kachru, 1982). This paper joins recent work in taking a global approach by using geo-referenced texts (Goldhahn et al, 2012; Davies and Fuchs, 2015; Donoso and Sanchez, 2017) to represent national varieties (Szmrecsanyi et al, 2016; Calle-Martin and Romero-Barranco, 2017; Cook and Brinton, 2017; Rangel et al, 2017; Dunn, 2018a, 2019b; Tamaredo, 2018). The basic point is that in order to represent regional variation as a complete system, dialectometry must take a global perspective.…”

Section: Introduction (mentioning)

confidence: 70%

“…Most previous work relies on phonetic or phonological features (Kretzschmar, 1992, 1996; Heeringa, 2004; Labov et al, 2005; Nerbonne, 2006, 2009; Grieve et al, 2011, 2013; Nerbonne, 2011, 2015; Grieve, 2013; Nerbonne and Kretzschmar, 2013; Kretzschmar et al, 2014; Kruger and van Rooy, 2018) for the simple reason that phonetic representations are relatively straight-forward: a vowel is a vowel and the measurements are the same across varieties and languages. Previous work on syntactic variation has focused on either (i) an incomplete set of language-specific variants, ranging from only a few features to hundreds (Sanders, 2007, 2010; Szmrecsanyi, 2009, 2013, 2014; Grieve, 2011, 2012, 2016; Collins, 2012; Schilk and Schaub, 2016; Szmrecsanyi et al, 2016; Calle-Martin and Romero-Barranco, 2017; Grafmiller and Szmrecsanyi, 2018; Tamaredo, 2018) or (ii) language-independent representations such as function words (Argamon and Koppel, 2013) or sequences of part-of-speech labels (Hirst and Feiguina, 2007; Kroon et al, 2018). This forces a choice between either an ad hoc and incomplete syntactic representation or a reproducible but indirect syntactic representation.…”

Section: Introduction (mentioning)

confidence: 99%

“…While useful for visualizations, these models are difficult to evaluate against ground-truths. Another strand of work models the importance of predictor variables on the use of a particular variant, with geographic region as one possible predictor (Szmrecsanyi et al, 2016). These models are based on multivariate work in sociolinguistics that attempts to find which linguistic, social, or geographic features are most predictive of a particular variant.…”

Section: Introduction (mentioning)

confidence: 99%

Global Syntactic Variation in Seven Languages: Toward a Computational Dialectology

Dunn

2019

Front. Artif. Intell.

The goal of this paper is to provide a complete representation of regional linguistic variation on a global scale. To this end, the paper focuses on removing three constraints that have previously limited work within dialectology/dialectometry. First, rather than assuming a fixed and incomplete set of variants, we use Computational Construction Grammar to provide a replicable and falsifiable set of syntactic features. Second, rather than assuming a specific area of interest, we use global language mapping based on web-crawled and social media datasets to determine the selection of national varieties. Third, rather than looking at a single language in isolation, we model seven major languages together using the same methods: Arabic, English, French, German, Portuguese, Russian, and Spanish. Results show that models for each language are able to robustly predict the region-of-origin of held-out samples better using Construction Grammars than using simpler syntactic features. These global-scale experiments are used to argue that new methods in computational sociolinguistics are able to provide more generalized models of regional variation that are essential for understanding language variation and change at scale.

“…As to grammar specifically, we know that intra-systemic grammatical variation - that is, variation within and across varieties of the same language - is highly systematic, and that the determinants of this variation are numerous, multifactorial, and probabilistically conditioned (e.g. Gries 2003; Bresnan & Hay 2008; Tagliamonte, Durham & Smith 2014; Szmrecsanyi et al 2016). Results of such studies are generally taken to be evidence for a model of grammar that is quantitative and probabilistic.…”

Section: Introduction (mentioning)

confidence: 99%

“…But it is only in recent years that the predictions outlined in the previous paragraph have begun to be explored more systematically. We take the liberty to illustrate this trend by sketching a research project (2013-2021) based at the KU Leuven and entitled "Exploring probabilistic grammar(s) in varieties of English around the world", which investigates three syntactic alternations (see (1)-(3)) in some nine international varieties of English: British English, Canadian English, Irish English, New Zealand English, Hong Kong English, Indian English, Jamaican English, Philippine English, and Singapore English (Szmrecsanyi et al 2016).…”

Section: Introduction (mentioning)

confidence: 99%

General introduction: A comparative perspective on probabilistic variation in grammar

Grafmiller

Szmrecsanyi

Röthlisberger

et al.

2018

Glossa: A Journal of General Linguistics

Self Cite

This special collection brings together research exploring and evaluating probabilistic variation patterns from a comparative perspective, thus highlighting current work situated at the crossroads of research on usage-based theoretical linguistics, variationist linguistics, and sociolinguistics. The contributions in the collection advance our understanding of the plasticity of syntactic knowledge on the part of language users with diverse regional and/or cultural backgrounds, and demonstrate how a probabilistic approach to grammatical variation can offer insight into the scope and limits of language variation. In this general introduction to the special collection, we provide some essential background for perspective, and subsequently summarize the contributions in the collection.

Corpus Linguistics and Asian Englishes

Mukherjee

Bernaisch

2020

The Handbook of Asian Englishes
