2008
DOI: 10.1007/978-3-540-78246-9_76
|View full text |Cite
|
Sign up to set email alerts
|

Projecting Dialect Distances to Geography: Bootstrap Clustering vs. Noisy Clustering

Abstract: Abstract. Dialectometry produces aggregate distance matrices in which a distance is specified for each pair of sites. By projecting groups obtained by clustering onto geography one compares results with traditional dialectology, which produced maps partitioned into implicitly non-overlapping dialect areas. The importance of dialect areas has been challenged by proponents of continua, but they too need to compare their findings to older literature, expressed in terms of areas.Simple clustering is unstable, mean… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
45
0
5

Year Published

2009
2009
2023
2023

Publication Types

Select...
4
2
2

Relationship

2
6

Authors

Journals

citations
Cited by 50 publications
(51 citation statements)
references
References 9 publications
1
45
0
5
Order By: Relevance
“…The traditional approach to dialectology is to find the geographical distribution of known lexical alternatives (e.g. you, yall and yinz: (Labov et al, 2005;Nerbonne et al, 2008;Gonçalves and Sánchez, 2014;Doyle, 2014;Huang et al, 2015;Nguyen and Eisenstein, 2016)), the shortcoming of which is that the alternative lexical variables must be known beforehand. There have also been attempts to automatically identify such words from geotagged documents (Eisenstein et al, 2010;Ahmed et al, 2013;Eisenstein, 2015).…”
Section: Related Workmentioning
confidence: 99%
“…The traditional approach to dialectology is to find the geographical distribution of known lexical alternatives (e.g. you, yall and yinz: (Labov et al, 2005;Nerbonne et al, 2008;Gonçalves and Sánchez, 2014;Doyle, 2014;Huang et al, 2015;Nguyen and Eisenstein, 2016)), the shortcoming of which is that the alternative lexical variables must be known beforehand. There have also been attempts to automatically identify such words from geotagged documents (Eisenstein et al, 2010;Ahmed et al, 2013;Eisenstein, 2015).…”
Section: Related Workmentioning
confidence: 99%
“…By contrast, in linguistics and particularly in corpus linguistics, we find a long and strongly entrenched tradition of looking at individual features in isolation, which is partly a legacy of the discipline's philological origins, and partly a convenience issue. In any event, the one-feature-at-a-time line of analysis -exceptions such as the multidimensional register studies in the spirit of Biber (1988) notwithstanding -has yielded a corpus-based dialectology literature dominated by an abundance of what Nerbonne (2008) has referred to as 'single-feature-based studies'. We will refrain from citing actual studies here (but see the survey in ANDERWALD; SZMRECSANYI, 2009), though fictitious titles such as 'Verbal complementation in West Yorkshire English' or 'The KIT vowel in Appalachian English' are entirely realistic.…”
Section: Holistic Analysis -Why?mentioning
confidence: 99%
“…Starting in the 1970s, computationally inclined dialectologists have addressed these worries by developing a methodology known as DIALECTOMETRY. Dialectometry is defined as the branch of geolinguistics concerned with measuring, visualizing, and analyzing aggregate dialect similarities or distances as a function of properties of geographic space (for seminal work, see SÉGUY, 1971;GOEBL, 1982;GOEBL, 1984;KLEIWEG, 1999;NERBONNE, 2005;GOEBL, 2006;KLEIWEG, 2007). Dialectometrical inquiry marshals computational approaches to identify "general, seemingly hidden structures from a larger amount of features" (GOEBL; SCHILTZ, 1997, p. 13) and puts a strong emphasis on quantification, cartographic visualization, and exploratory data analysis to infer patterns from feature aggregates.…”
Section: Corpus-based Dialectometrymentioning
confidence: 99%
See 1 more Smart Citation
“…The resulting higher-order structures can be visualised, for example, via socalled composite cluster maps (see Nerbonne et al 2008 for a discussion). These highlight the fuzzy nature of dialect boundaries such that darker borders between localities represent more robust linguistic oppositions (which, thanks to the clustering-with-noise technique utilized, can be considered statistically significant).…”
Section: Classification and Validationmentioning
confidence: 99%