Spatial Evolution of Human Dialects

Burridge, James

doi:10.1103/physrevx.7.031008

Cited by 34 publications

(104 citation statements)

References 90 publications

(159 reference statements)

Supporting

Mentioning

102

Contrasting

Order By: Relevance

“…For example, protein structure is written in sequences of amino acids, a language of 20 different symbols. A large body of previous work has investigated the social aspect of linguistic systems, namely that different agents must find consensus regarding the meaning of symbols [2,3,4]. A complementary but necessary aspect of any linguistic system concerns the hidden structure within the sequences themselves, independent of communication.…”

mentioning

confidence: 99%

Emergence of order in random languages

DeGiuli¹

2019

J. Phys. A: Math. Theor.

View full text Add to dashboard Cite

We consider languages generated by weighted context-free grammars. It is shown that the behavior of large texts is controlled by saddle-point equations for an appropriate generating function. We then consider ensembles of grammars, in particular the Random Language Model of [1]. This model is solved in the replicasymmetric ansatz, which is valid in the high-temperature, disordered phase. It is shown that in the phase in which languages carry information, the replica symmetry must be broken.Many complex systems have a generative, or linguistic, aspect. For example, protein structure is written in sequences of amino acids, a language of 20 different symbols. A large body of previous work has investigated the social aspect of linguistic systems, namely that different agents must find consensus regarding the meaning of symbols [2,3,4]. A complementary but necessary aspect of any linguistic system concerns the hidden structure within the sequences themselves, independent of communication. The most basic structural property is syntax: the rules that govern how symbols can be combined to create richer structures and thus carry information. In computer science and linguistics, generative grammar has proved to be a valuable formalism to describe syntax, in a generalized sense [5,6,7]. A generative grammar consists of an alphabet of hidden symbols, an alphabet of observable symbols, and a set of rules, which allow certain combinations of symbols to be replaced by others. From an initial start symbol S, one progressively applies the rules until only observable symbols remain; any sentence produced this way is said to be grammatical, and the set of all such sentences is called the language of the grammar. The sequence of rule applications is called a derivation. The Chomsky hierarchy distinguishes grammars based on the complexity of the grammatical rules. In this work, we restrict our attention to context-free grammars (CFGs), for which derivations are trees (Figure 1).There are many theoretical results on the capabilities of CFGs [7]. However, little is known about the statistical properties of large, typical grammars. Recently, there has been increasing interest in approaching the properties of syntax from the point of arXiv:1902.07516v2 [cond-mat.dis-nn]

show abstract

mentioning

confidence: 99%

Emergence of order in random languages

DeGiuli¹

2019

J. Phys. A: Math. Theor.

View full text Add to dashboard Cite

show abstract

“…We note the close match, and also that clusters appear to have densely populated areas at their heart with boundaries lying in less densely populated areas. These features were predicted by the memory based surface tension models [20,21] upon which the current paper builds. .…”

Section: Resultsmentioning

confidence: 99%

“…The social network through which linguistic forms spread may therefore be viewed as quasi two-dimensional, provided we take a sufficiently coarse grained view of the system. This has geometrical implications for the conformity driven evolution of language; if the social network over which language evolves is two dimensional, then linguistic boundaries may be viewed as lines and by analogy with conformity driven physical systems, we would expect these to feel surface tension [20,21,24]. We also observe from Figure 6 that the distribution of connections is not isotropic: a disproportionate number of edges appear to run closer to the east-west direction than to north-south.…”

Section: Survey Datamentioning

confidence: 89%

“…Modern computers and the creation of the internet have dramatically improved data collection and analysis [4][5][6][7][8][9][10][11][12], and social media has provided a new source of linguistic data [13]. Modelling linguistic evolution has also emerged as a sub-field of statistical physics where ideas and techniques employed to relate the macroscopic behaviour of physical systems to their microscopic components have been applied [14][15][16][17][18][19][20][21][22]. However, there is a need to develop mathematical models which provide a scientific understanding of how humanlevel processes [23] give rise to the observed geographical distributions and language dynamics.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Statistical physics of language maps in the USA

et al. 2019

View full text Add to dashboard Cite

Spatial linguistic surveys often reveal well defined geographical zones where certain linguistic forms are dominant over their alternatives. It has been suggested that these patterns may be understood by analogy with coarsening in models of two dimensional physical systems. Here we investigate this connection by comparing data from the Cambridge Online Survey of World Englishes to the behaviour of a generalised zero temperature Potts model with long range interactions. The relative displacements of linguistically similar population centres reveals enhanced east-west affinity. Cluster analysis reveals three distinct linguistic zones. We find that when the interaction kernel is made anisotropic by stretching along the east-west axis, the model can reproduce the three linguistic zones for all interaction parameters tested. The model results are consistent with a view held by some linguists that, in the USA, language use is, or has been, exchanged or transmitted to a greater extent along the east-west axis than the north-south.

show abstract

“…Moreover, certain arrangements within vowel space are particularly common [12][13][14]. Vowel systems, like most elements of languages, evolve over time and may therefore be viewed as dynamical systems coupled to human social dynamics, and also to geography and social networks [15][16][17]. Cross-linguistic similarities suggest that their internal dynamics may play a particularly powerful role, and numerous models have been proposed [13,14,[18][19][20][21][22][23][24][25].…”

Section: Introductionmentioning

confidence: 99%

Brownian dynamics for the vowel sounds of human language

Burridge

Vaux

2020

Phys. Rev. Research

View full text Add to dashboard Cite

We present a model for the evolution of vowel sounds in human languages, in which words behave as Brownian particles diffusing in acoustic space, interacting via the vowel sounds they contain. Interaction forces, derived from a simple model of the language-learning process, are attractive at short range and repulsive at long range. This generates sets of acoustic clusters, each representing a distinct sound, which form patterns with similar statistical properties to real vowel systems. Our formulation may be generalized to account for spontaneous self-actuating shifts in system structure which are observed in real languages, and to combine in one model two previously distinct theories of vowel system structure: dispersion theory, which assumes that vowel systems maximize contrasts between sounds, and quantal theory, according to which nonlinear relationships between articulatory and acoustic parameters are the source of patterns in sound inventories. By formulating the dynamics of vowel sounds using interparticle forces, we also provide a simple unified description of the linguistic notion of push and pull dynamics in vowel systems.

show abstract

Spatial Evolution of Human Dialects

Cited by 34 publications

References 90 publications

Emergence of order in random languages

Emergence of order in random languages

Statistical physics of language maps in the USA

Brownian dynamics for the vowel sounds of human language

Contact Info

Product

Resources

About