2020
DOI: 10.5334/gjgl.909
|View full text |Cite
|
Sign up to set email alerts
|

Challenges in detecting evolutionary forces in language change using diachronic corpora

Abstract: Newberry et al. (Detecting evolutionary forces in language change, Nature 551, 2017) tackle an important but difficult problem in linguistics, the testing of selective theories of language change against a null model of drift. Having applied a test from population genetics (the Frequency Increment Test) to a number of relevant examples, they suggest stochasticity has a previously under-appreciated role in language evolution. We replicate their results and find that while the overall observation holds, results … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

5
38
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 18 publications
(43 citation statements)
references
References 65 publications
5
38
0
Order By: Relevance
“…Note, however, that the FIT is not applicable for approximately 120 of the 1000 generated time series, since they fail the normality test. As was shown in previous analyses (Feder et al, 2014;Karjus et al, 2020), the statistical power of the FIT is low with very small selection coefficients. At β = 0.001, most simulated time series are virtually indistinguishable from stochastic drift (6% accurate and 8.5% inapplicable).…”
Section: Critical Parameter Analysissupporting
confidence: 66%
See 3 more Smart Citations
“…Note, however, that the FIT is not applicable for approximately 120 of the 1000 generated time series, since they fail the normality test. As was shown in previous analyses (Feder et al, 2014;Karjus et al, 2020), the statistical power of the FIT is low with very small selection coefficients. At β = 0.001, most simulated time series are virtually indistinguishable from stochastic drift (6% accurate and 8.5% inapplicable).…”
Section: Critical Parameter Analysissupporting
confidence: 66%
“…On the one hand, there are still several issues with the FIT that we have not addressed in the current study, and, on the other hand, the TSC itself is not without flaws either. We highlight two additional problems of the FIT mentioned in the literature (Karjus et al, 2020). First, the FIT assumes a constant selection coefficient β for the entire investigated period.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…Analysis of natural language from a complex systems perspective has provided new insights into statistical properties of language, such as statistical laws [ 1 , 2 , 3 , 4 , 5 , 6 , 7 , 8 , 9 ], networks [ 10 , 11 , 12 , 13 , 14 ], language change [ 15 , 16 , 17 , 18 , 19 , 20 ], quantification of information content [ 21 , 22 , 23 , 24 ], or the role of syntactic structures [ 25 ] or punctuation [ 26 ], etc. In particular, the availability of new and large publicly available datasets such as the google-ngram data [ 27 ], the full Wikipedia dataset [ 28 , 29 ], or Twitter [ 30 ] opened the door for new large-scale quantitative approaches.…”
Section: Introductionmentioning
confidence: 99%