Viveka Velupillai scite author profile

This paper describes a computerized alternative to glottochronology for estimating elapsed time since parent languages diverged into daughter languages. The method, developed by the Automated Similarity Judgment Program (ASJP) consortium, is different from glottochronology in four major respects: (1) it is automated and thus is more objective, (2) it applies a uniform analytical approach to a single database of worldwide languages, (3) it is based on lexical similarity as determined from Levenshtein (edit) distances rather than on cognate percentages, and (4) it provides a formula for date calculation that mathematically recognizes the lexical heterogeneity of individual languages, including parent languages just before their breakup into daughter languages. Automated judgments of lexical similarity for groups of related languages are calibrated with historical, epigraphic, and archaeological divergence dates for 52 language groups. The discrepancies between estimated and calibration dates are found to be on average 29% as large as the estimated dates themselves, a figure that does not differ significantly among language families. As a resource for further research that may require dates of known level of accuracy, we offer a list of ASJP time depths for nearly all the world's recognized language families and for many subfamilies. The greater the degree of linguistic differentiation within a stock, the greater is the period of time that must be assumed for the development of such differentiations.


Adding typology to lexicostatistics: A combined approach to language classification

Bakker

Müller

Velupillai

et al. 2009

110


The ASJP project aims at establishing relationships between languages on the basis of the Swadesh word list. For this purpose, lists have been collected and phonologically transcribed for almost 3,500 languages. Using a method based on the algorithm proposed by Levenshtein (1966), a custom-made computer program calculates the distances between all pairs of languages in the database. Standard software is used to express the relationships between languages graphically. The current article compares the results of our lexicon-based approach with the results of a similar exercise that takes the typological variables contained in the WALS database as a point of departure. We establish that the latter approach leads to even better results than the lexicon-based one. The best result in terms of correspondence with some well-established genetic and areal classifications, however, is attained when the lexical and typological methods are combined, especially if we select both the most stable Swadesh items and the most stable WALS variables.
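The distance computation described in the abstract above can be sketched roughly as follows. This is a minimal illustration, not the ASJP program itself: the abstract only states that the method is based on Levenshtein's algorithm, so the length normalization, the averaging over shared meanings, and the function names (`levenshtein`, `normalized_distance`, `language_distance`, `pairwise_distances`) are all assumptions made for this example. The word lists are invented placeholder strings, not data from any real language.

```python
from itertools import combinations


def levenshtein(a: str, b: str) -> int:
    """Edit distance: minimum number of insertions, deletions, substitutions."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def normalized_distance(a: str, b: str) -> float:
    """Assumed normalization: divide by the length of the longer word."""
    longest = max(len(a), len(b))
    return levenshtein(a, b) / longest if longest else 0.0


def language_distance(list_a: dict, list_b: dict) -> float:
    """Mean normalized distance over the meanings present in both lists."""
    shared = [meaning for meaning in list_a if meaning in list_b]
    if not shared:
        raise ValueError("the two word lists share no meanings")
    total = sum(normalized_distance(list_a[m], list_b[m]) for m in shared)
    return total / len(shared)


def pairwise_distances(wordlists: dict) -> dict:
    """Distance for every unordered pair of languages in the collection."""
    return {
        (x, y): language_distance(wordlists[x], wordlists[y])
        for x, y in combinations(sorted(wordlists), 2)
    }


if __name__ == "__main__":
    # Hypothetical word lists keyed by meaning (placeholder strings only).
    toy = {
        "lang_a": {"one": "tak", "two": "mun"},
        "lang_b": {"one": "tok", "two": "mun"},
        "lang_c": {"one": "sil", "two": "ero"},
    }
    for pair, dist in pairwise_distances(toy).items():
        print(pair, round(dist, 3))
```

A matrix of this kind is the sort of input that tree-building or clustering software can turn into a graphical display; the choice of normalization matters because raw edit distances grow with word length.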


Automated classification of the world's languages: a description of the method and preliminary results

Brown

Holman

Wichmann

et al. 2008


An approach to the classification of languages through automated lexical comparison is described. This method produces near-expert classifications. At the core of the approach is the Automated Similarity Judgment Program (ASJP). ASJP is applied to 100-item lists of core vocabulary from 245 globally distributed languages. The output is 29,890 lexical similarity percentages for the same number of paired languages. Percentages are used as a database in a program originally designed for generating phylogenetic trees in biology. This program yields branching structures (ASJP trees) reflecting the lexical similarity of languages. ASJP trees for languages of the sample spoken in Middle and South America show that the method is capable of grouping together on distinct branches languages of non-controversial genetic groups. In addition, ASJP sub-branching for each of nine respective genetic groups (Mayan, Mixe-Zoque, Otomanguean, Huitotoan-Ocaina, Tacanan, Chocoan, Muskogean, Indo-European, and Austro-Asiatic) agrees substantially with subgrouping for those groups produced by expert historical linguists. Among many other uses, ASJP can be applied to search for possible relationships among languages heretofore not observed or only provisionally recognized. Preliminary ASJP analysis reveals several such possible relationships for languages of Middle and South America. Expanding the ASJP database to all of the world's languages for which 100-word lists can be assembled is a realistic goal that could be achieved in a relatively short period of time, maybe less than a year.

STUF, Berlin 61 (2008) 4, 285-308

* We are grateful to André Müller for setting up the website for the references to the sources of the Swadesh lists. We would also like to thank Patience Epps for providing comments relating to the classification of languages of South America. Others who responded with comments to a first draft of this paper or in other important ways deserve our gratitude as well. These include
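As a quick consistency check on the figures in the abstract above, the number of unordered language pairs among 245 languages is 245 × 244 / 2, which matches the 29,890 similarity percentages reported. A short sketch (the variable names are chosen for this example only):

```python
from math import comb

n_languages = 245                 # sample size stated in the abstract
n_pairs = comb(n_languages, 2)    # unordered pairs: n * (n - 1) / 2

# Matches the number of similarity percentages reported in the abstract.
assert n_pairs == 29_890
print(n_pairs)
```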


An Introduction to Linguistic Typology

Velupillai

2012

187


