Frequent or contextually predictable words are often phonetically reduced, i.e. shortened and produced with articulatory undershoot. Explanations for phonetic reduction of predictable forms tend to take one of two approaches: Intelligibility-based accounts hold that talkers maximize intelligibility of words that might otherwise be difficult to recognize; productionbased accounts hold that variation reflects the speed of lexical access and retrieval in the language production system. Here we examine phonetic variation as a function of phonological neighborhood density, capitalizing on the fact that words from dense phonological neighborhoods tend to be relatively difficult to recognize, yet easy to produce.We show that words with many phonological neighbors tend to be phonetically reduced (shortened in duration and produced with more centralized vowels) in connected speech, when other predictors of phonetic variation are brought under statistical control. We argue that our findings are consistent with the predictions of production-based accounts of pronunciation variation.
Frequent words tend to shorten (see e.g. Schuchardt 1885, Hooper 1976), as do words that have a high probability of occurrence given a neighboring word (Jurafsky et al. 2001). This tendency has been cited in support of the claim that probabilities are an inherent part of grammar, and of syntax in particular. There is widespread consensus, however, that the syntax of natural languages cannot be captured in terms of item-to-item transitions (Chomsky 1957). Therefore, unless one considers probabilities of syntactic structures, rather than particular combinations of neighboring words, pronunciation variation cannot be said to reflect probabilistic effects in syntax. In this article, we report a case of pronunciation variation that reflects contextual probabilities of syntactic structures. The relevant probabilities are based on the probability of a given syntactic structure, given a particular verb. We showthat these probabilities affect American English /t,d/- deletion, as well as the durations of words and phrases. Our results are consistent with the notion that knowledge of grammar includes knowledge of probabilities of syntactic structures, and that this knowledge affects language production.
Verb subcategorization frequencies (verb biases) have been widely studied in psycholinguistics and play an important role in human sentence processing. Yet available resources on subcategorization frequencies suffer from limited coverage, limited ecological validity, and divergent coding criteria. Prior estimates of verb transitivity, for example, vary widely with corpus size, coverage, and coding criteria. This article provides norming data for 281 verbs of interest to psycholinguistic research, sampled from a corpus of American English, along with a detailed coding manual. We examine the effect on transitivity bias of various coding decisions and methods of computing verb biases.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.