Probability and surprisal in auditory comprehension of morphologically complex words

Balling, Laura Winther; Baayen, R. Harald

doi:10.1016/j.cognition.2012.06.003

Cited by 72 publications

(50 citation statements)

References 52 publications

“…Recently, morphologically sensitive measures of UP have also been defined and positively assessed as predictors of lexical processing. For example, Balling and Baayen (2012) define the complex uniqueness point (CUP) as the point at which a suffixed word becomes uniquely distinguishable from all words that share the same stem, therefore considering derived morphological continuations as (morphological) competitors during recognition. Wurm (1997) focuses on the importance of prefixes to spoken word recognition and formulates the conditional root uniqueness point (CRUP) as the uniqueness point of the root given a particular prefix.…”

Section: Routes To Word Recognition (mentioning)

confidence: 99%

Non-linear processing of a linear speech stream: The influence of morphological structure on the recognition of spoken Arabic words

Gwilliams

Marantz

2015

Brain and Language

Although the significance of morphological structure is established in visual word processing, its role in auditory processing remains unclear. Using magnetoencephalography we probe the significance of the root morpheme for spoken Arabic words with two experimental manipulations. First we compare a model of auditory processing that calculates probable lexical outcomes based on whole-word competitors, versus a model that only considers the root as relevant to lexical identification. Second, we assess violations to the root-specific Obligatory Contour Principle (OCP), which disallows root-initial consonant gemination. Our results show root prediction to significantly correlate with neural activity in superior temporal regions, independent of predictions based on whole-word competitors. Furthermore, words that violated the OCP constraint were significantly easier to dismiss as valid words than probability-matched counterparts. The findings suggest that lexical auditory processing is dependent upon morphological structure, and that the root forms a principal unit through which spoken words are recognised.

“…There are indications that males and females may be differentially sensitive to word frequency (Ullman et al, 2002; Balling and Baayen, 2008), but a gender by frequency interaction is not always found (Balling and Baayen, 2012; Tabak et al, 2005, 2010). As the baldey data set combines a perfectly balanced set of subjects (10 males and 10 females) with a large number of items (2780 Dutch words), it provides a testing ground for differential effects of the two genders in lexical processing.…”

Section: The Baldey Dataset (mentioning)

confidence: 99%

The cave of shadows: Addressing the human factor with generalized additive mixed models

Baayen

Vasishth

Kliegl

et al. 2017

Journal of Memory and Language

Generalized additive mixed models are introduced as an extension of the generalized linear mixed model which makes it possible to deal with temporal autocorrelational structure in experimental data. This autocorrelational structure is likely to be a consequence of learning, fatigue, or the ebb and flow of attention within an experiment (the 'human factor'). Unlike molecules or plots of barley, subjects in psycholinguistic experiments are intelligent beings that depend for their survival on constant adaptation to their environment, including the environment of an experiment. Three data sets illustrate that the human factor may interact with predictors of interest, both factorial and metric. We also show that, especially within the framework of the generalized additive model, in the nonlinear world, fitting maximally complex models that take every possible contingency into account is ill-advised as a modeling strategy. Alternative modeling strategies are discussed for both confirmatory and exploratory data analysis.

“…We computed the positions of two different identification points. We refer to the first one as the lemma identification point (LIP); it is similar to the uniqueness point defined by Marslen-Wilson (1980), being the phoneme after which the only remaining lexical candidates are morphological continuation forms of the (prefix plus) stem (see also Balling & Baayen, 2012). An example in our corpus is bananen, "bananas", with the LIP at the second [n], at which point either the plural form of the stimulus or its singular banaan, "banana", is possible, but the competitor banaal, "banal", is no longer possible (note that this example also works for English).…”

Section: Illustrative Analyses Of the Database: Analysis 1 What Is T (mentioning)

confidence: 99%

BALDEY: A database of auditory lexical decisions

Ernestus

Cutler

2015

Quarterly Journal of Experimental Psychology

In an auditory lexical decision experiment, 5541 spoken content words and pseudowords were presented to 20 native speakers of Dutch. The words vary in phonological make-up and in number of syllables and stress pattern, and are further representative of the native Dutch vocabulary in that most are morphologically complex, comprising two stems or one stem plus derivational and inflectional suffixes, with inflections representing both regular and irregular paradigms; the pseudowords were matched in these respects to the real words. The BALDEY ("biggest auditory lexical decision experiment yet") data file includes response times and accuracy rates, with for each item morphological information plus phonological and acoustic information derived from automatic phonemic segmentation of the stimuli. Two initial analyses illustrate how this data set can be used. First, we discuss several measures of the point at which a word has no further neighbours and compare the degree to which each measure predicts our lexical decision response outcomes. Second, we investigate how well four different measures of frequency of occurrence (from written corpora, spoken corpora, subtitles, and frequency ratings by 75 participants) predict the same outcomes. These analyses motivate general conclusions about the auditory lexical decision task. The (publicly available) BALDEY database lends itself to many further analyses.
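
The lemma identification point (LIP) described in the citation statement above can be illustrated with a minimal sketch. This is not the BALDEY authors' procedure. The toy lexicon, the simplified phoneme symbols, the lemma labels, and the function names are all assumptions made for illustration. The sketch contrasts the LIP with a classical uniqueness point, using the bananen / banaan / banaal example from the quoted text.

```python
from typing import Dict, List, Optional, Tuple

Phonemes = Tuple[str, ...]
# Hypothetical toy lexicon: word -> (simplified phoneme sequence, lemma).
# The transcriptions are rough stand-ins, not taken from any database.
TOY_LEXICON: Dict[str, Tuple[Phonemes, str]] = {
    "bananen": (("b", "a", "n", "a:", "n", "@", "n"), "banaan"),
    "banaan": (("b", "a", "n", "a:", "n"), "banaan"),
    "banaal": (("b", "a", "n", "a:", "l"), "banaal"),
}


def cohort(prefix: Phonemes, lexicon: Dict[str, Tuple[Phonemes, str]]) -> List[str]:
    """Words whose transcription starts with the given phoneme prefix."""
    n = len(prefix)
    return [w for w, (ph, _) in lexicon.items() if ph[:n] == prefix]


def lemma_identification_point(
    target: str, lexicon: Dict[str, Tuple[Phonemes, str]]
) -> Optional[int]:
    """1-based phoneme position after which every remaining candidate
    shares the target's lemma; None if that never happens."""
    phonemes, lemma = lexicon[target]
    for n in range(1, len(phonemes) + 1):
        remaining = cohort(phonemes[:n], lexicon)
        if all(lexicon[w][1] == lemma for w in remaining):
            return n
    return None


def uniqueness_point(
    target: str, lexicon: Dict[str, Tuple[Phonemes, str]]
) -> Optional[int]:
    """1-based phoneme position after which the target is the only
    remaining candidate; None if it is never unique."""
    phonemes, _ = lexicon[target]
    for n in range(1, len(phonemes) + 1):
        if cohort(phonemes[:n], lexicon) == [target]:
            return n
    return None


lip = lemma_identification_point("bananen", TOY_LEXICON)
up = uniqueness_point("bananen", TOY_LEXICON)
print(lip, up)
```

In this toy lexicon the LIP of bananen falls on the fifth phoneme, the second [n], where banaal drops out but banaan remains. The classical uniqueness point falls one phoneme later, once the singular banaan is also excluded.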
