Wuggy: A multilingual pseudoword generator

Keuleers, Emmanuel; Brysbaert, Marc

doi:10.3758/brm.42.3.627

Cited by 563 publications

(435 citation statements)

References 17 publications

Supporting

Mentioning

429

Contrasting

Unclassified

Order By: Relevance

“…Thus, there were 50 word pairs in the related condition and 50 word pairs in the unrelated condition in each list. An additional set of 100 orthographically legal non-words in English was also created using Wuggy (Keuleers & Brysbaert, 2010). These non-words were preceded by a Spanish prime word of the same length (plus/minus one) as the target English non-word.…”

Section: Methodsmentioning

confidence: 99%

Non-cognate translation priming effects in the same–different task: evidence for the impact of “higher level” information

Lupker

Perea

Nakayama

2015

Language, Cognition and Neuroscience

View full text Add to dashboard Cite

Norris and colleagues have proposed that priming effects observed in the masked prime same-different task are based solely on pre-lexical orthographic information. This proposal was evaluated by examining translation priming effects from non-cognate translation equivalents using both Spanish-English and Japanese-English bilinguals in the same-different task. Although no priming was observed for Spanish-English bilinguals, who also produced very little translation priming in a lexical decision task, significant priming was observed for Japanese-English bilinguals. These results indicate that, although most of the priming in the same-different task has an orthographic basis, other types of priming effects can emerge. Therefore, while the masked prime same-different task provides a good way of investigating the nature of orthographic coding, it, like the sandwich priming technique, can also be influenced by higher level information.

show abstract

Section: Methodsmentioning

confidence: 99%

Non-cognate translation priming effects in the same–different task: evidence for the impact of “higher level” information

Lupker

Perea

Nakayama

2015

Language, Cognition and Neuroscience

View full text Add to dashboard Cite

show abstract

“…On the basis of simulations with the British National Corpus, Brysbaert and New estimated that, when used to predict word processing times, larger corpora yield significantly better frequency estimates up to a corpus size of about 16 million words, but that, for larger corpus sizes, the gains become vanishingly small if the corpus has been well sampled. 2 The Wuggy pseudoword generator (Keuleers & Brysbaert, 2010) was used to construct a corresponding pseudoword for each word in the experiment. Each pseudoword differed from the reference word by one subsyllabic segment (i.e., the onset, nucleus, or coda) per syllable.…”

Section: Edited Texts May Not Be the Best Source Of Information For Wmentioning

confidence: 99%

SUBTLEX-NL: A new measure for Dutch word frequency based on film subtitles

Keuleers

Brysbaert

New

2010

Behavior Research Methods

Self Cite

490

449

View full text Add to dashboard Cite

One of the most important predictors of word processing times is the frequency with which words have been encountered. In large-scale studies, word frequency (WF) reliably explains the largest percentage of variance of any predictor of word processing times (e.g., Baayen, Feldman, & Schreuder, 2006;Balota, Cortese, Sergent-Marshall, Spieler, & Yap, 2004; Yap & Balota, 2009). Therefore, psycholinguists have invested time in the collection of WF measures. The first list of word frequencies widely used in language research was published in English by Thorndike and Lorge (1944; see Bontrager, 1991, for a review of older frequency lists including German ones). Its main motivation was educational (helping teachers decide which words should be taught to pupils). A few decades later, Ku era and Francis (1967; KF) published a list (also for American English) that would become the frequency measure of choice for language researchers up to the present (Brysbaert & New, 2009).For the Dutch language, van Berckel, Brandt Corstius, Mokken, and van Wijngaarden (1965) collected word frequencies based on a newspaper corpus of about 50,000 words. Although this list contained additional statistical information, such as ngram sequences up to three letters, about the Dutch language, it did not gain wide adoption. The first publicly available frequency list for Dutch was edited by Uit den Boogaart (1975), who published frequencies of "written and spoken Dutch" based on a corpus of 605,733 words from written sources and 121,569 words from spoken sources. This book was superseded in 1993, when the Centre for Lexical Information (CELEX) published frequencies based on a 42-million-word corpus of written texts collected by the Institute for Dutch Lexicology (Baayen, Piepenbrock, & van Rijn, 1993). In addition to the frequencies of the different forms (e.g., play, plays), the CELEX database also contained the frequencies of the words as different parts of speech ( play as a noun vs. play as a verb) and the frequencies of the headwords or lemmas (e.g., the frequency of the nominal lemma play consisting of the summed frequency of the word form play as a noun and the word form plays as a noun). Since its publication, CELEX has been the primary source of word frequencies and other lexical information for the Dutch language. 1 For a long time, face validity was the main factor in assessing the quality of a frequency measure for research in word recognition. Two criteria were of importance: the representativeness of the sources and the size of the corpus. On both criteria, CELEX scored well. Special care had been taken to select texts from a wide variety of documents produced by the Dutch-speaking community, and the size of the corpus was larger than what was available in most other languages. However, in the past 2 years, researchers have started to measure the validity of word frequencies for research into word recognition processes by correlating them with word processing times for thousands of words. This research has revealed considerable qu...

show abstract

“…All pseudoword stimuli were constructed using the Wuggy pseudoword generator (Keuleers & Brysbaert, 2010) and were five-letter monosyllables. The two critical target letters were the vowels A and O and they 2 Many participants from Experiment 1 could not be assessed on this test of vocabulary due to testing problems and so, comparisons of proficiency scores between experiments 1 and 2 cannot be provided.…”

Section: Materials and Proceduresmentioning

confidence: 99%