Pseudowords are crucial in (psycho)linguistic research, offering a way to study language without meaning interfering. While various methods for pseudoword creation exist, each has its limitations. Traditional approaches create pseudowords by modifying existing words, potentially leading to unintended recognition of the original words. Modern techniques, using high-frequency bigrams or syllable deconstruction, often require linguistic expertise and may not be tailored specifically for Greek, which is our primary focus. Hence, we introduce SyBig (r)- Morph, a novel algorithm for generating Greek pseudowords with unique features that distinguish it from existing pseudoword generation algorithms.
SyBig (r)-Morph constructs pseudowords using syllables as the main building block, replicating in this manner the linguistic patterns of Greek words. Furthermore, it is flexible and can be applied to extensive wordlists/databases categorizing syllables by position to ensure phonotactic consistency with user-selected morphosyntactic categories. Another important feature of SyBig (r)-Morph is that it employs dual lexicons to eliminate phonologically invalid combinations.
We also outline a specialized protocol for further evaluating the phonological wellformedness of Greek pseudowords, focusing on pseudo-nouns for a production experiment that investigates diverse accentual patterns within and across noun classes. Using an annotated version of the Clean Corpus and the Num Tool, we ensure that these pseudowords adhere to Greek phonotactics and do not closely resemble native words. Selection of essential parameters, including bigram token and type frequency, phonological neighbors, and Levenshtein distance, guarantees accurate phonological well-formedness assessment. This meticulous process results in a curated set of 240 pseudowords for our linguistic experiment.