One of the central problems in the semantics of derived words is polysemy (see, for example, the recent contributions by Lieber 2016 and Plag et al. 2018 ). In this paper, we tackle the problem of disambiguating newly derived words in context by applying Distributional Semantics ( Firth 1957 ) to deverbal -ment nominalizations (e.g. bedragglement, emplacement). We collected a dataset containing contexts of low frequency deverbal -ment nominalizations (55 types, 406 tokens, see Appendix B) extracted from large corpora such as the Corpus of Contemporary American English. We chose low frequency derivatives because high frequency formations are often lexicalized and thus tend to not exhibit the kind of polysemous readings we are interested in. Furthermore, disambiguating low-frequency words presents an especially difficult task because there is little to no prior knowledge about these words from which their semantic properties can be extrapolated. The data was manually annotated according to eventive vs. non-eventive interpretations, allowing also an ambiguous label in those cases where the context did not disambiguate. Our question then was to what extent, and under which conditions, context-derived representations such as those of Distributional Semantics can be successfully employed in the disambiguation of low-frequency derivatives. Our results show that, first, our models are able to distinguish between eventive and non-eventive readings with some success. Second, very small context windows are sufficient to find the intended interpretation in the majority of cases. Third, ambiguous instances tend to be classified as events. Fourth, the performance of the classifier differed for different subcategories of nouns, with non-eventive derivatives being harder to classify correctly. We present indirect evidence that this is due to the semantic similarity of abstract non-eventive nouns to eventive nouns. Overall, this paper demonstrates that distributional semantic models can be fruitfully employed for the disambiguation of low frequency words in spite of the scarcity of available contextual information. 1
No abstract
Nominalisation is a morphological process producing a noun on the basis of an input that may belong to various categories. As noun is a syntactic category, whether something is a noun can only be decided on the basis of syntactic evidence, not on the basis of its meaning or morphological behaviour. As a theoretical framework, I use Jackendoff’s Parallel Architecture (PA) as a basis, but I argue for a separate word formation component. The central difference between word formation rules and regular lexical entries is that the latter contribute to the production of the representation of interpreted performance, whereas the former produce new lexical entries, thus changing competence. As a consequence, an expression in interpreted performance needs the identification of a reference in the communicative context, whereas word formation needs the identification of a concept in a speaker’s knowledge, which involves onomasiological coercion. A distinction can be made between two types of nominalisation, which I illustrate with Dutch examples. In one type, the meaning is changed, e.g. jager ‘hunter’ from jagen ‘huntv’, in the other it is not, e.g. telling ‘countn’ from tellen ‘countv’. Nominalisations of the latter type are transpositions. This distinction can be made both for deverbal and for deadjectival nouns. Rules changing representations in PA can be classified in seven classes according to which of the structures they modify. Only those that change conceptual structure qualify for being part of the word formation component. This excludes transpositions. Many nouns can be interpreted as either a transposition or a result of word formation. An example is vertaling ‘translation’, which can refer to the process or the result of translation. I argue that there is a word formation rule that produces the second reading on the basis of the first, and show that this rule belongs to a type that is predicted by the typology of rules for modifying representations in PA.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.