1996
DOI: 10.1007/bf00115137
|View full text |Cite
|
Sign up to set email alerts
|

Word frequency distributions and lexical semantics

Abstract: This paper addresses the relation between meaning, lexical productivity, and frequency of use. Using density estimation as a visualization tool, we show that differences in semantic structure can be reflected in probability density functions estimated for word frequency distributions. We call attention to an example of a bimodal density, and suggest that bimodality arises when distributions of well-entrenched lexical items, which appear to be lognormal, are mixed with distributions of productively created nonc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

1999
1999
2024
2024

Publication Types

Select...
3
2
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 33 publications
(2 citation statements)
references
References 21 publications
0
2
0
Order By: Relevance
“…To facilitate comparison with surprisal above, frequency effects are represented on a unigram surprisal scale, that is, the negative log prior probability of the word. This measure is equivalent (modulo an additive constant) to the negation of the log frequency scale regularly used in psycholinguistics (Baayen & Lieber, 1996 ; Baayen et al, 2016 ; Balota & Chumbley, 1984 ; Carrol, 1967 ; Demberg & Keller, 2008 ; Norris, 2006 ; Rumelhart & Siple, 1974 ). Unigram surprisals are estimated using KenLM (Heafield et al, 2013 ) with default hyperparameters.…”
Section: Methodsmentioning
confidence: 99%
“…To facilitate comparison with surprisal above, frequency effects are represented on a unigram surprisal scale, that is, the negative log prior probability of the word. This measure is equivalent (modulo an additive constant) to the negation of the log frequency scale regularly used in psycholinguistics (Baayen & Lieber, 1996 ; Baayen et al, 2016 ; Balota & Chumbley, 1984 ; Carrol, 1967 ; Demberg & Keller, 2008 ; Norris, 2006 ; Rumelhart & Siple, 1974 ). Unigram surprisals are estimated using KenLM (Heafield et al, 2013 ) with default hyperparameters.…”
Section: Methodsmentioning
confidence: 99%
“…To facilitate comparison with surprisal above, frequency effects are represented on a unigram surprisal scale, that is, the negative log prior probability of the word. This measure is equivalent (modulo an additive constant) to the negation of the log frequency scale regularly used in psycholinguistics (Carrol, 1967;Rumelhart and Siple, 1974;Balota and Chumbley, 1984;Baayen and Lieber, 1996;Norris, 2006;Demberg and Keller, 2008;Baayen et al, 2016). Unigram surprisals are estimated using KenLM (Heafield et al, 2013) with default hyperparameters.…”
Section: Critical Predictorsmentioning
confidence: 99%