2022
DOI: 10.3233/jifs-212179
|View full text |Cite|
|
Sign up to set email alerts
|

RETRACTED: Testing of statistical significance of semantic changes detected by diachronic word embedding

Abstract: In recent years, methods based on word embedding models have been widely used for solving problems of semantic change estimation. The models are trained on text corpora of various years. Semantic change is detected by analyzing changes in distance between words using vector space alignment or by analyzing changes in a set of words that are most similar in meaning to a target word. Testing for statistical significance of the detected effects has not been detailly discussed in previous studies. This paper focuse… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
2
2

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 5 publications
0
3
0
Order By: Relevance
“…In such case, the correctness of the conclusions made in the work depends on the validity of the assumption put forward. Note that there is no theoretical argument in favor of the power-law dependence in the literature for this case, and the results of model experiments also do not confirm it (see [33]).…”
Section: Accounting For the Impact Of Word Frequencymentioning
confidence: 73%
See 1 more Smart Citation
“…In such case, the correctness of the conclusions made in the work depends on the validity of the assumption put forward. Note that there is no theoretical argument in favor of the power-law dependence in the literature for this case, and the results of model experiments also do not confirm it (see [33]).…”
Section: Accounting For the Impact Of Word Frequencymentioning
confidence: 73%
“…It is known that for all types of word embeddings, semantic difference estimates strongly depend on word frequency [3,5,32,31,33]. As pointed out in [5], this effect must necessarily be taken into account when comparing the rate of semantic changes for different words.…”
Section: Accounting For the Impact Of Word Frequencymentioning
confidence: 99%
“…In addition to the model parameters and settings in the training process, a significant role in estimating the quality of models can be played by random distribution of weights when initializing the model at the beginning of the training. Work [9] shows that repeated training of word2vec models allows one to estimate confidence intervals for analyzing significance of changes in semantic distances between words.…”
Section: Introductionmentioning
confidence: 99%