2018
DOI: 10.1007/978-3-030-01771-2_13
|View full text |Cite
|
Sign up to set email alerts
|

Exploiting the Web for Semantic Change Detection

Abstract: Detecting significant linguistic shifts in the meaning and usage of words has gained more attention over the last few years. Linguistic shifts are especially prevalent on the Internet, where words' meaning can change rapidly. In this work, we describe the construction of a large diachronic corpus that relies on the UK Web Archive and we propose a preliminary analysis of semantic change detection exploiting a particular technique called Temporal Random Indexing. Results of the evaluation are promising and give … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
18
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
3
2
2

Relationship

1
6

Authors

Journals

citations
Cited by 16 publications
(18 citation statements)
references
References 16 publications
0
18
0
Order By: Relevance
“…Early work on semantic change detection relied primarily on the comparison of word frequency and co-occurrence patterns between words at different time intervals (Sagi et al, 2009;Cook and Stevenson, 2010;Gulordava and Baroni, 2011), most often representing a single word based on its context (Mihalcea and Nastase, 2012;Jatowt and Duh, 2014;Basile and McGillivray, 2018). Recently, word embeddings have become the common practice for constructing word representations in NLP (Mikolov et al, 2013).…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…Early work on semantic change detection relied primarily on the comparison of word frequency and co-occurrence patterns between words at different time intervals (Sagi et al, 2009;Cook and Stevenson, 2010;Gulordava and Baroni, 2011), most often representing a single word based on its context (Mihalcea and Nastase, 2012;Jatowt and Duh, 2014;Basile and McGillivray, 2018). Recently, word embeddings have become the common practice for constructing word representations in NLP (Mikolov et al, 2013).…”
Section: Related Workmentioning
confidence: 99%
“…The UK Web Domain Dataset 1996-2013 (JISC-UK) contains textual information published in UK-based websites over the time period 1996-2013, thus facilitating the task of semantic change detection in a short-term and fine-grained temporal resolution (Basile and McGillivray, 2018).…”
Section: Jisc-uk Datasetmentioning
confidence: 99%
See 2 more Smart Citations
“…Finally, we measure the level of semantic change of a word by means of the average cosine similarity between the predicted and actual word representations at each time step of the decoder. Model performance is assessed via rank-based metrics (Basile and McGillivray, 2018;Tsakalidis et al, 2019;Shoemark et al, 2019).…”
Section: Artificial Data Experimentsmentioning
confidence: 99%