2019
DOI: 10.5281/zenodo.3585027
|View full text |Cite
|
Sign up to set email alerts
|

Models for "A data-driven approach to studying changing vocabularies in historical newspaper collections"

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 0 publications
0
1
0
Order By: Relevance
“…They were computed from many different corpora by using word2vec skip-gram with negative sampling. Later on, Hengchen et al [83] released a set of diachronic embeddings of the same type in English, Dutch, Finnish and Swedish trained on large corpora of 19C-20C newspapers. 30 More recently, Hengchen et al [84] pursued these efforts with the publication of diachronic word2vec and fastText models trained on a large corpus of Swedish OCRed newspapers (1645-1926) (the Kubhist 2 corpus, 5.5 billion tokens).…”
Section: Static Embeddingsmentioning
confidence: 99%
“…They were computed from many different corpora by using word2vec skip-gram with negative sampling. Later on, Hengchen et al [83] released a set of diachronic embeddings of the same type in English, Dutch, Finnish and Swedish trained on large corpora of 19C-20C newspapers. 30 More recently, Hengchen et al [84] pursued these efforts with the publication of diachronic word2vec and fastText models trained on a large corpus of Swedish OCRed newspapers (1645-1926) (the Kubhist 2 corpus, 5.5 billion tokens).…”
Section: Static Embeddingsmentioning
confidence: 99%