2021
DOI: 10.1007/s11227-021-03743-2
|View full text |Cite
|
Sign up to set email alerts
|

Considerations about learning Word2Vec

Abstract: Despite the large diffusion and use of embedding generated through Word2Vec, there are still many open questions about the reasons for its results and about its real capabilities. In particular, to our knowledge, no author seems to have analysed in detail how learning may be affected by the various choices of hyperparameters. In this work, we try to shed some light on various issues focusing on a typical dataset. It is shown that the learning rate prevents the exact mapping of the co-occurrence matrix, that Wo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
15
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
3
1

Relationship

0
10

Authors

Journals

citations
Cited by 55 publications
(21 citation statements)
references
References 25 publications
0
15
0
Order By: Relevance
“…Word -Word2vec Word2vec [36] is a word embedding approach that can be used to represent words as a vector. The process of creating a word dictionary was carried out utilizing a set of articles from the Python sklearn module's natural language tool kit.…”
Section: Process Resultsmentioning
confidence: 99%
“…Word -Word2vec Word2vec [36] is a word embedding approach that can be used to represent words as a vector. The process of creating a word dictionary was carried out utilizing a set of articles from the Python sklearn module's natural language tool kit.…”
Section: Process Resultsmentioning
confidence: 99%
“…Word2Vec is a popular NLP technique based on neural network to learn word embeddings from 100 billion words [11][12][13]. Google's Word2vec model has the ability to detect synonyms of words or suggest suitable words for an incomplete sentence.…”
Section: Word2vecmentioning
confidence: 99%
“…On Ctrip datasets, TF-IDF, LSI, Word2Vec [ 31 ], and Doc2vec [ 32 ] represent using these methods to extract data features and using Euclidean distance to calculate sample similarity. Jaccard represents using the Jaccard computing method to measure the similarity between text samples.…”
Section: Experiments and Analysismentioning
confidence: 99%