Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2014
DOI: 10.3115/v1/p14-2089

Improving Lexical Embeddings with Semantic Knowledge

Abstract: Word embeddings learned on unlabeled data are a popular tool in semantics, but may not capture the desired semantics. We propose a new learning objective that incorporates both a neural language model objective (Mikolov et al., 2013) and prior knowledge from semantic resources to learn improved lexical semantic embeddings. We demonstrate that our embeddings improve over those learned solely on raw text in three settings: language modeling, measuring semantic similarity, and predicting human judgements.
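
Read concretely, the abstract describes augmenting the usual neural language-model objective with a second term driven by a semantic resource. Below is a minimal sketch of what such a joint objective could look like, assuming a CBOW-style log-likelihood plus a constraint term over resource-derived word pairs; the trade-off weight C and the relation sets R_w are illustrative, not necessarily the paper's exact formulation:

```latex
% Sketch of a joint objective: neural LM term + semantic-resource term.
% R_w = words related to w in the resource (e.g., WordNet synonyms, PPDB paraphrases);
% C balances the corpus-driven and knowledge-driven terms.
J(\theta) \;=\;
  \underbrace{\frac{1}{T}\sum_{t=1}^{T}\log p\bigl(w_t \mid w_{t-c},\dots,w_{t+c}\bigr)}_{\text{CBOW language-model objective}}
  \;+\;
  C\,\underbrace{\frac{1}{N}\sum_{w=1}^{N}\sum_{w'\in R_w}\log p\bigl(w \mid w'\bigr)}_{\text{prior knowledge from the resource}}
```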

Cited by 241 publications (212 citation statements)
References 5 publications

“…Lexical databases like WordNet or sets of synonyms like the MyThes thesaurus can be used during learning or in a post-processing step to specialize word embeddings. For example, Yu and Dredze (2014) include prior knowledge about synonyms from WordNet and the Paraphrase Database in a joint model built upon Word2vec. Faruqui et al. (2015) introduce a graph-based retrofitting method where they post-process learned vectors with respect to semantic relationships extracted from additional lexical resources.…”
Section: Using External Resources (mentioning)
confidence: 99%
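
As a concrete illustration of the post-processing route mentioned in this statement, below is a minimal retrofitting-style sketch in the spirit of Faruqui et al. (2015): each vector is repeatedly pulled toward the average of its neighbours in a lexical graph while staying anchored to its original value. The function name, the uniform weights alpha and beta, and the fixed iteration count are assumptions for illustration, not the exact published procedure.

```python
import numpy as np

def retrofit(vectors, lexicon, iterations=10, alpha=1.0, beta=1.0):
    """Pull each vector toward its lexicon neighbours (illustrative sketch).

    vectors: dict word -> np.ndarray, the original embeddings (kept fixed)
    lexicon: dict word -> list of related words (e.g., WordNet synonyms)
    """
    new_vectors = {w: v.copy() for w, v in vectors.items()}
    for _ in range(iterations):
        for word, neighbours in lexicon.items():
            neighbours = [n for n in neighbours if n in new_vectors]
            if word not in new_vectors or not neighbours:
                continue
            # Weighted average of the original vector and the current neighbour vectors.
            total = beta * vectors[word] + alpha * sum(new_vectors[n] for n in neighbours)
            new_vectors[word] = total / (beta + alpha * len(neighbours))
    return new_vectors
```

The joint-training alternative instead modifies the objective itself, along the lines of the sketch given after the abstract above.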
“…Recent approaches have proposed to tackle this issue using an attentive model for context selection (Ling et al., 2015), or by using external sources, such as knowledge graphs, in order to improve the embeddings. Similarities derived from such resources are part of the objective function during the learning phase (Yu and Dredze, 2014; Kiela et al., 2015) or used in a retrofitting scheme (Faruqui et al., 2015). These approaches tend to specialize the embeddings to the resource used and its associated similarity measures, while the construction and maintenance of these resources are complex, time-consuming, and error-prone tasks.…”
Section: Introduction (mentioning)
confidence: 99%
“…However, we often need embeddings to be similar only if an exact lexico-semantic relation holds between the words. Numerous methods for specializing word embeddings for particular relations have been proposed (Yu and Dredze, 2014; Faruqui et al., 2015; Kiela et al., 2015; Mrkšić et al., 2016, inter alia), primarily aiming to differentiate synonymic similarity from other types of semantic relatedness.…”
Section: Related Work (mentioning)
confidence: 99%
“…Yu and Dredze (2014) extend the CBOW objective with synonymy constraints from WordNet and the Paraphrase Database (PPDB) (Ganitkevitch et al., 2013). Similarly, Kiela et al. (2015) add synonyms as additional contexts for the skip-gram objective.…”
Section: Related Work (mentioning)
confidence: 99%
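
The two strategies in this statement differ mainly in where the resource enters the pipeline. A rough sketch of the second one, assuming synonyms are simply emitted as extra (target, context) training pairs alongside the ordinary window-based pairs; the function name, the choice to sample one synonym per occurrence, and the window size are illustrative assumptions rather than Kiela et al.'s exact setup:

```python
import random

def skipgram_pairs_with_synonyms(tokens, synonyms, window=2):
    """Yield (target, context) pairs; synonyms serve as extra contexts (sketch).

    tokens: list of corpus words
    synonyms: dict word -> list of synonyms from a lexical resource
    """
    for i, target in enumerate(tokens):
        # Ordinary skip-gram contexts from the surrounding window.
        for j in range(max(0, i - window), min(len(tokens), i + window + 1)):
            if j != i:
                yield target, tokens[j]
        # Additional synthetic context drawn from the resource.
        if synonyms.get(target):
            yield target, random.choice(synonyms[target])
```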