Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing 2017
DOI: 10.18653/v1/d17-1207
|View full text |Cite
|
Sign up to set email alerts
|

Earth Mover's Distance Minimization for Unsupervised Bilingual Lexicon Induction

Abstract: Cross-lingual natural language processing hinges on the premise that there exists invariance across languages. At the word level, researchers have identified such invariance in the word embedding semantic spaces of different languages. However, in order to connect the separate spaces, cross-lingual supervision encoded in parallel data is typically required. In this paper, we attempt to establish the cross-lingual connection without relying on any cross-lingual supervision. By viewing word embedding spaces as d… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
107
0
2

Year Published

2018
2018
2020
2020

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 117 publications
(110 citation statements)
references
References 37 publications
1
107
0
2
Order By: Relevance
“…There is evidence that different languages represent semantic concepts with similar structure leading to the structural isomorphism across word embedding spaces of different languages [3]. Viewing word embedding spaces as distributions, Zhang et al proposed to build the cross-lingual connection by minimizing their earth movers distance [4]. Differently, they achieve the alignment only by minimizing the distance between distributions while we find the combination of distribution-level and item-level distance minimization can better align the distributions.…”
Section: Introductionmentioning
confidence: 88%
See 1 more Smart Citation
“…There is evidence that different languages represent semantic concepts with similar structure leading to the structural isomorphism across word embedding spaces of different languages [3]. Viewing word embedding spaces as distributions, Zhang et al proposed to build the cross-lingual connection by minimizing their earth movers distance [4]. Differently, they achieve the alignment only by minimizing the distance between distributions while we find the combination of distribution-level and item-level distance minimization can better align the distributions.…”
Section: Introductionmentioning
confidence: 88%
“…where Γ(P 1 , P 2 ) denotes the set of all joint distributions γ(x, y) with marginals P 1 and P 2 . It can be considered as the continuous case of the Earth Mover's Distance, a powerful tool widely used in computer vision and natural language processing [4,8]. Intuitively, if each distribution is viewed as a unit amount of "dirt", earth mover's distance is the minimum "cost" of turning one pile into the other, which is assumed to The framework of SC-GANs.…”
Section: Preliminariesmentioning
confidence: 99%
“…For the first application, we have 5 pairs of languages: Chinese-English, Spanish-English, Italian-English, Japanese-Chinese, and Turkish-English. Given by (Zhang et al 2017), each language has a vocabulary list containing 3000 to 13000 words; we also follow their preprocessing idea to represent all the words by vectors in R 50 through the embedding technique (Mikolov, Le, and Sutskever 2013). Actually, each vocabulary list is represented by a distribution in the space where each vector has the weight equal to the corresponding frequency in the language.…”
Section: Methodsmentioning
confidence: 99%
“…The amount of required supervision was later reduced through self-learning methods (Artetxe et al, 2017), and then completely eliminated through adversarial training (Zhang et al, 2017a; or more robust iterative approaches combined with initialization heuristics (Artetxe et al, 2018b;Hoshen and Wolf, 2018). At the same time, several recent methods have formulated embedding mappings as an optimal transport problem (Zhang et al, 2017b;.…”
Section: Related Workmentioning
confidence: 99%