2019
DOI: 10.1016/j.eswa.2019.06.021
|View full text |Cite
|
Sign up to set email alerts
|

Cross-lingual word analogies using linear transformations between semantic spaces

Abstract: We generalize the word analogy task across languages, to provide a new intrinsic evaluation method for cross-lingual semantic spaces. We experiment with six languages within different language families, including English, German, Spanish, Italian, Czech, and Croatian. State-of-the-art monolingual semantic spaces are transformed into a shared space using dictionaries of word translations. We compare several linear transformations and rank them for experiments with monolingual (no transformation), bilingual (one… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
3
1

Relationship

2
6

Authors

Journals

citations
Cited by 16 publications
(11 citation statements)
references
References 22 publications
0
11
0
Order By: Relevance
“…Our software allows using of two methods of transformations. The transformation can be done by a modification of the orthogonal transformation [2] or by Canonical Correlation Analysis (CCA) using the implementation from [3]. The goal of the transformation is shown in Fig.…”
Section: Approach Descriptionmentioning
confidence: 99%
“…Our software allows using of two methods of transformations. The transformation can be done by a modification of the orthogonal transformation [2] or by Canonical Correlation Analysis (CCA) using the implementation from [3]. The goal of the transformation is shown in Fig.…”
Section: Approach Descriptionmentioning
confidence: 99%
“…We train a separate semantic space for each corpus, and subsequently, we map these two spaces into one common cross-lingual space. We use methods for cross-lingual mapping (Brychcín et al, 2019;Artetxe et al, 2016;Artetxe et al, 2018a;Artetxe et al, 2018b) and thanks to the large similarity between L 1 and L 2 the quality of transformation should be high. We compute cosine similarity of the transformed word vectors to classify whether the target words changed their sense.…”
Section: Overviewmentioning
confidence: 99%
“…We perform a cross-lingual mapping of the two vector spaces, getting two matrices Xs and Xt projected into a shared space. We select two methods for the crosslingual mapping Canonical Correlation Analysis (CCA) using the implementation from (Brychcín et al, 2019) and a modification of the Orthogonal Transformation from VecMap (Artetxe et al, 2018b). Both of these methods are linear transformations.…”
Section: System Descriptionmentioning
confidence: 99%
“…We train separate semantic space for each corpus and subsequently, we map these two spaces into one common cross-lingual space. We use methods for cross-lingual mapping (Brychcín et al, 2019;Artetxe et al, 2016;Artetxe et al, 2017;Artetxe et al, 2018a;Artetxe et al, 2018b) and thanks to the large similarity between L 1 and L 2 the quality of transformation should be high. We compute cosine similarity to classify and rank the target words, see Section 3 for details.…”
Section: Introductionmentioning
confidence: 99%