Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing 2018
DOI: 10.18653/v1/d18-1033
|View full text |Cite
|
Sign up to set email alerts
|

Cross-lingual Lexical Sememe Prediction

Abstract: Sememes are defined as the minimum semantic units of human languages. As important knowledge sources, sememe-based linguistic knowledge bases have been widely used in many NLP tasks. However, most languages still do not have sememe-based linguistic knowledge bases. Thus we present a task of cross-lingual lexical sememe prediction, aiming to automatically predict sememes for words in other languages. We propose a novel framework to model correlations between sememes and multilingual words in low-dimensional sem… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
18
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
4
3
1

Relationship

2
6

Authors

Journals

citations
Cited by 24 publications
(18 citation statements)
references
References 34 publications
0
18
0
Order By: Relevance
“…Sememe prediction is a well-defined task Jin et al, 2018;Qi et al, 2018), aimed at selecting appropriate sememes for unannotated words or phrases from the set of all the sememes. Existing works model sememe prediction as a multi-label classification problem, where sememes are regarded as the labels of words and phrases.…”
Section: Training For Mwe Sememe Predictionmentioning
confidence: 99%
See 1 more Smart Citation
“…Sememe prediction is a well-defined task Jin et al, 2018;Qi et al, 2018), aimed at selecting appropriate sememes for unannotated words or phrases from the set of all the sememes. Existing works model sememe prediction as a multi-label classification problem, where sememes are regarded as the labels of words and phrases.…”
Section: Training For Mwe Sememe Predictionmentioning
confidence: 99%
“…We use the above-mentioned test set for evaluation. As for the evaluation protocol, we adopt mean average precision (MAP) and F1 score following previous sememe prediction works Qi et al, 2018). Since our SC models and baseline methods yield a score for each se-meme in the whole sememe set, we pick the sememes with scores higher than δ to compute F1 score, where δ is a hyper-parameter and also tuned to the best on the validation set.…”
Section: Evaluation Dataset and Protocolmentioning
confidence: 99%
“…Some work tries to expand HowNet by predicting sememes for new words (Xie et al 2017;Jin et al 2018). To the best of our knowledge, only Qi et al (2018) make an attempt to build a sememe KB for another language by cross-lingual lexical sememe prediction (CLSP). They learn bilingual word embeddings in a unified semantic space, and then predict sememes for target words according to their meaning-similar words in the sememeannotated language.…”
Section: Related Workmentioning
confidence: 99%
“…However, building a sememe KB for a new language from scratch is time-consuming and laborintensive -the construction of HowNet takes several linguistic experts more than two decades. To tackle this challenge, Qi et al (2018) present the task of cross-lingual lexical sememe prediction (CLSP), aiming to facilitate the construction of a new language's sememe KB by predicting sememes for words in that language. However, CLSP can pre-A BabelNet synset sememes annotate bn:00045106n en: husband, hubby zh: , , , fr: mari, époux, marié de: Ehemann, Gemahl, Gatte …… human family male spouse Figure 2: Annotating sememes for the BabelNet synset whose ID is bn:00045106n.…”
Section: Introductionmentioning
confidence: 99%
“…The method can alleviate the problem of large errors in the word vectors for words with fewer frequencies in the corpus. Based on the complementarity of different languages, Qi, F., et al [24] establishes the association between semantics and cross-lingual words in the low-dimensional semantic space, and thus improves the ability of semantics prediction. Although the above work is very innovative, the employed knowledge is not very closed with sememes, and there is still a gap between the predicted results and the sememes that should be assigned.…”
Section: Related Workmentioning
confidence: 99%