2022 3rd International Conference on Pattern Recognition and Machine Learning (PRML) 2022
DOI: 10.1109/prml56267.2022.9882227
|View full text |Cite
|
Sign up to set email alerts
|

Chinese-Uyghur Bilingual Lexicon Induction Based on Morpheme Sequence and Weak Supervision

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 0 publications
0
1
0
Order By: Relevance
“…By integrating cross-lingual representations with pre-trained word embeddings in a fully unsupervised initialization on UBLI, the proposed method outperforms existing state-of-the-art methods on low-resource language pairs. Addressing the poor alignment of Chinese-Uyghur cross-language word embeddings due to significant morphological differences, Aysa et al [13] proposed a multilingual morphological analyzer based on a morpheme sequence combined with neural network cross-language word embedding vector mapping, and used for Chinese-Uyghur bilingual dictionary extraction. They used robust morpheme segmentation and stemming of bilingual text data to obtain excellent and meaningful word semantic features.…”
Section: Bilingual Lexicon Inductionmentioning
confidence: 99%
“…By integrating cross-lingual representations with pre-trained word embeddings in a fully unsupervised initialization on UBLI, the proposed method outperforms existing state-of-the-art methods on low-resource language pairs. Addressing the poor alignment of Chinese-Uyghur cross-language word embeddings due to significant morphological differences, Aysa et al [13] proposed a multilingual morphological analyzer based on a morpheme sequence combined with neural network cross-language word embedding vector mapping, and used for Chinese-Uyghur bilingual dictionary extraction. They used robust morpheme segmentation and stemming of bilingual text data to obtain excellent and meaningful word semantic features.…”
Section: Bilingual Lexicon Inductionmentioning
confidence: 99%