2021
DOI: 10.1007/s12559-021-09850-9
|View full text |Cite
|
Sign up to set email alerts
|

Pronunciation-Enhanced Chinese Word Embedding

Abstract: Chinese word embeddings have recently garnered considerable attention. Chinese characters and their sub-character components, which contain rich semantic information, are incorporated to learn Chinese word embeddings. Chinese characters can represent a combination of meaning, structure, and pronunciation. However, existing embedding learning methods focus on the structure and meaning of Chinese characters. In this study, we aim to develop an embedding learning method that can make complete use of the informati… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
8
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 9 publications
(8 citation statements)
references
References 51 publications
0
8
0
Order By: Relevance
“…Irregularities in the phoneme-to-grapheme direction are even more prevalent: on average, five different characters represent one tone syllable ( Chen and Pasquarella, 2017 ). Furthermore, there are characters with more than one pronunciation called polyphonic characters ( Yang et al, 2021 ). Examples of polyphonic characters include “了,” which can be pronounced /le4/or/liao/, and “差,” which can be pronounced in four ways: /cha1/, /cha4/, /chai1/, and/ci1/.…”
Section: Literature Reviewmentioning
confidence: 99%
“…Irregularities in the phoneme-to-grapheme direction are even more prevalent: on average, five different characters represent one tone syllable ( Chen and Pasquarella, 2017 ). Furthermore, there are characters with more than one pronunciation called polyphonic characters ( Yang et al, 2021 ). Examples of polyphonic characters include “了,” which can be pronounced /le4/or/liao/, and “差,” which can be pronounced in four ways: /cha1/, /cha4/, /chai1/, and/ci1/.…”
Section: Literature Reviewmentioning
confidence: 99%
“…Due to its success in modelling English documents, word embedding has been applied to Chinese text. Benefiting from the internal structural information of Chinese characters, many studies tried to enhance the quality of Chinese word embeddings with radicals [30][31][32], subword components [33,34], glyph features [35], strokes [36], and pronunciation [37]. To limit the scope of this paper, we choose Skip-gram because, after comparing the word embedding model established by the two corpora used in this experiment, we found Skip-gram to have the best performance on average.…”
Section: The Model Architectures For Word Embeddingmentioning
confidence: 99%
“…Due to its success in modelling English documents, word embedding has been applied to Chinese text. Benefiting from the internal structural information of Chinese characters, many studies tried to enhance the quality of Chinese word embeddings with radicals [30][31][32], sub-word components [33,34], glyph features [35], strokes [36], and pronunciation [37]. To limit the scope of this paper, we choose Skip-gram because, after comparing the word embedding model established by the two corpora used in this experiment, we found Skip-gram to have the best performance on average.…”
Section: The Model Architectures For Word Embeddingmentioning
confidence: 99%