2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2012
DOI: 10.1109/icassp.2012.6288993
|View full text |Cite
|
Sign up to set email alerts
|

Grapheme-to-phoneme model generation for Indo-European languages

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
11
0
1

Year Published

2012
2012
2023
2023

Publication Types

Select...
4
3
1

Relationship

2
6

Authors

Journals

citations
Cited by 21 publications
(12 citation statements)
references
References 7 publications
0
11
0
1
Order By: Relevance
“…For languages with highly transparent orthographies such as Spanish or German (Goswani, 1999), g2p approaches typically work quite well (Schlippe et al, 2012). However, for languages with less transparent orthographies, such as English or French (Goswani, 1999), it is relatively difficult to derive simple mappings from the grapheme representation of a syllable to its phoneme representation.…”
Section: Phonetisaurusmentioning
confidence: 99%
See 1 more Smart Citation
“…For languages with highly transparent orthographies such as Spanish or German (Goswani, 1999), g2p approaches typically work quite well (Schlippe et al, 2012). However, for languages with less transparent orthographies, such as English or French (Goswani, 1999), it is relatively difficult to derive simple mappings from the grapheme representation of a syllable to its phoneme representation.…”
Section: Phonetisaurusmentioning
confidence: 99%
“…However, for languages with less transparent orthographies, such as English or French (Goswani, 1999), it is relatively difficult to derive simple mappings from the grapheme representation of a syllable to its phoneme representation. Therefore, g2p approaches tend to work less well (Schlippe et al, 2012).…”
Section: Phonetisaurusmentioning
confidence: 99%
“…They are used to train the systems by de scribing the pronunciation of words according to manageable units, typically phonemes [4]. Dictionaries can also be used to build generalized grapheme-to-phoneme (g2p) models, for the purpose of providing pronunciations for words that do not appear in the dictionary [5]. The production of dictionaries can be time-consuming and expensive if they are manually written by language experts.…”
Section: Introductionmentioning
confidence: 99%
“…Recently, methods have been proposed that employ pronunciations from the collective knowledge on the World Wide Web as training data for g2p models without a crosscheck of language experts [17]. In this case, the training data is expected to include a lot of noisy data, and empirically, in [17], this degrades the performance of the speech recognition system in exchange for improvements of cost and time required for dictionary construction.…”
Section: Introductionmentioning
confidence: 99%
“…In this case, the training data is expected to include a lot of noisy data, and empirically, in [17], this degrades the performance of the speech recognition system in exchange for improvements of cost and time required for dictionary construction. When this sort of noisy data is used to train a g2p system, it is extremely important to have an approach that is highly accurate and robust to overfitting.…”
Section: Introductionmentioning
confidence: 99%