Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Com 2009
DOI: 10.3115/1620932.1620948
|View full text |Cite
|
Sign up to set email alerts
|

Modeling letter-to-phoneme conversion as a phrase based statistical machine translation problem with minimum error rate training

Abstract: Letter-to-phoneme conversion plays an important role in several applications. It can be a difficult task because the mapping from letters to phonemes can be many-to-many. We present a language independent letter-to-phoneme conversion approach which is based on the popular phrase based Statistical Machine Translation techniques. The results of our experiments clearly demonstrate that such techniques can be used effectively for letter-tophoneme conversion. Our results show an overall improvement of 5.8% over the… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2011
2011
2024
2024

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 19 publications
(8 citation statements)
references
References 22 publications
0
8
0
Order By: Relevance
“…However, this kind of writing for a word imposes challenges in the processing of the text. Letter-to-Phoneme (L2P) approach is suggested in many kinds of literature for solving lingos and slangs [29][30][31]. The CMU Pronouncing Dictionary (CMUDict) [32] is a machine-readable pronunciation dictionary contains over 125,000 words and their transcriptions.…”
Section: ) Lingoes and Slangsmentioning
confidence: 99%
“…However, this kind of writing for a word imposes challenges in the processing of the text. Letter-to-Phoneme (L2P) approach is suggested in many kinds of literature for solving lingos and slangs [29][30][31]. The CMU Pronouncing Dictionary (CMUDict) [32] is a machine-readable pronunciation dictionary contains over 125,000 words and their transcriptions.…”
Section: ) Lingoes and Slangsmentioning
confidence: 99%
“…In this equation, an input sequence seq(g i , x) is constructed by concatenating the focal grapheme g i with its left and right context information, so the length of this sequence is equal to (2x + 1). On the other hand, considering the correspondence between graphemes and phonemes as many-to-many has also been stated as a beneficial technique in many recent studies because it can cover all possible mappings between graphemes and phonemes (e.g., one-to-one, many-toone, one-to-many, and many-to-many) [10], [11], [13], [19]. These techniques inspired us to incorporate the contextdependent phoneme model into neural network-based G2P conversion.…”
Section: Mapping Technique Between Graphemes and Phonemesmentioning
confidence: 99%
“…This corpus, which contains many acronyms and loan words from different languages such as Japanese, French, and German, has been widely used by researchers [11]- [13]. It was originally created using 34 graphemic symbols (e.g., "A".…”
Section: Auto-aligned Cmudict Corpusmentioning
confidence: 99%
See 1 more Smart Citation
“…As a consequence, various many-to-many mapping techniques between letters and phonemes have been proposed subsequently, in order to improve the accuracy of G2P conversion. For example, Rama et al treated the letter-tosound conversion problems as a phrase-based statistical machine translation problem [13]. The hidden Markov model (HMM)-based approach with contextsensitive observations for G2P conversion [14], proposed by Ogbureke et al, obtained a word accuracy of 79.79% on the Unilex corpora containing the UK English words, but only a maximum of 57.85% for the CMUDict corpus [15] due to the large number of loan words and a few remarkable errors.…”
Section: Related Workmentioning
confidence: 99%