2014
DOI: 10.5614/itbj.ict.res.appl.2014.8.2.6
|View full text |Cite
|
Sign up to set email alerts
|

New Grapheme Generation Rules for Two-Stage Modelbased Grapheme-to-Phoneme Conversion

Abstract: The precise conversion of arbitrary text into its corresponding phoneme sequence (grapheme-to-phoneme or G2P conversion) is implemented in speech synthesis and recognition, pronunciation learning software, spoken term detection and spoken document retrieval systems. Because the quality of this module plays an important role in the performance of such systems and many problems regarding G2P conversion have been reported, we propose a novel two-stage model-based approach, which is implemented using an existing w… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2016
2016
2017
2017

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(7 citation statements)
references
References 15 publications
0
7
0
Order By: Relevance
“…Over the last few years, it has proven considerably difficult to improve the performance of a G2P conversion model for OOV words, because each method or approach is uniquely designed using different techniques to address particular challenges. It is seemingly impossible to utilize any single method to deal comprehensively with the host of problems encountered by G2P conversion [12]. Therefore, we designed a PTN-based architecture for G2P conversion that allows many different methods to be applied together, in order to deal broadly with the various problems.…”
Section: Ptn Generation Using Multiple Phoneme Sequencesmentioning
confidence: 99%
See 4 more Smart Citations
“…Over the last few years, it has proven considerably difficult to improve the performance of a G2P conversion model for OOV words, because each method or approach is uniquely designed using different techniques to address particular challenges. It is seemingly impossible to utilize any single method to deal comprehensively with the host of problems encountered by G2P conversion [12]. Therefore, we designed a PTN-based architecture for G2P conversion that allows many different methods to be applied together, in order to deal broadly with the various problems.…”
Section: Ptn Generation Using Multiple Phoneme Sequencesmentioning
confidence: 99%
“…Even if the reversed g-p sequences can make a complementary model that can generate an additional phonemesequence hypothesis for each source method, the risk from combining different methods nevertheless remains. Hence, we introduce a novel use of grapheme generation rules (GGRs) [12] to minimize this risk. This allows us to use only a single method for implementing a PTN-based G2P conversion model.…”
Section: Reducing the Number Of Required Source Modelsmentioning
confidence: 99%
See 3 more Smart Citations