Interspeech 2013 2013
DOI: 10.21437/interspeech.2013-451
|View full text |Cite
|
Sign up to set email alerts
|

G2p variant prediction techniques for ASR and STD

Abstract: Introducing pronunciation variants into a lexicon is a balancing act: incorporating necessary variants can improve automatic speech recognition (ASR) and spoken term detection (STD) performance by capturing some of the variability that occurs naturally; introducing superfluous variants can lead to increased confusability and a decrease in performance. We experiment with two very different grapheme-to-phoneme variant prediction techniques and analyze the variants generated, as well as their effect when used wit… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2017
2017
2019
2019

Publication Types

Select...
2
1

Relationship

2
1

Authors

Journals

citations
Cited by 3 publications
(5 citation statements)
references
References 13 publications
0
5
0
Order By: Relevance
“…Also, we employed log-probability voting to select the final language of origin of a word as described in [14]. Training data was obtained from the NCHLT-inlang [13,5] dictionaries, providing 15,000 unique words per language. Data was preprocessed by removing words with fewer than two characters, converting all characters to lowercase and removing any loan words that are easily identifiable based on any foreign characters included in the word.…”
Section: T-lid Using Jsmsmentioning
confidence: 99%
“…Also, we employed log-probability voting to select the final language of origin of a word as described in [14]. Training data was obtained from the NCHLT-inlang [13,5] dictionaries, providing 15,000 unique words per language. Data was preprocessed by removing words with fewer than two characters, converting all characters to lowercase and removing any loan words that are easily identifiable based on any foreign characters included in the word.…”
Section: T-lid Using Jsmsmentioning
confidence: 99%
“…The G2P accuracy for variant-based dictionaries is analysed using four metrics defined in [1], namely variantbased phone accuracy (V-PA), variant-based word accuracy (V-WA), single-best phone accuracy (S-PA) and single-best word accuracy (S-WA). (Also see [2].…”
Section: A Performance Measuresmentioning
confidence: 99%
“…The dictionary is typically developed manually, using human expertise, or created in a data-driven manner from a training sample. Capturing acoustic variabilities, such as dialect, differences in semantics or accents, could result in incorporating different pronunciation variants into the dictionary [1]. Capturing pronunciation variation in a dictionary is controversial as, in practice, it can either increase or decrease ASR performance.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations