2011
DOI: 10.1007/s12530-011-9034-1
|View full text |Cite
|
Sign up to set email alerts
|

Adaptive systems for unsupervised speaker tracking and speech recognition

Abstract: Speech recognition offers an intuitive and convenient interface to control technical devices. Improvements achieved through ongoing research activities enable the user to handle increasingly complex tasks via speech. For special applications, e.g. dictation, highly sophisticated techniques have been developed to yield high recognition accuracy. Many use cases, however, are characterized by changing conditions such as different speakers or time-variant environments. A manifold of approaches has been published t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2012
2012
2017
2017

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 45 publications
0
2
0
Order By: Relevance
“…In essence, it is a Bayesian evolving system that adapts to the model of the input audio sources. In [23], Herbig et al proposed an adaptive scheme for a speech recognition system so as to adapt to new speakers. This is similar to the parameter β j (t), which essentially adapts to each source signal, in order to sort out the permutation ambiguity.…”
Section: Likelihood Ratio Jumpmentioning
confidence: 99%
“…In essence, it is a Bayesian evolving system that adapts to the model of the input audio sources. In [23], Herbig et al proposed an adaptive scheme for a speech recognition system so as to adapt to new speakers. This is similar to the parameter β j (t), which essentially adapts to each source signal, in order to sort out the permutation ambiguity.…”
Section: Likelihood Ratio Jumpmentioning
confidence: 99%
“…Combining speaker adaptation and speaker tracking may be advantageous, because it allows a system to adapt to more than one user at the same time. Authors in [21] have extended a standard speech recognizer by combining speaker specific speech decoding with speaker identification in an efficient manner. Approximately 20% relative error rate reduction and about 94.6% identification rate are reported.…”
Section: Introductionmentioning
confidence: 99%