2009 IEEE International Conference on Acoustics, Speech and Signal Processing 2009
DOI: 10.1109/icassp.2009.4960603
|View full text |Cite
|
Sign up to set email alerts
|

Unsupervised speaker adaptation for telephone call transcription

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2012
2012
2018
2018

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(7 citation statements)
references
References 8 publications
0
6
0
Order By: Relevance
“…The conventional adaptation techniques can be handled using two types of learning approach: supervised and unsupervised (Matsui and Furui, 1998;Wallace et al, 2009). In supervised adaptation, prior knowledge of adaptation data, such as speaker information or manually defined labels, is required.…”
Section: Conventional Adaptation Approachesmentioning
confidence: 99%
“…The conventional adaptation techniques can be handled using two types of learning approach: supervised and unsupervised (Matsui and Furui, 1998;Wallace et al, 2009). In supervised adaptation, prior knowledge of adaptation data, such as speaker information or manually defined labels, is required.…”
Section: Conventional Adaptation Approachesmentioning
confidence: 99%
“…Transfer learning without any labeled data from the target domain is referred to as unsupervised transfer learning. Motivated by the success of unsupervised transfer learning for speaker adaptation (Chen et al, 2011;Wallace et al, 2009) and spoken document summarization (Lee et al, 2013), we further investigate whether unsupervised transfer learning is feasible for QA.…”
Section: Transfer Learning For Qamentioning
confidence: 99%
“…To date, direct matching of long phonetic sequences has been demonstrated only in rather restricted experimental settings (notably, in English, and with a "topically clumpy" test collection in which people talk about a limited range of predefined topics). Considerable work needs to be done to determine whether the salient phonetic differentiations in specific languages are adequately captured by existing phonetic recognizers, the degree to which we can leverage long-term use of the same mobile phone by the same user to perform unsupervised adaptation to the way a specific searcher (or content producer) speaks [30], whether "query-talk" will match the content to be searched adequately well, and how well these techniques will work with richer and more representative collections.…”
Section: Designmentioning
confidence: 99%