2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2014
DOI: 10.1109/icassp.2014.6854362
|View full text |Cite
|
Sign up to set email alerts
|

Supervised domain adaptation for I-vector based speaker recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

2
126
0
1

Year Published

2014
2014
2021
2021

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 121 publications
(129 citation statements)
references
References 5 publications
2
126
0
1
Order By: Relevance
“…The dataset 'SRE-1phn' contains audio from only a single telephone number per speaker and use of such a poor phone number diversity hinders the effective estimation of within-speaker variability of in-domain. In this case, the conventional approaches [13], [14] that estimate within-speaker variability from in-domain unlabeled dataset would fail, in spite of perfect speaker label estimation, due to insufficient channel information. Singer [25] also tackled same issue and suggested dataset selection criteria to prevent this situation in advance.…”
Section: Adaptation Under Insufficient Channel Informationmentioning
confidence: 99%
See 4 more Smart Citations
“…The dataset 'SRE-1phn' contains audio from only a single telephone number per speaker and use of such a poor phone number diversity hinders the effective estimation of within-speaker variability of in-domain. In this case, the conventional approaches [13], [14] that estimate within-speaker variability from in-domain unlabeled dataset would fail, in spite of perfect speaker label estimation, due to insufficient channel information. Singer [25] also tackled same issue and suggested dataset selection criteria to prevent this situation in advance.…”
Section: Adaptation Under Insufficient Channel Informationmentioning
confidence: 99%
“…State-of-the-art techniques from other studies are examined for comparison. For Garcia-Romero's Interpolated approach [13] referred to as system 5 in Table III, the true speaker label is used for ideal case rather than clustering with AHC algorithm. Then, WC and AC from SWB and SRE-1phn are interpolated, as indicated in Table III by "SWB + SRE-1phn".…”
Section: Performance Comparison To State-of-the-art Techniquesmentioning
confidence: 99%
See 3 more Smart Citations