2014
DOI: 10.1186/s13636-014-0040-7
|View full text |Cite
|
Sign up to set email alerts
|

A sub-band-based feature reconstruction approach for robust speaker recognition

Abstract: Although the field of automatic speaker or speech recognition has been extensively studied over the past decades, the lack of robustness has remained a major challenge. The missing data technique (MDT) is a promising approach. However, its performance depends on the correlation across frequency bands. This paper presents a new reconstruction method for feature enhancement based on the trait. In this paper, the degree of concentration across frequency bands is measured with principal component analysis (PCA). T… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2016
2016
2017
2017

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 26 publications
0
4
0
Order By: Relevance
“…Features are reconstructed from associated sub-bands, and then recombined. Compared with the full-band method [20], the proposed sub-bands method [31] showed a superior performance. The performances of these systems are mostly dependent on the accuracy of the noise estimation.…”
Section: Robust Features Against Additive Noisementioning
confidence: 90%
See 2 more Smart Citations
“…Features are reconstructed from associated sub-bands, and then recombined. Compared with the full-band method [20], the proposed sub-bands method [31] showed a superior performance. The performances of these systems are mostly dependent on the accuracy of the noise estimation.…”
Section: Robust Features Against Additive Noisementioning
confidence: 90%
“…In [20], an IBM is constructed by estimating local SNR of elements in the time-frequency matrix representation of the speech. Instead of reconstruction, the noisy features using the whole spectrum, sub-bands are considered in [31]. Features are reconstructed from associated sub-bands, and then recombined.…”
Section: Robust Features Against Additive Noisementioning
confidence: 99%
See 1 more Smart Citation
“…Similar to the VAD, vowellike regions are used in [29] and improved in [30] by including the non-vowel-like regions. Missing data approach is also investigated in several studies [31][32][33][34], where a binary time-frequency mask is constructed for the noisy spectrum to indicate reliable and unreliable features. The unreliable features are then reconstructed, or marginalized (ignored in score computation).…”
Section: Introductionmentioning
confidence: 99%