2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU) 2015
DOI: 10.1109/asru.2015.7404813
|View full text |Cite
|
Sign up to set email alerts
|

An i-Vector PLDA based gender identification approach for severely distorted and multilingual DARPA RATS data

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
12
0

Year Published

2016
2016
2020
2020

Publication Types

Select...
6
1
1

Relationship

2
6

Authors

Journals

citations
Cited by 13 publications
(12 citation statements)
references
References 16 publications
0
12
0
Order By: Relevance
“…An analysis of the trained CNNs showed that depending upon the kernel width of the first convolution layer either formant information or both fundamental frequency and formant information is modeled by the CNNs for gender recognition. In recent years, features such as i-vectors have been used for gender recognition [10]. In [26], a comparison between the proposed CNN-based approach and ivector based approach has been investigated for identifying gender under noisy conditions.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…An analysis of the trained CNNs showed that depending upon the kernel width of the first convolution layer either formant information or both fundamental frequency and formant information is modeled by the CNNs for gender recognition. In recent years, features such as i-vectors have been used for gender recognition [10]. In [26], a comparison between the proposed CNN-based approach and ivector based approach has been investigated for identifying gender under noisy conditions.…”
Section: Resultsmentioning
confidence: 99%
“…Building on this point, typically in the literature [3,4,5,6,7,8], two broad classes of features are used for this task: fundamental frequency (F0) and short term features like mel frequency cepstrum coefficients (MFCCs). There are also works that have investigated high level representations like Gaussian mixture model supervector [9,8] and i-vectors [10].…”
Section: Introductionmentioning
confidence: 99%
“…Gender classification from speech is considered a solved problem on clean and monolingual corpora such as the TIMIT [6] speech corpus or distorted and multilingual corpora such as DARPA RATS [7]. However, differentiating the gender of children from speech is still challenging.…”
Section: Introductionmentioning
confidence: 99%
“…Being an important speaker-specific attribute, gender information is well preserved in the i-Vectors [14]. Most state-ofthe-art SID systems use a gender dependent approach where either part, or all of the SID pipeline can be gender dependent to obtain more competitive results.…”
Section: Introductionmentioning
confidence: 99%