2010 20th International Conference on Pattern Recognition 2010
DOI: 10.1109/icpr.2010.1082
|View full text |Cite
|
Sign up to set email alerts
|

Auditory Features Revisited for Robust Speech Recognition

Abstract: Abstract

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2011
2011
2018
2018

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 8 publications
(2 citation statements)
references
References 10 publications
0
2
0
Order By: Relevance
“…In rows (1) and (2) Caranica et al [47] used supervised phoneme HMM tokens trained on a labeled Romanian corpus of 8.7 hours. The results of row (1) were trained with MFCCs, and row (2) with PNCCs [53]. Their systems are similar to the proposed approach here because we both use token HMMs.…”
Section: Spoken Term Detection Experimentsmentioning
confidence: 99%
“…In rows (1) and (2) Caranica et al [47] used supervised phoneme HMM tokens trained on a labeled Romanian corpus of 8.7 hours. The results of row (1) were trained with MFCCs, and row (2) with PNCCs [53]. Their systems are similar to the proposed approach here because we both use token HMMs.…”
Section: Spoken Term Detection Experimentsmentioning
confidence: 99%
“…The resulting feature set has been shown to lead to a better recognition accuracy than MFCC or RASTA-PLP processing in the presence of common additive noise types, and reverberation [113]. It was also heroded as the new feature set that is the most promising for robust ASR [108,109]. However the noise robustness of the PNCC has a downside because it has a lower performance (compared to MFCC or PLP) under clean speech conditions [168].…”
Section: Power Normalised Cepstral Coefficients (Pncc)mentioning
confidence: 99%