10th International Conference on Information Science, Signal Processing and Their Applications (ISSPA 2010) 2010
DOI: 10.1109/isspa.2010.5605491
|View full text |Cite
|
Sign up to set email alerts
|

Choice of Mel filter bank in computing MFCC of a resampled speech

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
25
0

Year Published

2011
2011
2020
2020

Publication Types

Select...
5
4
1

Relationship

0
10

Authors

Journals

citations
Cited by 56 publications
(25 citation statements)
references
References 5 publications
0
25
0
Order By: Relevance
“…Some of these are, for example, the specific type of task to be performed, the characteristics of the data, the application domain and the algorithmic and efficiency requirement ( Guyon et al, 2006 ). For instance, traditional choices of features in the context of IR are those obtained by the SIFT ( Lowe, 2004 ) and SURF ( Bay et al, 2008 ) algorithms, whereas mel-cepstral coefficients ( Davis and Mermelstein, 1980 ; Kopparapu and Laxminarayana, 2010 ) are typically chosen in speech recognition applications.…”
Section: Artificial Intelligence-based Prognostic and Health Managemementioning
confidence: 99%
“…Some of these are, for example, the specific type of task to be performed, the characteristics of the data, the application domain and the algorithmic and efficiency requirement ( Guyon et al, 2006 ). For instance, traditional choices of features in the context of IR are those obtained by the SIFT ( Lowe, 2004 ) and SURF ( Bay et al, 2008 ) algorithms, whereas mel-cepstral coefficients ( Davis and Mermelstein, 1980 ; Kopparapu and Laxminarayana, 2010 ) are typically chosen in speech recognition applications.…”
Section: Artificial Intelligence-based Prognostic and Health Managemementioning
confidence: 99%
“…The calculation of FBEs are as follows: where L is the length of the filter [ 29 ]. Also, is obtained using the proposed formula by [ 44 ] where is the discrete cosine transform. …”
Section: Appendix A1mentioning
confidence: 99%
“…For each frame, FamilyLog first calculates its energy spectrum from 80 Hz to 8 kHz with the Fast Fourier Transform (FFT) [32]. Then the resulting spectrum is transformed into 21 energy channels by applying Mel Filters [21][25][30]. The energy of channel i will be represented as e i hereafter.…”
Section: System Designmentioning
confidence: 99%