2023
DOI: 10.1109/access.2023.3321798

Development of Parametric Filter Banks for Sound Feature Extraction

Xiangyu Cai,
Sunwoo Ko

Abstract: This paper proposes a family of learnable parametric filter banks. A parametric filter bank is obtained by selecting learnable parameters from an original filter bank and letting a neural network learn the values that adapt the bank to the current dataset. We use three types of filter banks: the popular Mel filter bank, the Gammatone filter bank, which mimics the response of the human auditory filter in the cochlea, and our own Gaussian filter bank. The performance ev…
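
To make the idea of a parametric filter bank concrete, here is a minimal sketch of a Gaussian filter bank whose center frequencies and bandwidths are the adjustable parameters. The parametrization, the function names (`gaussian_filter_bank`, `apply_bank`), and all numeric values are assumptions chosen for illustration; the truncated abstract does not state the paper's actual formulation. In a learnable setting the `centers` and `widths` lists would be updated by the network's optimizer rather than fixed by hand.

```python
import math

def gaussian_filter_bank(centers_hz, widths_hz, n_bins=257, sample_rate=16000):
    """Build one Gaussian frequency response per (center, width) pair.

    Returns a list of rows, one per filter, each with n_bins gains
    sampled on a linear frequency grid from 0 Hz to the Nyquist frequency.
    """
    nyquist = sample_rate / 2.0
    freqs = [k * nyquist / (n_bins - 1) for k in range(n_bins)]
    return [
        [math.exp(-0.5 * ((f - c) / w) ** 2) for f in freqs]
        for c, w in zip(centers_hz, widths_hz)
    ]

def apply_bank(bank, power_spectrum, eps=1e-10):
    """Weight a power spectrum by each filter and return log band energies."""
    return [
        math.log(sum(h * p for h, p in zip(row, power_spectrum)) + eps)
        for row in bank
    ]

# Hypothetical initial parameters (assumed values, not taken from the paper).
centers = [500.0, 1000.0, 2000.0, 4000.0]
widths = [100.0, 150.0, 250.0, 400.0]

bank = gaussian_filter_bank(centers, widths)

# A flat power spectrum stands in for one analysis frame of real audio.
flat_spectrum = [1.0] * 257
features = apply_bank(bank, flat_spectrum)
```

With a flat input spectrum, a wider filter collects more energy, so the log band energies grow with the bandwidth parameter. This is the kind of dependence a gradient-based learner would exploit when tuning the bank to a dataset.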

Citations: cited by 3 publications (2 citation statements)
References: 17 publications
“…As f_c/b is increased (for fixed order), the single peak splits and the maxima move outwards and eventually converges to ±f_c. Substituting all the derived terms in (22) we write the final expression of the power spectrum as…”
Section: Observation 2 the Fourier Transform Of The Gamma Distributio... (mentioning)
confidence: 99%
“…They demonstrated that GTFs are promising in terms of improving the robustness of ASR systems against noise compared to the Mel-Frequency Cepstral Coefficient (MFCC) and Perceptual Linear Prediction (PLP). GTF-based parametric filter banks have been proposed in [22] to detect speech. Three filter banks based on Mel, Gammatone, and Gaussian filters have been investigated in that work.…”
Section: Introduction (mentioning)
confidence: 99%