2013 Fourth International Conference on Computing, Communications and Networking Technologies (ICCCNT) 2013
DOI: 10.1109/icccnt.2013.6850234
|View full text |Cite
|
Sign up to set email alerts
|

Content based audio retrieval with MFCC feature extraction, clustering and sort-merge techniques

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2015
2015
2023
2023

Publication Types

Select...
2
2
2

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 0 publications
0
3
0
Order By: Relevance
“…The Mel-frequency cepstrum has proven to be highly effective in recognizing structure of music signals and in modeling the subjective pitch and frequency content of audio signals [3]. MFCCs are based on the known variation of the human ears critical bandwidths with frequency, filters spaced linearly at low frequencies and logarithmically at high frequencies to capture the phonetically important characteristics of speech and audio [4].…”
Section: Mel-frequency Cepstral Coefficientsmentioning
confidence: 99%
See 1 more Smart Citation
“…The Mel-frequency cepstrum has proven to be highly effective in recognizing structure of music signals and in modeling the subjective pitch and frequency content of audio signals [3]. MFCCs are based on the known variation of the human ears critical bandwidths with frequency, filters spaced linearly at low frequencies and logarithmically at high frequencies to capture the phonetically important characteristics of speech and audio [4].…”
Section: Mel-frequency Cepstral Coefficientsmentioning
confidence: 99%
“…More precisely, for a given signal frame, it consists of a series of values, each one expressing the deviation of the signal's power spectrum from a flat shape inside a predefined frequency band. The first step of the ASF extraction is the calculation of the power spectrum of each signal frame as specified in Equation (3). In this case, the power coefficients P(k) are obtained from non-overlapping frames.…”
Section: Audio Spectrum Flatnessmentioning
confidence: 99%
“…As far as audio data is concerned, we compute the Mel Frequency Cepstral Coefficients (MFCCs) [73] corresponding to the sound of each footage segment. MFCCs have been successfully used for audio classification and retrieval schemes [74], [75] as they can represent the spectral properties of audio data in a compact fashion.…”
Section: Training Data Preparationmentioning
confidence: 99%