2016
DOI: 10.1007/978-3-319-44636-3_4
|View full text |Cite
|
Sign up to set email alerts
|

How to Correctly Evaluate an Automatic Bioacoustics Classification Method

Abstract: In this work, we introduce a more appropriate (or alternative) approach to evaluate the performance and the generalization capabilities of a framework for automatic anuran call recognition. We show that, by using the common k-folds Cross-Validation (k-CV) procedure to evaluate the expected error in a syllable-based recognition system the recognition accuracy is overestimated. To overcome this problem, and to provide a fair evaluation, we propose a new CV procedure in which the specimen information is considere… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
6
0
1

Year Published

2017
2017
2024
2024

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 9 publications
(8 citation statements)
references
References 19 publications
1
6
0
1
Order By: Relevance
“…Each point corresponds to one sample from the mosquito wingbeat dataset in the plots, and the different colors denote the different classes. We employed 20 MFCCs and 12 LPCs to represent the mosquito wingbeat audio samples since these parameters are commonly used in literature [13], [33]. According to the t-SNE plots, LPC clusters are visually more compact but exhibiting many outliers; differently, the MFCC clusters appear more separable than LPC.…”
Section: Separability Of Mssa Dssc and Related Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Each point corresponds to one sample from the mosquito wingbeat dataset in the plots, and the different colors denote the different classes. We employed 20 MFCCs and 12 LPCs to represent the mosquito wingbeat audio samples since these parameters are commonly used in literature [13], [33]. According to the t-SNE plots, LPC clusters are visually more compact but exhibiting many outliers; differently, the MFCC clusters appear more separable than LPC.…”
Section: Separability Of Mssa Dssc and Related Methodsmentioning
confidence: 99%
“…The anuran dataset [33] consists of 60 recordings of 10 different species of frogs with varying record lengths collected under noise conditions. The number of records per species ranges from 3 to 11.…”
Section: A Datasetsmentioning
confidence: 99%
“…Mel-frequency cepstral coefficients (MFCCs) are features in the bioacoustics domain [10,24,25]. First, the STFT is calculated based on the raw signal.…”
Section: Mel-frequency Cepstral Coefficients (Mfccs)mentioning
confidence: 99%
“…Accordingly, this dataset was suitable for testing the proposed pattern recognition system. As there have been several works [18,25] already done on event detection itself, for the evaluation of the proposed recognition system, we assumed that the identified events were available to be input into the system. For the experiments, bird experts listened to the audio files and recorded the different events' start times and end times.…”
Section: Data Sourcementioning
confidence: 99%
“…Primeira etapa da avaliação: O experimento foi realizado em duas etapas, para poder avaliar o erro de generalização esperado pelo modelo de classificação (como recomendado em [Colonna et al 2016a]). A primeira etapa utiliza validação cruzada por áudio, onde a cada iteração, todos os frames pertencentes ao mesmo áudio de motosserra neste caso, são separados da classe positiva e utilizados como conjunto de teste.…”
Section: Metodologia Experimentalunclassified