2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2018
DOI: 10.1109/icassp.2018.8461410
|View full text |Cite
|
Sign up to set email alerts
|

Birdvox-Full-Night: A Dataset and Benchmark for Avian Flight Call Detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
105
0
1

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 37 publications
(106 citation statements)
references
References 15 publications
0
105
0
1
Order By: Relevance
“…In this setting, the input to the system is a short audio clip, whose duration is equal to 150 ms. We represent this audio clip by a time-frequency representation E(t, f ). In the state-of-the-art model of [57], the matrix E(t, f ) contains the magnitudes in the mel-frequency spectrogram near time t and mel frequency f . The output to the system is a number y between 0 and 1, denoting the probability of presence of a sound event of interest.…”
Section: Methods Overviewmentioning
confidence: 99%
See 3 more Smart Citations
“…In this setting, the input to the system is a short audio clip, whose duration is equal to 150 ms. We represent this audio clip by a time-frequency representation E(t, f ). In the state-of-the-art model of [57], the matrix E(t, f ) contains the magnitudes in the mel-frequency spectrogram near time t and mel frequency f . The output to the system is a number y between 0 and 1, denoting the probability of presence of a sound event of interest.…”
Section: Methods Overviewmentioning
confidence: 99%
“…Yet, despite its simplicity and computational efficiency, such algorithms suffer from considerable shortcomings in detection accuracy, and may not be a reliable replacement for human inspection. In particular, a previous evaluation campaign showed that these detectors can exhibit precision and recall metrics both below 10% in a multi-sensor setting [57].…”
Section: Evidence Of Technical Bias In State-of-the-art Bioacoustic Dmentioning
confidence: 99%
See 2 more Smart Citations
“…However, due to the scarcity of the labeled data, only a few studies have addressed the task of vocalization segmentation and species identification using deep learning approaches. Lostanlen et al 20 released a bird flight call detection dataset along with a CNN-based benchmark. Salamon et al 21 experimentally showed that the late fusion of scores obtained from CNN (deep learning) and a random forest classifier (shallow learning) results in better performance for the task of bird species classification from flight calls.…”
Section: Introductionmentioning
confidence: 99%