2009 Ninth IEEE International Conference on Data Mining 2009
DOI: 10.1109/icdm.2009.65

Audio Classification of Bird Species: A Statistical Manifold Approach

Abstract: Our goal is to automatically identify which species of bird is present in an audio recording using supervised learning. Devising effective algorithms for bird species classification is a preliminary step toward extracting useful ecological data from recordings collected in the field. We propose a probabilistic model for audio features within a short interval of time, then derive its Bayes risk-minimizing classifier, and show that it is closely approximated by a nearest-neighbor classifier using Kullback-Leible…
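The abstract is cut off mid-sentence, so the paper's exact features and divergence direction are not visible here. As a rough illustration only, the following Python sketch shows one generic way a nearest-neighbor classifier over feature histograms with a Kullback-Leibler-style divergence could look. The function names, the additive smoothing, the use of quantized feature codes, and the direction KL(query || training) are all assumptions made for this sketch, not the authors' method.

```python
import math
from collections import Counter


def histogram(codes, n_bins, smoothing=1e-3):
    """Normalized histogram over integer codes 0..n_bins-1.

    Additive smoothing (an assumption of this sketch) keeps every bin
    strictly positive so the divergence below stays finite.
    """
    counts = Counter(codes)
    total = len(codes) + smoothing * n_bins
    return [(counts.get(b, 0) + smoothing) / total for b in range(n_bins)]


def kl_divergence(p, q):
    """Discrete divergence KL(p || q) = sum_i p_i * log(p_i / q_i)."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def nearest_neighbor_label(query, training):
    """Return the label of the training histogram closest to `query`.

    `training` is a list of (label, histogram) pairs. The direction
    KL(query || training) is an arbitrary choice for illustration.
    """
    best_label, _ = min(
        ((label, kl_divergence(query, hist)) for label, hist in training),
        key=lambda pair: pair[1],
    )
    return best_label


# Hypothetical toy data: each recording is a list of quantized feature codes.
training = [
    ("species_a", histogram([0, 0, 0, 1], n_bins=3)),
    ("species_b", histogram([2, 2, 2, 1], n_bins=3)),
]
query = histogram([0, 0, 1, 0, 0], n_bins=3)
predicted = nearest_neighbor_label(query, training)
```

Because the divergence is not symmetric, swapping the arguments of `kl_divergence` can change which neighbor is nearest; which direction is appropriate depends on the derivation in the full paper.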

Cited by: 64 publications (42 citation statements)
References: 19 publications
“…[29,30] BoW has become well-established in the field of computer vision, where it is also referred to as bag-of-features. Recently, it has been adapted for the classification of time series in audio and speech recognition, [31][32][33] electroencephalogram (EEG) and electrocardiogram (ECG) signals, [34] and time series similarity. [35] In these works, time series are treated as text documents and sections extracted from the time series as words.…”
Section: Bag-of-words Representation (mentioning)
confidence: 99%
“…Unlike the Fourier frequency-domain representation, MFCCs are measured in mels, a unit in which the frequency scale is laid out logarithmically so that the representation more closely matches human auditory perception [34].…”
Section: Representation Methods (unclassified)
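To make the mel scale mentioned in the statement above concrete, here is a small Python sketch of one widely used Hz-to-mel conversion, m = 2595 log10(1 + f/700). The citing paper does not say which variant it relies on, so this particular formula and the function names are assumptions chosen for illustration.

```python
import math


def hz_to_mel(f_hz):
    """Convert a frequency in Hz to mels using a common log-based formula."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


# Equal steps in Hz shrink in mels as frequency rises, which is the
# compression of high frequencies that the mel scale is meant to capture.
low_step = hz_to_mel(2000.0) - hz_to_mel(1000.0)
high_step = hz_to_mel(8000.0) - hz_to_mel(7000.0)
```

Under this formula the scale is close to linear at low frequencies and becomes increasingly logarithmic above roughly 1 kHz, so `high_step` comes out smaller than `low_step`.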
“…The classification of animal vocalizations has traditionally been relegated to qualitative approaches, which are very time-consuming and often biased by subjectivity [21]. In this review, the term detection is used for problems whose goal is to locate the point in time at which a target species emits a vocalization [4], [22], [23], [34], [35], [50]. The term classification is used for labeling a song [17], [21], [24], [30]; this is done with an algorithm known as a classifier, whose input is a set of values related to attributes or features of the object. The classifier must first be trained on a statistically representative data set.…”
Section: Classification Methods (unclassified)
“…Several methods have proven successful in correctly labeling the species of single birds in low-noise environments [4,1,2]. We propose a method of pre-processing and segmenting noisy field recordings of bird song, to isolate each bird syllable from the rest of the signal.…”
Section: Introduction (mentioning)
confidence: 99%