Interspeech 2019 2019
DOI: 10.21437/interspeech.2019-1857
|View full text |Cite
|
Sign up to set email alerts
|

Deep Learning for Orca Call Type Identification — A Fully Unsupervised Approach

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
2
1
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(7 citation statements)
references
References 0 publications
0
7
0
Order By: Relevance
“…An opportunity to avoid having to manually choose the right set of features and tune their settings is to use features learnt using deep representation learning (as opposed to handcrafted features like PAFs). Following this approach, auto-encoder artificial neural networks have been used by Goffinet et al [28] on mice and zebra finch vocalisations, by Bergler et al [29] to cluster orca calls, by Rowe et al [30] to cluster bird vocalisations by species, and by Tolkova et al [31] to discriminate between background noise and bird vocalisations.…”
Section: Vocalisations Feature Extraction and Clusteringmentioning
confidence: 99%
“…An opportunity to avoid having to manually choose the right set of features and tune their settings is to use features learnt using deep representation learning (as opposed to handcrafted features like PAFs). Following this approach, auto-encoder artificial neural networks have been used by Goffinet et al [28] on mice and zebra finch vocalisations, by Bergler et al [29] to cluster orca calls, by Rowe et al [30] to cluster bird vocalisations by species, and by Tolkova et al [31] to discriminate between background noise and bird vocalisations.…”
Section: Vocalisations Feature Extraction and Clusteringmentioning
confidence: 99%
“…Contrary to supervised classifier neural networks for which the bioacoustics community often uses off-the-shelf architecture [40], there is to our knowledge not yet an architecture of choice for auto-encoder networks. Bergler et al [29] use a ResNet-18 architecture, and Goffinet et al [28] use a custom architecture consisting in successive blocks of convolution, batch normalisation and rectifier linear units (ReLU). For the encoder, these successive blocks gradually increase the number of feature maps while decreasing the spectro-temporal dimensions.…”
Section: Auto-encoder Network Architecturementioning
confidence: 99%
“…An opportunity to avoid having to manually choose the right set of features and tune their settings is to use features learnt using deep representation learning (as opposed to handcrafted features like PAFs). Following this approach, auto-encoder artificial neural networks have been used by Goffinet et al [28] on mice and zebra finch vocalisations, by Bergler et al [29] to cluster orca calls and by Tolkova et al [30] to discriminate between background noise and bird vocalisations.…”
Section: Vocalisations Feature Extraction and Clusteringmentioning
confidence: 99%
See 2 more Smart Citations