2023
DOI: 10.1101/2023.03.10.531956
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Deep audio embeddings for vocalisation clustering

Abstract: The study of non-human animals' communication systems generally relies on the transcription of vocal sequences using a finite set of discrete units. This set is referred to as a vocal repertoire, which is specific to a species or a sub-group of a species. When conducted by human experts, the formal description of vocal repertoires can be laborious and / or biased. This motivates a computerised assistance for this procedure, for which machine learning algorithms represent a good opportunity. Unsupervised cluste… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 53 publications
0
2
0
Order By: Relevance
“…Autoencoders have been used across a broad spectrum of research in various species to analyze acoustic communication [44,45,30,46,47]. The popularity of autoencoders is partly due to their inherent ability to learn latent data structures in an unsupervised fashion, i.e., without requiring annotated datasets.…”
Section: Discussionmentioning
confidence: 99%
“…Autoencoders have been used across a broad spectrum of research in various species to analyze acoustic communication [44,45,30,46,47]. The popularity of autoencoders is partly due to their inherent ability to learn latent data structures in an unsupervised fashion, i.e., without requiring annotated datasets.…”
Section: Discussionmentioning
confidence: 99%
“…HDBSCAN has the particularity of allowing soft clustering, where each vocal unit is not assigned to a single cluster, but instead to all clusters with varying probabilities. Both UMAP and HDBSCAN have become state-of-the-art algorithms due to their performance and robustness [69,70].…”
Section: Methodsmentioning
confidence: 99%