2022
DOI: 10.1101/2022.12.13.520253
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

ohun: an R package for diagnosing and optimizing automatic sound event detection

Abstract: Animal acoustic signals are widely used in diverse research areas due to the relative ease with which sounds can be registered across a wide range of taxonomic groups and research settings. However, bioacoustics research can quickly generate large data sets, which might prove challenging to analyze promptly. Although many tools are available for the automated detection of sounds, choosing the right approach can be difficult only a few tools provide a framework for evaluating detection performance. Here we pres… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
1
1

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(5 citation statements)
references
References 26 publications
0
5
0
Order By: Relevance
“…Sound files were first downsampled from 44.1 kHz to 19 kHz to improve computational efficiency. Contact calls were selected from original recordings in R v. 4.0.5 [ 38 ] by first using the ‘optimize_energy_detector’ function in the package ohun (v. 0.0.1; [ 39 ]) to apply optimized thresholds for sensitive detection of calls based on amplitude, frequency range, and duration parameters. Spectrograms were generated for a subset of these detections using the ‘spectrograms’ function in warbleR (v. 1.1.27; [ 40 ]) and then visually sorted as either ‘signal’ (contact calls) or ‘noise’ (cage noise, feather ruffling and other vocalization types).…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…Sound files were first downsampled from 44.1 kHz to 19 kHz to improve computational efficiency. Contact calls were selected from original recordings in R v. 4.0.5 [ 38 ] by first using the ‘optimize_energy_detector’ function in the package ohun (v. 0.0.1; [ 39 ]) to apply optimized thresholds for sensitive detection of calls based on amplitude, frequency range, and duration parameters. Spectrograms were generated for a subset of these detections using the ‘spectrograms’ function in warbleR (v. 1.1.27; [ 40 ]) and then visually sorted as either ‘signal’ (contact calls) or ‘noise’ (cage noise, feather ruffling and other vocalization types).…”
Section: Methodsmentioning
confidence: 99%
“…In addition to contact calls, budgerigars also produce a complex, highly variable and multi-syllabic song called warble, which contains elements that resemble contact calls in structure but differ in behavioural context [ 45 ]. Due to this high acoustic similarity, the random forest model included these contact call-like warble elements in its prediction of calls, requiring a quality control step in which annotations from predicted calls (formatted using the R package Rraven ; [ 46 ]) were visually inspected in Raven Pro v. 1.6 [ 47 ] so that warble elements could be removed, yielding a total of 49 573 confirmed contact calls out of the 150 534 predicted calls.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The acoustic features quantify the distribution of energy in the time and frequency domain, and the variation in dominant frequency across time (see electronic supplementary material, methods for details). Song acoustic space was quantified as the minimum spanning tree connecting all elements of the song in the overall acoustic space, using the R package PhenotypeSpace [65]. Four parameters describing song complexity were calculated for each song: element types, acoustic space, element transition diversity and between-song variation (electronic supplementary material, methods).…”
Section: (Iii) Song Complexitymentioning
confidence: 99%
“…(2021), using "R", version v.3.4.1. (52), the "Rraven" open source package(53), and RavenPro1.5(54).All bioacoustic data, aka stridulation events, were manually validated to avoid false positives/negatives and to eliminate other acoustic sources in the semi-natural conditions, such as bird vocalizations. Only stridulation activity patterns containing at least five consecutive days and nights of behavioral data were used for further analysis.Data processing and statistical analyses were conducted in Python version 3.PyCharm, JetBrains), SPSS version 21 (IBM Corp. Armonk, NY, USA), and Prism 8 (GraphPad Software, San Diego, California USA).…”
mentioning
confidence: 99%